Real-time parallel image processing applications on multicore CPUs with OpenMP and GPGPU with CUDA


Aydin S., SAMET R., BAY Ö. F.

JOURNAL OF SUPERCOMPUTING, cilt.74, sa.6, ss.2255-2275, 2018 (SCI-Expanded) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 74 Sayı: 6
  • Basım Tarihi: 2018
  • Doi Numarası: 10.1007/s11227-017-2168-6
  • Dergi Adı: JOURNAL OF SUPERCOMPUTING
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Sayfa Sayıları: ss.2255-2275
  • Anahtar Kelimeler: Parallel computing, Real-time image processing, Image segmentation, Thresholding, Multicore programming, GPU programming, TREE INTERCONNECTION NETWORK, SEGMENTATION, EXTRACTION, ALGORITHM
  • Gazi Üniversitesi Adresli: Evet

Özet

This paper presents real-time image processing applications using multicore and multiprocessing technologies. To this end, parallel image segmentation was performed on many images covering the entire surface of the same metallic and cylindrical moving objects. Experimental results on multicore CPU with OpenMP platform showed that by increasing the chunk size, the execution time decreases approximately four times in comparison with serial computing. The same experiments were implemented on GPGPU using four techniques: (1) Single image transmission with single pixel processing; (2) Single image transmission with multiple pixel processing; (3) Multiple image transmission with single pixel processing; and (4) Multiple image transmission with multiple pixel processing. All techniques were implemented on GeForce, Tesla K20 and Tesla K40. Experimental results of GPU with CUDA platform showed that by increasing the core number speedup is increased. Tesla K40 gave the best results of 35 and 12 (for the first technique), 36 and 13 (for the second technique), 54 and 16 (for the third technique), 71 and 17 (for the fourth technique) times improvement without and with data transmission time in comparison with serial computing. As a result, users are suggested to use Tesla K40 GPU and Multiple image transmission with multiple pixel processing to get the maximum performance.