Object Detection with Low Capacity GPU Systems Using Improved Faster R-CNN

Korez, Atakan; BARIŞÇI, NECAATTİN

doi:10.3390/app10010083

Object Detection with Low Capacity GPU Systems Using Improved Faster R-CNN

Atıf İçin Kopyala

Korez A., BARIŞÇI N.

APPLIED SCIENCES-BASEL, cilt.10, sa.1, 2020 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 10 Sayı: 1
Basım Tarihi: 2020
Doi Numarası: 10.3390/app10010083
Dergi Adı: APPLIED SCIENCES-BASEL
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Aerospace Database, Agricultural & Environmental Science Database, Applied Science & Technology Source, Communication Abstracts, INSPEC, Metadex, Directory of Open Access Journals, Civil Engineering Abstracts
Anahtar Kelimeler: deep learning, object detection, remote sensing, deformable convolutional network, weight standardization
Gazi Üniversitesi Adresli: Evet

Özet

Object detection in remote sensing images has been frequently used in a wide range of areas such as land planning, city monitoring, traffic monitoring, and agricultural applications. It is essential in the field of aerial and satellite image analysis but it is also a challenge. To overcome this challenging problem, there are many object detection models using convolutional neural networks (CNN). The deformable convolutional structure has been introduced to eliminate the disadvantage of the fixed grid structure of the convolutional neural networks. In this study, a multi-scale Faster R-CNN method based on deformable convolution is proposed for single/low graphics processing unit (GPU) systems. Weight standardization (WS) is used instead of batch normalization (BN) to make the proposed model more efficient for a small batch size (1 img/per GPU) on single GPU systems. Experiments were conducted on the publicly available 10-class geospatial object detection (NWPU-VHR 10) dataset to evaluate the object detection performance of the proposed model. Experiment results show that our model achieved a 92.3 mAP. This is a 1.7% mAP increase when compared to the best results in the models using the same dataset.