Implementasi EfficientDet untuk Mendeteksi Objek Bergerak di Atas Konveyor Peraga

TARIQ FITRIA AZIZ

TARIQ FITRIA AZIZ, Enas Dhuhri Kusuma, S.T., M.Eng.; Ir. Sujoko Sumaryono, M.T.

2023 | Skripsi | TEKNIK ELEKTRO

Abstrak
File Pdf

Teknologi visi komputer telah banyak diaplikasikan di industri tetapi tidak banyak ditemukan di laboratorium tempat mahasiswa memperoleh edukasi. Untuk menghindari celah pengetahuan antara universitas dan industri, sebuah alat peraga yang menjalankan fungsi sejenis perlu dirancang. Alat peraga yang dimaksud adalah detektor objek berbasis Single-Board Computer (SBC) yang ditanam sebuah model atau algoritma visi komputer. Model ini akan memetakan gambar masukan menjadi nilai prediksi kelas dan koordinat kotak batas (bounding box) sehingga diperoleh nama objek dan lokasinya pada citra. Solusi yang dipilih adalah Convolutional Neural Network (CNN) dengan perangkat keras SBC dan kamera. Gabungan dari software dan hardware ini diharapkan memenuhi spesifikasi: akurasi 0,4133 mean Average Precision (mAP), konsumsi energi 4,325 Watt-jam, dan kecepatan inferensi 15 frame per second (FPS).

Implementasi rancangan produk dilakukan dengan mengikuti tiga tahapan: persiapan/manajemen data, pengembangan model, dan deployment. Dari proses tersebut, tiga luaran diperoleh: perangkat dan program deteksi berbasis Raspberry Pi 4, model deteksi berbasis EfficientDet-Lite, serta dataset latih dalam format PASCAL VOC XML. Program yang dikembangkan mampu mengeluarkan pesan di terminal, visualisasi objek di konveyor, dan visualisasi data deteksi. Deteksi dilakukan dengan mengikuti tiga skenario, tiga variasi kecepatan objek, dan empat jenis resolusi citra sehingga diperoleh 108 kasus deteksi. Dari hasil pengujian, rancangan yang dibuat mampu memenuhi spesifikasi: akurasi deteksi 0,315-0,502 mAP, konsumsi energi 6,5 Watt-jam, dan kecepatan inferensi 1-7 FPS. Pengujian menunjukkan bahwa kinerja deteksi akan lebih baik apabila konveyor diturunkan kecepatannya dan objek yang dipilih berbeda warna dari konveyor. Peningkatan resolusi tidak akan memperbaiki kinerja deteksi. Ukuran model yang lebih besar dapat memberikan peningkatan kinerja tetapi tidak menjamin hasil optimal.

Computer vision technology has been widely applied in industry but is not found in laboratories where students receive their education. To avoid the knowledge gap between universities and industry, a trainer that performs similar functions needs to be designed. The trainer is a Single-Board Computer (SBC) based object detector embedded with a computer vision model or algorithm. This model will map the input image into predicted class values and bounding box coordinates to obtain the name of the object and its location in the image. The chosen solution is a Convolutional Neural Network (CNN) with SBC and camera hardware. The combination of software and hardware is expected to meet the specifications: 0.4133 mean Average Precision (mAP) accuracy, 4.325 Watt-hours energy consumption, and 15 frames per second (FPS) inference speed.

The implementation of the product design was carried out following three stages: data preparation/management, model development, and deployment. From the process, three outputs were obtained: a Raspberry Pi 4-based detection device and program, an EfficientDet-Lite-based detection model, and a training dataset in PASCAL VOC XML format. The developed program is capable of outputting messages on the terminal, visualization of objects on the conveyor, and visualization of detection data. Detection is performed by following three scenarios, three variations of object speed, and four types of image resolution, resulting in 108 detection cases. From the test results, the design was able to meet the specifications: 0.315-0.502 mAP detection accuracy, 6.5 Watt-hours energy consumption, and 1-7 FPS inference speed. The tests show that the detection performance will be better if the conveyor is reduced in speed and the selected object is a different color from the conveyor. Increasing the resolution will not improve the detection performance. Larger model size may provide improved performance but does not guarantee optimal results.

Kata Kunci : EfficientDet, Computer vision, Object detection, Raspberry Pi, Manufacturing automation

S1-2023-424984-abstract.pdf
S1-2023-424984-bibliography.pdf
S1-2023-424984-tableofcontent.pdf
S1-2023-424984-title.pdf

LAYANAN

E-Resources

Quick Access