Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/175269
Title: Comparative analysis of YOLO and transformers for pedestrian detection
Authors: Wong, Ying Xuan
Keywords: Computer and Information Science
Issue Date: 2024
Publisher: Nanyang Technological University
Source: Wong, Y. X. (2024). Comparative analysis of YOLO and transformers for pedestrian detection. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175269
Project: SCSE23-0720 
Abstract: This report aims to study and compare the performance of two state-of-the-art real-time object detectors – YOLOv8 (You Only Look Once, 8th version) and RT-DETR (Real-Time Detection Transformers) in tackling pedestrian detection. Throughout the report, both models were trained and evaluated on different pedestrian datasets, including TJU-DHD-Traffic, Caltech Pedestrian, KITTI, INRIA Person and Cityscapes. Besides, the performance of the integrated models between YOLOv8 and RT-DETR was also investigated. Thorough analyses were conducted, and it was concluded that YOLOv8 achieved a faster inference speed than RT-DETR regarding limited GPU resources. Besides, the integrated achieved comparable speed with YOLOv8, with accuracies comparable to or surpassing the RT-DETR models, highlighting the feasibility of integrating both detectors. Future work can include alternating integrated models to attain optimal results. Besides, tuning and experimenting on larger batch sizes shall also be included to conduct a more comprehensive comparison.
URI: https://hdl.handle.net/10356/175269
Schools: School of Computer Science and Engineering 
Fulltext Permission: restricted
Fulltext Availability: With Fulltext
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)

Files in This Item:
File Description SizeFormat 
FYP_Report.pdf
  Restricted Access
1.51 MBAdobe PDFView/Open

Page view(s)

164
Updated on Mar 16, 2025

Download(s)

6
Updated on Mar 16, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.