Bulletin of Surveying and Mapping ›› 2024, Vol. 0 ›› Issue (11): 61-67.doi: 10.13474/j.cnki.11-2246.2024.1111

Previous Articles    

Improved object detection algorithm HCAM-YOLO in traffic scenes based on YOLOv5

WANG Zhitao, ZHANG Ruiju, WANG Jian, ZHAO Jiaxing, LIU Yantao   

  1. School of Geomatics and Urban Spatial Information, Beijing University of Civil Engineering and Architecture, Beijing 102600, China
  • Received:2024-03-21 Published:2024-12-05

Abstract: The rapid and precise detection of targets in traffic scenarios is crucial for intelligent traffic management and driving path decision-making. Traditional target detection models often grapple with issues such as inadequate detection accuracy, high rates of leak detection and false detection due to the complexity and variability of the traffic environment, and the diversity and sparsity of target features. To address these challenges, this paper introduces a YOLOv5 target detection model, HCAM-YOLO, which leverages the HcPAN feature fusion network. The crux of this approach lies in addressing the issue of local information being easily lost during the PAN network's feature fusion process. A hybrid convolutional attention mechanism(HCAM) is designed to enhance multi-scale information extraction in feature fusion networks. By integrating the HCAM module into the PAN's underlying structure, the sensitivity of key local features is enhanced, while the fusion effect of deep semantic information and shallow positional data is strengthened. This method's novelty lies in its use of an attention mechanism to optimize the feature fusion process, thereby improving the model's detection performance of pedestrians, motor vehicles, and other targets in complex traffic environments. The experimental dataset comprises the Rope 3D dataset, Road Veh dataset, and Road Ped dataset. The results demonstrate that the HCAM module is more suitable for integration into the underlying PAN network than other attention mechanisms. When compared to the basic YOLOv5 model, the precision and recall of the final HCAM-YOLO algorithm model increased by 3.4% and 3.2%,respectively, and mAP@0.5/% by 3.8%. The HCAM-YOLO algorithm model proposed in this paper exhibits strong adaptability to target detection tasks in traffic scenes with complex backgrounds.

Key words: object detection, traffic scene, feature enhancement, attention mechanism, YOLOv5

CLC Number: