测绘通报 ›› 2026, Vol. 0 ›› Issue (2): 68-73,80.doi: 10.13474/j.cnki.11-2246.2026.0211

• 学术研究 • 上一篇    下一篇

基于改进YOLOv8s的无人机影像车辆目标检测算法

滕敏1, 张博2, 徐佳伟2, 林聪2,3, 沈雨2,3, 储征伟2,3   

  1. 1. 连云港职业技术学院, 江苏 连云港 222000;
    2. 南京市测绘勘察研究院股份有限公司, 江苏 南京 210019;
    3. 自然资源部国土环境与灾害监测重点实验室, 江苏 徐州 221116
  • 收稿日期:2025-05-15 发布日期:2026-03-12
  • 通讯作者: 林聪。E-mail:lcnju1994@163.com
  • 作者简介:滕敏(1973—),女,硕士,副教授,主要研究方向为大数据分析。E-mail:13709106@qq.com
  • 基金资助:
    江苏省前沿技术研发计划(BF2024009);测绘股份科技项目(2023RD02)

Vehicle object detection approach in drone imagery based on improved YOLOv8s

TENG Min1, ZHANG Bo2, XU Jiawei2, LIN Cong2,3, SHEN Yu2,3, CHU Zhengwei2,3   

  1. 1. LianyungangTechnical College, Lianyungang 222000, China;
    2. Nanjing Research Institute of Surveying, Mapping and Geotechnical Investigation, Co., Ltd., Nanjing 210019, China;
    3. Key Laboratory of Land Environment and Disaster Monitoring of Natural Resources, Xuzhou 221116, China
  • Received:2025-05-15 Published:2026-03-12

摘要: 准确、实时的车辆检测跟踪结果是交通流估算、智慧交通管理的重要数据支撑,无人机影像已经成为车辆检测任务的重要数据源。针对现有YOLO模型在复杂场景中识别小目标的能力弱、无人机车辆检测任务数据集匮乏等问题,本文提出了面向车辆检测任务的YOLOv8s-VOD模型,并制作开源数据集NJVOD。该方法通过构建C2F-PTB和BiFPN-GLSA模块,实现了全局-局部特征的协同提取及多尺度语义-边缘信息的有效融合,在降低网络复杂度的前提下,提高了检测精度。试验结果表明,YOLOv8s-VOD在最小参数量的情况下达到了最高的检测精度,相较于已有方法,在VEDAI和NJVOD数据集上分别提升了2.4~12.2个百分点和4.1~5.3个百分点。本文提出的C2F-PTB、BiFPN-GLSA模块均能有效提升小目标检测精度,制作的NJVOD数据集对相关研究工作有重要的支撑作用。

关键词: YOLOv8, 无人机影像, 车辆检测, 注意力机制, 特征融合

Abstract: Accurate and real-time vehicle detection and tracking provide crucial data support for traffic flow estimation and intelligent traffic management.Drone imagery has emerged as a vital data source for vehicle detection tasks.To address the weak ability of existing YOLO models to detect small objects within complex scenarios and the scarcity of open-source datasets for drone vehicle detection,this paper proposes the YOLOv8s-VOD model specifically designed for vehicle detection tasks,and introduces the open-source dataset NJVOD.This method constructs C2f-PTB and BiFPN-GLSA modules to achieve collaborative extraction of global-local featuresand effective fusion of multi-scale semantic and edge information,thereby improving detection accuracywhile reducing network complexity.Experimental results show that YOLOv8s-VOD achieves the highest detection accuracy with minimal parameters,outperforming existing methods by 2.4~12.2 percentage poin on the VEDAI dataset and 4.1~5.3 percentage point on the NJVOD dataset;The C2f-PTB and BiFPN-GLSA modules proposed in this work both effectively improve small-object detection accuracy.Additionally,the newly created NJVOD dataset offers crucial support for related research.

Key words: YOLOv8, drone imagery, vehicle detection, attention mechanism, feature fusion

中图分类号: