A drone precision landing framework using large language models

doi:10.13474/j.cnki.11-2246.2026.0220

Bulletin of Surveying and Mapping, 2026, Vol. 0, Issue (2): 126-130.

CHEN Lijun¹, CHEN Qing²

1. School of Software and Artificial Intelligence, Guangzhou University of Software, Guangzhou 510990, Guangdong, China;
2. School of Business Administration, Baise University, Baise 533000, Guangxi, China

Received: 2025-05-22  Published: 2026-03-12
About the author: CHEN Lijun (b. 1974), male, master's degree, associate professor; main research interest: artificial intelligence. E-mail: 372158286@qq.com
Funding: 2025 Guangzhou University of Software Research Project (KY202504)

Abstract

Abstract: In dynamic, unstructured environments, traditional UAV landing methods often fall short because their semantic perception is limited and they rely on fixed, context-insensitive safety factors. To address these limitations, this paper proposes LLM_Land, a hybrid framework that combines a large language model (LLM) with model predictive control (MPC). A vision-language encoder (VLE) such as BLIP first converts real-time images into concise textual scene descriptions. A lightweight LLM equipped with retrieval-augmented generation (RAG), such as Qwen 2.5 1.5B or LLaMA 3.2 1B, then processes these descriptions to classify scene elements and infer context-aware safety buffers, for example 3 m for pedestrians and 5 m for vehicles. The resulting semantic flags and unsafe zones are fed into the MPC module, which replans the trajectory in real time to avoid collisions while maintaining high landing accuracy. The framework is validated in the ROS-Gazebo simulator, where it consistently outperforms a conventional vision-based MPC baseline. The results show a substantial reduction in near-miss incidents caused by dynamic obstacles, with precise landing maintained in cluttered environments.

Key words: UAV semantic perception, autonomous landing, large language model, model predictive control
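
The hand-off from semantic classification to the controller can be illustrated with a minimal sketch. It is not the authors' implementation: the LLM and RAG stage is replaced here by a plain lookup table, and the MPC constraint is reduced to a geometric clearance check. The 3 m and 5 m radii come from the abstract; the names `SceneElement`, `UnsafeZone`, `unsafe_zones`, `clearance`, `trajectory_is_safe`, the circular zone shape, and the 2 m default buffer are assumptions made for illustration.

```python
import math
from dataclasses import dataclass

# Buffer radii in metres. The pedestrian and vehicle values are the examples
# quoted in the abstract; the default value is an assumption for this sketch.
SAFETY_BUFFER_M = {"pedestrian": 3.0, "vehicle": 5.0}
DEFAULT_BUFFER_M = 2.0  # assumed fallback for classes not listed above


@dataclass(frozen=True)
class SceneElement:
    """A classified object (hypothetical stand-in for the LLM stage output)."""
    label: str
    x: float  # position in a local ground frame, metres
    y: float


@dataclass(frozen=True)
class UnsafeZone:
    """A circular keep-out region around a scene element."""
    x: float
    y: float
    radius: float


def unsafe_zones(elements):
    """Map each classified element to a keep-out circle of its class radius."""
    return [
        UnsafeZone(e.x, e.y, SAFETY_BUFFER_M.get(e.label, DEFAULT_BUFFER_M))
        for e in elements
    ]


def clearance(px, py, zones):
    """Smallest signed distance from a point to any zone boundary.

    A negative value means the point lies inside at least one zone.
    """
    if not zones:
        return math.inf
    return min(math.hypot(px - z.x, py - z.y) - z.radius for z in zones)


def trajectory_is_safe(waypoints, zones):
    """Check that every (x, y) waypoint stays outside all keep-out circles."""
    return all(clearance(x, y, zones) >= 0.0 for x, y in waypoints)


# Example scene: one pedestrian and one vehicle near the landing area.
zones = unsafe_zones([
    SceneElement("pedestrian", 4.0, 0.0),
    SceneElement("vehicle", 0.0, 10.0),
])
```

In a full controller, the same zones would more plausibly enter the MPC problem as inequality constraints on the predicted states rather than as a pass/fail check after planning; the check above only shows how class-dependent buffers change which trajectories count as admissible.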

CLC number: P208, P23

Cite this article: CHEN Lijun, CHEN Qing. A drone precision landing framework using large language models[J]. Bulletin of Surveying and Mapping, 2026, 0(2): 126-130.
