Synthetic Data Generation for Computer Vision Tasks in SITL Systems with Unreal Engine and Airsim

Authors

DOI:

https://doi.org/10.15407/intechsys.2026.02.006

Keywords:

synthetic data, computer vision, deep learning, robotics, SITL, Unreal Engine, AirSim, YOLO

Abstract

Sustainable object detection, classification, and tracking are a critical path in robotics at per ception leg. Deep learning models used for detection require vast and diverse datasets with accurate annotations. However, acquiring real-world data that captures rare, complex, or hazardous scenarios is both time-consuming and costly. To address this limitation, synthetic data generation using virtual environments has emerged as a promising alternative. This research investigates the application of synthetic datasets, created within high-fidelity 3D vir tual reality environments, for training deep learning models in object detection tasks. The proposed method leverages tools such as Unreal Engine and procedural scripting to generate large volumes of realistic, annotated image data. These synthetic scenes are carefully con structed to include various object types, dynamic interactions, lighting variations, occlusions, and edge-case scenarios that are often missing from real-world datasets. The study employs the YOLO family of deep learning models, which are known for their high accuracy and low latency, making them suitable for real-time robotic applications. Experimental results de mon strate that models trained exclusively on synthetic data demonstrate partial transferability to real-world scenarios, while the addition of a small fraction of real data significantly improves performance. Key advantages include reduced development time, lower annotation costs, im proved coverage of rare scenarios, and enhanced model generalization. The integration of syn thetic data pipelines into the ML lifecycle also facilitates better experiment management and scalability. The research concludes that synthetic data is a powerful enabler for developing robust object detection systems in robotics and AI. Future work will focus on narrowing the domain gap between synthetic and real-world imagery, enhancing realism with photometric effects, and expanding the dataset to include additional object classes and behaviors for multi-task learning.

References

Shah S., Dey D., Lovett C., Kapoor A. AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles. Field and Service Robotics, 2018, Vol. 5, 621–635. https://doi.org/10.1007/978-3-319-67361-5_40

Laux L., Schirmer S., Schopferer S., Dauer J.C. Build Your Own Training Data – Synthetic Data for Object Detection in Aerial Images. In: Software Engineering 2022 Workshops, Bonn, 2022, 182–190. https://doi.org/10.18420/se2022-ws-18

Kiefer B., Ott D., Zell A. Leveraging Synthetic Data in Object Detection on Unmanned Aerial Vehicles. 26th International Conference on Pattern Recognition (ICPR), Montreal, QC, Canada, 2022, 3564–3571. https://doi.org/10.1109/ICPR56361.2022.9956710

Barisic A., Petric F., Bogdan S. Sim2Air – Synthetic Aerial Dataset for UAV Monitoring. Sensors, 2022 , Vol. 22 (14), Article 5074. https://doi.org/10.3390/s22145074

Maxey C., Choi J., Lee H., Manocha D., Kwon H. UAV-Sim: NeRF-based Synthetic Data Generation for UAV-based Perception. arXiv preprint, 2023, arXiv:2310.16255. https://doi.org/10.48550/arXiv.2310.16255

Collins T., Waggoner J., Stanciulescu B., et all. Scalable Modular Synthetic Data Generation for Advancing Aerial Autonomy. arXiv preprint, 2022, arXiv:2211.05335. https://doi.org/10.48550/arXiv.2211.05335

Downloads

Published

2026-06-01

How to Cite

Ryabokon, D. (2026). Synthetic Data Generation for Computer Vision Tasks in SITL Systems with Unreal Engine and Airsim. Information Technologies and Systems, 8(2), 6–17. https://doi.org/10.15407/intechsys.2026.02.006

Issue

Section

Computer Vision and Pattern Recognition