Edge-Deployed Generative Language-Based Retrieval for Aerospace Asset Health Management
##plugins.themes.bootstrap3.article.main##
##plugins.themes.bootstrap3.article.sidebar##
Abstract
Aerospace asset health management increasingly relies on access to large volumes of maintenance documentation; however, operational environments are often constrained by limited connectivity and computational resources, restricting the use of cloud-based intelligence systems. This paper presents a fully offline, edge-deployable retrieval-augmented generation (RAG) framework for aerospace maintenance and prognostics using technical documentation. The framework integrates a locally hosted lightweight large language model with vector retrieval and cross-encoder reranking to support natural language querying of Airworthiness Directives (ADs) and maintenance records. Deployed on an NVIDIA Jetson Orin Nano 8GB device, the system performs document ingestion, indexing, retrieval, reranking, and response generation entirely on-device without cloud connectivity. Experimental evaluation on real aerospace maintenance documents demonstrates the ability to identify failure mechanisms, failure modes, root causes, affected components, and maintenance procedures described in FAA documentation. The cross-encoder reranking stage improves retrieval precision by refining semantically overlapping maintenance evidence prior to generation. The framework achieved an average fidelity score of 0.83, indicating that most generated responses remained grounded in retrieved FAA evidence. Across representative AD queries, the system achieved practical edge inference latency of approximately 4–10 seconds on the Jetson platform. The results demonstrate the feasibility of privacy-preserving, low-latency generative artificial intelligence for aerospace maintenance decision support on resource-constrained edge devices.
How to Cite
##plugins.themes.bootstrap3.article.details##
Retrieval-Augmented Generation (RAG), Large Language Models (LLMs), Edge AI, Edge Computing, Aerospace Maintenance, Aerospace Asset Health Management, Cross-Encoder Reranking, FAISS Vector Retrieval, FAA Airworthiness Directives
Payette, M., & Abdul-Nour, G. (2023). Asset management, reliability and prognostics modeling techniques. Sustainability, 15(9), 7493.
Hou, L., Jia, B., Xing, C., Chen, Z., & Du, Z. (2025). Applied research on an aircraft maintenance assistant based on a large language model. In Proceedings of the 2025 4th International Conference on Intelligent Systems, Communications and Computer Networks (pp. 1–7).
Chen, F., Wen, Z., & Liu, B. (2025). A question answering system for aerospace large language models based on knowledge graph and RAG collaboration. In Proceedings of the 2025 6th International Conference on Education, Knowledge and Information Management (pp. 446–455).
Yadav, S. (2024). AeroQuery RAG and LLM for aerospace query in designs, development, standards, certifications. In 2024 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT) (pp. 1–6). IEEE.
Seemakhupt, K., Liu, S., & Khan, S. (2024). EdgeRAG: Online-indexed RAG for edge devices. arXiv preprint arXiv:2412.21023.
Lee, J., Bang, J., Shim, K., Yang, S., & Chang, S. (2025). Chain-of-rank: Enhancing large language models for domain-specific RAG in edge device. In Findings of the Association for Computational Linguistics: NAACL 2025 (pp. 5601–5608).
Federal Aviation Administration. (2024). Airworthiness Directives Manual. Washington, DC: FAA.
Rasaq, L., Ferguson, K., Teasley, W., Xu, M., Blond, K. E., & Yadav, O. P. (2024). Sensor and maintenance strategy evaluation for Boeing 767 commercial fleets. In 2024 Annual Reliability and Maintainability Symposium (RAMS) (pp. 1–6). IEEE.
Karpukhin, V., Oguz, B., Min, S., Lewis, P., Wu, L., Edunov, S., ... Yih, W. T. (2020). Dense passage retrieval for open-domain question answering. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) (pp. 6769–6781).
Johnson, J., Douze, M., & Jégou, H. (2019). Billion-scale similarity search with GPUs. IEEE Transactions on Big Data, 7(3), 535–547.
Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., ... Kiela, D. (2020). Retrieval-augmented generation for knowledge-intensive NLP tasks. Advances in Neural Information Processing Systems, 33, 9459–9474.
Reimers, N., & Gurevych, I. (2019). Sentence-BERT: Sentence embeddings using Siamese BERT-networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) (pp. 3982–3992).
Touvron, H., Martin, L., Stone, K., Albert, P., Almahairi, A., Babaei, Y., Bashlykov, N., Batra, S., Bhargava, P., Bhosale, S., & Bikel, D. (2023). Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288.
Nogueira, R., & Cho, K. (2019). Passage re-ranking with BERT. arXiv preprint arXiv:1901.04085.
NVIDIA. (n.d.). NVIDIA Jetson Orin Nano 8GB Developer Kit. Retrieved from https://www.nvidia.com/en-us/autonomous-machines/embedded-systems/jetson-orin/nano-super-developer-kit/
Facebook BlenderBot. (n.d.). Retrieved from https://huggingface.co/facebook/blenderbot-400M-distill
Llama 3.2 3B. (n.d.). Retrieved from https://huggingface.co/meta-llama/Llama-3.2-3B
Cross-encoder/ms-marco-TinyBERT-L2-v2. (n.d.). Retrieved from https://huggingface.co/cross-encoder/ms-marco-TinyBERT-L2-v2
Sentence-transformers/all-MiniLM-L12-v2. (n.d.). Retrieved from https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2

This work is licensed under a Creative Commons Attribution 3.0 Unported License.
The Prognostic and Health Management Society advocates open-access to scientific data and uses a Creative Commons license for publishing and distributing any papers. A Creative Commons license does not relinquish the author’s copyright; rather it allows them to share some of their rights with any member of the public under certain conditions whilst enjoying full legal protection. By submitting an article to the International Conference of the Prognostics and Health Management Society, the authors agree to be bound by the associated terms and conditions including the following:
As the author, you retain the copyright to your Work. By submitting your Work, you are granting anybody the right to copy, distribute and transmit your Work and to adapt your Work with proper attribution under the terms of the Creative Commons Attribution 3.0 United States license. You assign rights to the Prognostics and Health Management Society to publish and disseminate your Work through electronic and print media if it is accepted for publication. A license note citing the Creative Commons Attribution 3.0 United States License as shown below needs to be placed in the footnote on the first page of the article.
First Author et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 United States License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.