Edge-Deployed Generative Language-Based Retrieval for Aerospace Asset Health Management

##plugins.themes.bootstrap3.article.main##

##plugins.themes.bootstrap3.article.sidebar##

Published Jul 3, 2026
Lukmon Rasaq Madhuri Siddula Om Prakash Yadav Rhonda Walthall Joseph Ensberg Piyush Yadav

Abstract

Aerospace asset health management increasingly relies on access to large volumes of maintenance documentation; however, operational environments are often constrained by limited connectivity and computational resources, restricting the use of cloud-based intelligence systems. This paper presents a fully offline, edge-deployable retrieval-augmented generation (RAG) framework for aerospace maintenance and prognostics using technical documentation. The framework integrates a locally hosted lightweight large language model with vector retrieval and cross-encoder reranking to support natural language querying of Airworthiness Directives (ADs) and maintenance records. Deployed on an NVIDIA Jetson Orin Nano 8GB device, the system performs document ingestion, indexing, retrieval, reranking, and response generation entirely on-device without cloud connectivity. Experimental evaluation on real aerospace maintenance documents demonstrates the ability to identify failure mechanisms, failure modes, root causes, affected components, and maintenance procedures described in FAA documentation. The cross-encoder reranking stage improves retrieval precision by refining semantically overlapping maintenance evidence prior to generation. The framework achieved an average fidelity score of 0.83, indicating that most generated responses remained grounded in retrieved FAA evidence. Across representative AD queries, the system achieved practical edge inference latency of approximately 4–10 seconds on the Jetson platform. The results demonstrate the feasibility of privacy-preserving, low-latency generative artificial intelligence for aerospace maintenance decision support on resource-constrained edge devices.

How to Cite

Rasaq, L., Siddula, M. ., Yadav, O. P. ., Walthall, R. ., Ensberg, J., & Yadav, P. (2026). Edge-Deployed Generative Language-Based Retrieval for Aerospace Asset Health Management. PHM Society European Conference, 9(1), 1–11. https://doi.org/10.36001/phme.2026.v9i1.5017
Abstract 0 | PDF Downloads 0

##plugins.themes.bootstrap3.article.details##

Keywords

Retrieval-Augmented Generation (RAG), Large Language Models (LLMs), Edge AI, Edge Computing, Aerospace Maintenance, Aerospace Asset Health Management, Cross-Encoder Reranking, FAISS Vector Retrieval, FAA Airworthiness Directives

References
Fu, S., & Avdelidis, N. P. (2023). Prognostic and health management of critical aircraft systems and components: An overview. Sensors, 23(19), 8124.

Payette, M., & Abdul-Nour, G. (2023). Asset management, reliability and prognostics modeling techniques. Sustainability, 15(9), 7493.

Hou, L., Jia, B., Xing, C., Chen, Z., & Du, Z. (2025). Applied research on an aircraft maintenance assistant based on a large language model. In Proceedings of the 2025 4th International Conference on Intelligent Systems, Communications and Computer Networks (pp. 1–7).

Chen, F., Wen, Z., & Liu, B. (2025). A question answering system for aerospace large language models based on knowledge graph and RAG collaboration. In Proceedings of the 2025 6th International Conference on Education, Knowledge and Information Management (pp. 446–455).

Yadav, S. (2024). AeroQuery RAG and LLM for aerospace query in designs, development, standards, certifications. In 2024 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT) (pp. 1–6). IEEE.

Seemakhupt, K., Liu, S., & Khan, S. (2024). EdgeRAG: Online-indexed RAG for edge devices. arXiv preprint arXiv:2412.21023.

Lee, J., Bang, J., Shim, K., Yang, S., & Chang, S. (2025). Chain-of-rank: Enhancing large language models for domain-specific RAG in edge device. In Findings of the Association for Computational Linguistics: NAACL 2025 (pp. 5601–5608).

Federal Aviation Administration. (2024). Airworthiness Directives Manual. Washington, DC: FAA.

Rasaq, L., Ferguson, K., Teasley, W., Xu, M., Blond, K. E., & Yadav, O. P. (2024). Sensor and maintenance strategy evaluation for Boeing 767 commercial fleets. In 2024 Annual Reliability and Maintainability Symposium (RAMS) (pp. 1–6). IEEE.

Karpukhin, V., Oguz, B., Min, S., Lewis, P., Wu, L., Edunov, S., ... Yih, W. T. (2020). Dense passage retrieval for open-domain question answering. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) (pp. 6769–6781).

Johnson, J., Douze, M., & Jégou, H. (2019). Billion-scale similarity search with GPUs. IEEE Transactions on Big Data, 7(3), 535–547.

Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., ... Kiela, D. (2020). Retrieval-augmented generation for knowledge-intensive NLP tasks. Advances in Neural Information Processing Systems, 33, 9459–9474.

Reimers, N., & Gurevych, I. (2019). Sentence-BERT: Sentence embeddings using Siamese BERT-networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) (pp. 3982–3992).

Touvron, H., Martin, L., Stone, K., Albert, P., Almahairi, A., Babaei, Y., Bashlykov, N., Batra, S., Bhargava, P., Bhosale, S., & Bikel, D. (2023). Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288.

Nogueira, R., & Cho, K. (2019). Passage re-ranking with BERT. arXiv preprint arXiv:1901.04085.

NVIDIA. (n.d.). NVIDIA Jetson Orin Nano 8GB Developer Kit. Retrieved from https://www.nvidia.com/en-us/autonomous-machines/embedded-systems/jetson-orin/nano-super-developer-kit/

Facebook BlenderBot. (n.d.). Retrieved from https://huggingface.co/facebook/blenderbot-400M-distill

Llama 3.2 3B. (n.d.). Retrieved from https://huggingface.co/meta-llama/Llama-3.2-3B

Cross-encoder/ms-marco-TinyBERT-L2-v2. (n.d.). Retrieved from https://huggingface.co/cross-encoder/ms-marco-TinyBERT-L2-v2

Sentence-transformers/all-MiniLM-L12-v2. (n.d.). Retrieved from https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2
Section
Technical Papers