Edge-Deployed Generative Language-Based Retrieval for Aerospace  Asset Health Management

Lukmon Rasaq; Madhuri  Siddula; Om Prakash  Yadav; Rhonda  Walthall; Joseph  Ensberg; Piyush  Yadav

doi:10.36001/phme.2026.v9i1.5017

Edge-Deployed Generative Language-Based Retrieval for Aerospace Asset Health Management

PDF

Published Jul 3, 2026

DOI https://doi.org/10.36001/phme.2026.v9i1.5017

Lukmon Rasaq

North Carolina A&T State University

Madhuri Siddula

North Carolina A&T State University

Om Prakash Yadav

North Carolina A&T State University

Rhonda Walthall

Collins Aerospace

Joseph Ensberg

Collins Aerospace

Piyush Yadav

Collins Aerospace

Abstract

Aerospace asset health management increasingly relies on access to large volumes of maintenance documentation; however, operational environments are often constrained by limited connectivity and computational resources, restricting the use of cloud-based intelligence systems. This paper presents a fully offline, edge-deployable retrieval-augmented generation (RAG) framework for aerospace maintenance and prognostics using technical documentation. The framework integrates a locally hosted lightweight large language model with vector retrieval and cross-encoder reranking to support natural language querying of Airworthiness Directives (ADs) and maintenance records. Deployed on an NVIDIA Jetson Orin Nano 8GB device, the system performs document ingestion, indexing, retrieval, reranking, and response generation entirely on-device without cloud connectivity. Experimental evaluation on real aerospace maintenance documents demonstrates the ability to identify failure mechanisms, failure modes, root causes, affected components, and maintenance procedures described in FAA documentation. The cross-encoder reranking stage improves retrieval precision by refining semantically overlapping maintenance evidence prior to generation. The framework achieved an average fidelity score of 0.83, indicating that most generated responses remained grounded in retrieved FAA evidence. Across representative AD queries, the system achieved practical edge inference latency of approximately 4–10 seconds on the Jetson platform. The results demonstrate the feasibility of privacy-preserving, low-latency generative artificial intelligence for aerospace maintenance decision support on resource-constrained edge devices.

How to Cite

Rasaq, L., Siddula, M. ., Yadav, O. P. ., Walthall, R. ., Ensberg, J., & Yadav, P. (2026). Edge-Deployed Generative Language-Based Retrieval for Aerospace Asset Health Management. PHM Society European Conference, 9(1), 1–11. https://doi.org/10.36001/phme.2026.v9i1.5017

Abstract 67 | PDF Downloads 40

Keywords

Retrieval-Augmented Generation (RAG), Large Language Models (LLMs), Edge AI, Edge Computing, Aerospace Maintenance, Aerospace Asset Health Management, Cross-Encoder Reranking, FAISS Vector Retrieval, FAA Airworthiness Directives

References

Fu, S., & Avdelidis, N. P. (2023). Prognostic and health management of critical aircraft systems and components: An overview. Sensors, 23(19), 8124.

Payette, M., & Abdul-Nour, G. (2023). Asset management, reliability and prognostics modeling techniques. Sustainability, 15(9), 7493.

Hou, L., Jia, B., Xing, C., Chen, Z., & Du, Z. (2025). Applied research on an aircraft maintenance assistant based on a large language model. In Proceedings of the 2025 4th International Conference on Intelligent Systems, Communications and Computer Networks (pp. 1–7).

Chen, F., Wen, Z., & Liu, B. (2025). A question answering system for aerospace large language models based on knowledge graph and RAG collaboration. In Proceedings of the 2025 6th International Conference on Education, Knowledge and Information Management (pp. 446–455).

Yadav, S. (2024). AeroQuery RAG and LLM for aerospace query in designs, development, standards, certifications. In 2024 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT) (pp. 1–6). IEEE.

Seemakhupt, K., Liu, S., & Khan, S. (2024). EdgeRAG: Online-indexed RAG for edge devices. arXiv preprint arXiv:2412.21023.

Lee, J., Bang, J., Shim, K., Yang, S., & Chang, S. (2025). Chain-of-rank: Enhancing large language models for domain-specific RAG in edge device. In Findings of the Association for Computational Linguistics: NAACL 2025 (pp. 5601–5608).

Federal Aviation Administration. (2024). Airworthiness Directives Manual. Washington, DC: FAA.

Rasaq, L., Ferguson, K., Teasley, W., Xu, M., Blond, K. E., & Yadav, O. P. (2024). Sensor and maintenance strategy evaluation for Boeing 767 commercial fleets. In 2024 Annual Reliability and Maintainability Symposium (RAMS) (pp. 1–6). IEEE.

Karpukhin, V., Oguz, B., Min, S., Lewis, P., Wu, L., Edunov, S., ... Yih, W. T. (2020). Dense passage retrieval for open-domain question answering. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) (pp. 6769–6781).

Johnson, J., Douze, M., & Jégou, H. (2019). Billion-scale similarity search with GPUs. IEEE Transactions on Big Data, 7(3), 535–547.

Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., ... Kiela, D. (2020). Retrieval-augmented generation for knowledge-intensive NLP tasks. Advances in Neural Information Processing Systems, 33, 9459–9474.

Reimers, N., & Gurevych, I. (2019). Sentence-BERT: Sentence embeddings using Siamese BERT-networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) (pp. 3982–3992).

Touvron, H., Martin, L., Stone, K., Albert, P., Almahairi, A., Babaei, Y., Bashlykov, N., Batra, S., Bhargava, P., Bhosale, S., & Bikel, D. (2023). Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288.

Nogueira, R., & Cho, K. (2019). Passage re-ranking with BERT. arXiv preprint arXiv:1901.04085.

NVIDIA. (n.d.). NVIDIA Jetson Orin Nano 8GB Developer Kit. Retrieved from https://www.nvidia.com/en-us/autonomous-machines/embedded-systems/jetson-orin/nano-super-developer-kit/

Facebook BlenderBot. (n.d.). Retrieved from https://huggingface.co/facebook/blenderbot-400M-distill

Llama 3.2 3B. (n.d.). Retrieved from https://huggingface.co/meta-llama/Llama-3.2-3B

Cross-encoder/ms-marco-TinyBERT-L2-v2. (n.d.). Retrieved from https://huggingface.co/cross-encoder/ms-marco-TinyBERT-L2-v2

Sentence-transformers/all-MiniLM-L12-v2. (n.d.). Retrieved from https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2

Issue

Vol. 9 No. 1 (2026): Proceedings of the European Conference of the PHM Society 2026

Section

Technical Papers

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

The Prognostic and Health Management Society advocates open-access to scientific data and uses a Creative Commons license for publishing and distributing any papers. A Creative Commons license does not relinquish the author’s copyright; rather it allows them to share some of their rights with any member of the public under certain conditions whilst enjoying full legal protection. By submitting an article to the International Conference of the Prognostics and Health Management Society, the authors agree to be bound by the associated terms and conditions including the following:

As the author, you retain the copyright to your Work. By submitting your Work, you are granting anybody the right to copy, distribute and transmit your Work and to adapt your Work with proper attribution under the terms of the Creative Commons Attribution 3.0 United States license. You assign rights to the Prognostics and Health Management Society to publish and disseminate your Work through electronic and print media if it is accepted for publication. A license note citing the Creative Commons Attribution 3.0 United States License as shown below needs to be placed in the footnote on the first page of the article.

First Author et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 United States License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

##plugins.themes.bootstrap3.article.main##

##plugins.themes.bootstrap3.article.sidebar##

Abstract

How to Cite

##plugins.themes.bootstrap3.article.details##