Abstract
We present a novel, efficient, and scalable approach for generating knowledge graphs (KGs) tailored to specific competency questions, leveraging large language model (LLM)-based retrieval-augmented generation (RAG) as a source of high-quality text data. Our method utilises a predefined ontology and defines two agents: The first agent extracts entities and triplets from the text corpus maintained by the RAG, while the second agent merges similar entities based on labels and descriptions, using embedding functions and LLM reasoning. This approach does not require fine-tuning or additional AI training, and relies solely on off-the-shelf technologies. Additionally, due to the use of RAG, the method can be used with a text corpus of arbitrary size. We applied our method to the high-pressure die casting domain, focusing on defects and their causes. In the absence of annotated datasets, manual evaluation of the resulting KGs showed over 90% precision in entity extraction and around 70% precision in triplet extraction, the main source of error being the RAG itself. Our findings suggest that this method can significantly aid in the rapid generation of customised KGs for specific applications.
Original language | English |
---|---|
Title of host publication | Proceedings of the 21st International Conference on Informatics in Control |
Pages | 365-376 |
Number of pages | 12 |
DOIs | |
Publication status | Accepted/In press - 2024 |
Event | 21st International Conference in Informatics in Control, Automation and Robotics - Vila Galé Porto Hotel, Porto, Portugal Duration: 18 Nov 2024 → 20 Nov 2024 Conference number: 21 https://icinco.scitevents.org/ |
Publication series
Name | Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics |
---|
Conference
Conference | 21st International Conference in Informatics in Control, Automation and Robotics |
---|---|
Abbreviated title | ICINCO |
Country/Territory | Portugal |
City | Porto |
Period | 18/11/24 → 20/11/24 |
Internet address |
Research Field
- Complex Dynamical Systems
Keywords
- Knowledge Graph Extraction
- Knowledge Graph Generation
- Large Language Model
- Retrieval-Augmented Generation
- High-Pressure Die Casting