Abstract
Text-based entity matching facilitates interoperability between heterogeneous systems by aligning textual person descriptions. We propose an entity matching methodology that integrates rule-based feature extraction, similarity measures, and supervised machine learning classifiers, rigorously evaluated on a person matching problem. We constructed a feature space by extracting domain-specific person attributes from text via a combination of string similarity scores and similarities of inverse document frequency (TF-IDF) embeddings. Next, we evaluated multiple supervised classification models including Multi-Layer Perceptron, Random Forest, and XGBoost, to determine their effectiveness. For evaluation, we created a new domain-specific entity matching dataset named Real Scenario Text-based Person Matching (RSTPM), and assessed the person matching performance of all models in terms of classification metrics and computational cost. In addition, we studied the classification impact of the various features. The proposed approach was shown to achieve an increase of 27.47 percentage points (from 55.41\% to 82.88\%) in F1-Score compared to the baseline and a total Accuracy of 92.14\%, thus demonstrating significant improvements in textual person matching whilst exhibiting a moderate increase in computational demand.
| Originalsprache | Englisch |
|---|---|
| Titel | 2025 IEEE International Conference on Systems, Man, and Cybernetics (SMC) |
| Seiten | 7080 - 7085 |
| ISBN (elektronisch) | 979-8-3315-3358-8 |
| Publikationsstatus | Veröffentlicht - 28 Jan. 2026 |
| Veranstaltung | 2025 IEEE International Conference on Systems, Man, and Cybernetics (SMC) - Austria Center Vienna, Vienna, Österreich Dauer: 5 Okt. 2025 → 8 Okt. 2025 https://www.ieeesmc2025.org/ |
Konferenz
| Konferenz | 2025 IEEE International Conference on Systems, Man, and Cybernetics (SMC) |
|---|---|
| Kurztitel | IEEE SMC 2025 |
| Land/Gebiet | Österreich |
| Stadt | Vienna |
| Zeitraum | 5/10/25 → 8/10/25 |
| Internetadresse |
Research Field
- Responsive Sensing & Analytics
Fingerprint
Untersuchen Sie die Forschungsthemen von „Text-based entity matching for entity resolution and data fusion applied to person descriptions“. Zusammen bilden sie einen einzigartigen Fingerprint.-
Enhancing Maritime Situational Awareness through Multimodal Fusion: Insights from a Real-World Experiment
Wohlleben, K. (Vortragende:r), Hubner, M., Markchom, T., Boyle, J., Ferryman, J., Veigl, S., Opitz, A., Gkamaris, A. & Bratskas, R., 13 Jan. 2026, 17th Symposium Sensor Data Fusion - SSDF. 8 S.Publikation: Beitrag in Buch oder Tagungsband › Vortrag mit Beitrag in Tagungsband › Begutachtung
-
Advanced Sensor Fusion for Railway Security - A Hierarchical Graph-Based Approach
Hubner, M. (Vortragende:r), Nausner, J. & Wohlleben, K., 5 Feb. 2025, 2024 Sensor Data Fusion: Trends, Solutions, Applications, SDF 2024. S. 1-7 7 S. (2024 Sensor Data Fusion: Trends, Solutions, Applications, SDF 2024).Publikation: Beitrag in Buch oder Tagungsband › Vortrag mit Beitrag in Tagungsband › Begutachtung
Open Access -
Bayesian Optimization for Parameter Selection in Fusion Systems
Wohlleben, K. (Autor:in und Vortragende:r), Siems, F., Nausner, J. & Hubner, M., 26 Aug. 2025, Proceedings of the 2025 28th International Conference on Information Fusion, FUSION 2025. 7 S.Publikation: Beitrag in Buch oder Tagungsband › Vortrag mit Beitrag in Tagungsband › Begutachtung
Diese Publikation zitieren
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver