Abstract
Text-based entity matching facilitates interoperability between heterogeneous systems by aligning textual person descriptions. We propose an entity matching methodology that integrates rule-based feature extraction, similarity measures, and supervised machine learning classifiers, rigorously evaluated on a person matching problem. We constructed a feature space by extracting domain-specific person attributes from text via a combination of string similarity scores and similarities of term frequency-inverse document frequency (TF-IDF) embeddings. Next, we evaluated multiple supervised classification models, including Multi-Layer Perceptron, Random Forest, and XGBoost, to determine their effectiveness. For evaluation, we created a new domain-specific entity matching dataset named Real Scenario Text-based Person Matching (RSTPM), and assessed the person matching performance of all models in terms of classification metrics and computational cost. In addition, we studied the classification impact of the various features. The proposed approach was shown to achieve an increase of 27.47 percentage points (from 55.41% to 82.88%) in F1-Score compared to the baseline and a total Accuracy of 92.14%, thus demonstrating significant improvements in textual person matching whilst exhibiting a moderate increase in computational demand.
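The feature construction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the string similarity measure, the TF-IDF weighting, or the extracted attributes, so `difflib`'s ratio, a smoothed IDF, whitespace tokenization, and the helper names `string_similarity`, `tfidf_vectors`, `cosine_similarity`, and `pair_features` are all assumptions made for illustration. The resulting feature vector for a pair of descriptions is the kind of input a supervised classifier such as a Random Forest could be trained on.

```python
import math
from collections import Counter
from difflib import SequenceMatcher


def string_similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1] (difflib ratio, an assumed stand-in)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def tfidf_vectors(docs: list[str]) -> list[dict[str, float]]:
    """Build sparse TF-IDF vectors using a smoothed IDF (assumed weighting)."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents containing each term.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            term: (count / len(tokens))
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1.0)
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(u: dict[str, float], v: dict[str, float]) -> float:
    """Cosine similarity of two sparse vectors; 0.0 if either is empty."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def pair_features(desc_a: str, desc_b: str, corpus: list[str]) -> list[float]:
    """Feature vector for one candidate pair: [string similarity, TF-IDF cosine]."""
    vectors = tfidf_vectors(corpus + [desc_a, desc_b])
    return [
        string_similarity(desc_a, desc_b),
        cosine_similarity(vectors[-2], vectors[-1]),
    ]


if __name__ == "__main__":
    # Hypothetical person descriptions, invented for this example.
    background = ["woman with a blue coat", "man carrying a black backpack"]
    print(pair_features("tall man in a red jacket",
                        "man wearing red jacket, tall",
                        background))
```

Each feature lies in [0, 1], so the two scores can be concatenated with further attribute-level similarities without rescaling.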
| Original language | English |
|---|---|
| Title of host publication | 2025 IEEE International Conference on Systems, Man, and Cybernetics (SMC) |
| Pages | 7080 - 7085 |
| ISBN (Electronic) | 979-8-3315-3358-8 |
| Publication status | Published - 28 Jan 2026 |
| Event | 2025 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Austria Center Vienna, Vienna, Austria. Duration: 5 Oct 2025 → 8 Oct 2025. https://www.ieeesmc2025.org/ |
Conference
| Conference | 2025 IEEE International Conference on Systems, Man, and Cybernetics (SMC) |
|---|---|
| Abbreviated title | IEEE SMC 2025 |
| Country/Territory | Austria |
| City | Vienna |
| Period | 5/10/25 → 8/10/25 |
Research Field
- Responsive Sensing & Analytics
Fingerprint
Dive into the research topics of 'Text-based entity matching for entity resolution and data fusion applied to person descriptions'. Together they form a unique fingerprint.