Cross-lingual Entity Alignment via Joint Attribute-Preserving Embedding
Entity alignment is the task of finding entities in two knowledge bases (KBs) that represent the same real-world object. When facing KBs in different natural languages, conventional cross-lingual entity alignment methods rely on machine translation to eliminate the language barriers. These approaches often suffer from the uneven quality of translations between languages. While recent embedding-based techniques encode entities and relationships in KBs and do not need machine translation for cross-lingual entity alignment, a significant number of attributes remain largely unexplored. In this paper, we propose a joint attribute-preserving embedding model for cross-lingual entity alignment. It jointly embeds the structures of two KBs into a unified vector space and further refines it by leveraging attribute correlations in the KBs. Our experimental results on real-world datasets show that this approach significantly outperforms the state-of-the-art embedding approaches for cross-lingual entity alignment and could be complemented with methods based on machine translation.
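The abstract does not say how aligned pairs are read out of the unified vector space. As a minimal sketch, assuming a nearest-neighbor lookup by cosine similarity (a common choice, not confirmed here as the paper's procedure), alignment over already-trained embeddings might look like the following. The entity names, vectors, and the `cosine` and `align` helpers are all hypothetical and made up for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def align(source, target):
    """Map each source entity to its most similar target entity.

    Assumes both dicts hold embeddings from the same (unified) vector space.
    """
    return {
        s_name: max(target, key=lambda t_name: cosine(s_vec, target[t_name]))
        for s_name, s_vec in source.items()
    }


# Toy embeddings, invented for this example (not taken from the paper).
kb_en = {"Paris": [0.9, 0.1], "Berlin": [0.1, 0.9]}
kb_fr = {"Paris_(fr)": [0.8, 0.2], "Berlin_(fr)": [0.2, 0.8]}

matches = align(kb_en, kb_fr)
print(matches)
```

This covers only the lookup step. The joint structure embedding and the attribute-based refinement described in the abstract are not modeled here.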
Forward citations
Cited by 1 Pith paper
- HELEA: Hard-Negative Benchmark and LLM-based Reranking for Robust Entity Alignment
HELEA creates hard-negative benchmarks (DW-HN29K, DY-HN27K) where name-overlap baselines fail and reports F1 0.967 on the new sets while preserving strong standard-benchmark scores via encoder retrieval plus untrained...