093 Keyword Disambiguation Using Transformers and Clustering to Build Cleaner Knowledge – NODES2022

21 Nov, 2022

Natural language processing is an indispensable toolkit to build knowledge graphs from unstructured data. However, it comes with a price. Keywords and entities in unstructured texts are ambiguous – the same concept can be expressed by many different linguistic variations. The resulting knowledge graph would thus be polluted with many nodes representing the same entity without any order. In this session, we show how the semantic similarity based on transformer embeddings and agglomerative clustering can help in the domain of academic disciplines and research fields and how Neo4j improves the browsing experience of this knowledge graph. Speakers: Federica Ventruto, Alessia Melania Lonoce Format: Full Session 30-45 min Level: Advanced Topics: #KnowledgeGraph, #MachineLearning, #Visualization, #General, #Advanced Region: EMEA Slides: https://dist.neo4j.com/nodes-20202-slides/093%20Keyword%20Disambiguation%20Using%20Transformers%20and%20Clustering%20to%20Build%20Cleaner%20Knowledge%20Graphs%20-%20NODES2022%20EMEA%20Advanced%206%20-%20Federica%20Ventruto%2C%20Alessia%20Melania%20Lonoce.pdf Visit https://neo4j.com/nodes-2022 learn more at https://neo4j.com/developer/get-started and engage at https://community.neo4j.com

Related Videos