Creating graphs from a corpus of documents
In this section, we will use the information extracted in the previous section using the different text engines to build networks that relate the different information. In particular, we will focus on two kinds of graphs:
- Knowledge-based graphs, where we will use the semantic meaning of sentences to infer relationships between the different entities.
- Bipartite graphs, connecting the documents with the entities appearing in the text. We will then project the bipartite graph into a homogeneous graph, which might be made of either document or entity nodes only.
Knowledge graphs
A knowledge graph is a structured representation of entities, their attributes, and relationships, organized as a graph. It encodes semantic relationships to enable reasoning, querying, and knowledge discovery. It is commonly used in AI and machine learning to power applications like recommendation systems, natural language understanding...