From the course: Introduction to Probabilistic Knowledge Graphs: AI-Driven Inference and Real-World Applications
Unlock this course with a free trial
Join today to access over 25,500 courses taught by industry experts.
Limitations of unstructured data
From the course: Introduction to Probabilistic Knowledge Graphs: AI-Driven Inference and Real-World Applications
Limitations of unstructured data
In earlier videos, we showed how unstructured data like text, audio, images, with a focus on text, like how they can be processed and mapped into a graph schema, then annotated with probabilities in a probabilistic knowledge graph. Today, we'll turn to the important question. What are the limits of doing that? And why does having a solid ontology or schema matter when working with unstructured sources? First, even though we rely on a fixed schema or ontology for mapping extracted facts, unstructured sources often do not align neatly with that schema. The text or media might express relationships in ways the schema did not anticipate or omit information that your ontology expects. That means the extraction pipeline may fail to map facts or mismap them because the schema lacks the matching predicate or class. This mismatch introduces noise, ambiguity, and gaps. Second, ontologies themselves are static relative to the fluid world of unstructured content. When you extract from text using…
Contents
-
-
-
-
-
-
(Locked)
Graph schema design6m 38s
-
(Locked)
Example ontology template4m 40s
-
(Locked)
Structured data for PKGs3m 32s
-
(Locked)
Typical structured sources4m 26s
-
(Locked)
Limitations of structured data2m 36s
-
(Locked)
Real-world example: Structured data3m 26s
-
(Locked)
Unstructured data for PKGs4m 13s
-
(Locked)
Limitations of unstructured data3m 46s
-
(Locked)
Example: Biomedical literature mining4m 42s
-
(Locked)
Wrapping up data types for PKGs1m 12s
-
(Locked)
-
-