Connecting Book Knowledge to RDF Knowledge Spaces
In the π Book Knowledge Materialization In SIE, we explained how book information can be materialized into KnowledgeNodes. However, information about a book alone is not sufficient to form a rich knowledge space.
In real-world knowledge applications, books are connected to authors, publishers, people, organizations, locations, historical contexts, and related works.
Rather than consolidating all knowledge into a single database, the Semantic Integration Engine (SIE) (SIE) connects multiple RDF knowledge spaces and enables them to be used as a virtual knowledge graph.
Why RDF Integration Matters
The value of a book extends far beyond its bibliographic metadata through its relationships with surrounding knowledge.
For example, a book may have relationships such as the following.
Book
ββ Author
ββ Publisher
ββ Subject
ββ Place
ββ Historical Period
ββ Related Works
For example, The Tale of Genji naturally connects to knowledge about Murasaki Shikibu, the Heian period, Kyoto, Japanese literature, and court culture.
Such knowledge is typically not stored within a single data source.
RDF Knowledge Spaces Around Books
Various types of knowledge spaces can be used to enrich book knowledge.
RDF Native Sources
These knowledge spaces publish RDF data directly.
-
Wikidata
-
DBpedia
They provide broad knowledge covering people, organizations, places, and events.
Authority Sources
These knowledge spaces focus on identity and authority management.
-
VIAF
-
ISNI
-
ORCID
They are used to identify the same person or organization across multiple systems.
SIE Linking Model
SIE does not take the approach of importing external knowledge spaces into a massive centralized database.
Instead, SIE maintains links from KnowledgeNodes under its control to external knowledge spaces and constructs a federated virtual knowledge space.
Book KnowledgeNode
ββ OpenBD
ββ OpenLibrary
ββ Wikidata
ββ VIAF
ββ ISNI
ββ National Diet Library
The KnowledgeNode acts as a hub connecting multiple knowledge spaces.
RDF Node Resolution
The same person may be represented by different identifiers in different knowledge spaces.
For example:
Author
ββ OpenBD Author
ββ OpenLibrary Author
ββ Wikidata Entity
ββ VIAF Record
ββ NDL Authority
SIE maps these records to a single KnowledgeNode.
OpenBD Author
β
KnowledgeNode(Person)
β
Wikidata Person
β
VIAF
β
NDL Authority
This enables integrated use of person knowledge across multiple knowledge spaces.
Federated Knowledge Expansion
Book knowledge can be expanded transitively into many other forms of knowledge.
Book
β
Author
β
Organization
β
Place
β
Historical Period
β
Related Works
Users can explore knowledge starting from a book as the entry point.
Virtual Public Knowledge Space
Each RDF knowledge space operates as an independent knowledge island.
OpenBD
OpenLibrary
Wikidata
DBpedia
VIAF
ISNI
NDL
Europeana
SIE does not consolidate these spaces; it connects them.
As a result, users can access them as if they were a single large knowledge space.
This is not a physical integration but a logical and virtual one.
Architecture Overview
+------------------+
| KnowledgeNode |
+---------+--------+
|
+------------------+------------------+
| | |
v v v
OpenBD OpenLibrary Wikidata
|
v
VIAF
|
v
ISNI
|
v
NDL
Multiple knowledge spaces are connected around the KnowledgeNode.
Conclusion
The value of book knowledge comes not only from bibliographic metadata but also from its connections to external knowledge spaces.
SIE connects multiple RDF knowledge spaces around KnowledgeNodes and forms a virtual knowledge graph.
As a result, a book becomes more than bibliographic metadataβit becomes an entry point into a broader knowledge ecosystem.
References
Glossary
- RDF
-
A W3C-standardized data model that represents information as subjectβpredicateβobject triples.
- Semantic Integration Engine (SIE)
-
An integration engine that unifies structured knowledge (RDF) and document knowledge (SmartDox) derived from the BoK, making them directly accessible to AI.
- knowledge graph
-
A semantic graph-based knowledge base where nodes represent entities or concepts and edges represent their relationships.