LOT4KG is methodology for KG lifecycle that extends the LOT ontology engineering methodology with KG construction, ontology evolution and the subsequent KG evolution.
Change Analysis.The goal of this activity is to formally capture the changes that are to be implemented. The output of this sub-activity is a list of formalised changes. During the change evaluation activity, the formalised changes are evaluated against the ontology, to ensure that the ontology remains consistent and coherent. Therefore, this activity requires the list of formalised changes for the evaluation to be executed and outputs an evaluation report.
Ontology Update. In the first sub-activity of ontology update, a change is encoded (change encoding). In this activity, an engineer applies the necessary formalised changes to the ontology, similarly to the ontology encoding sub-activity within the high-level ontology implementation activity. The output of this activity is a new ontology in code format (ontology code).
Knowledge Graph Implementation. The knowledge graph implementation activity aims to construct and validate the knowledge graph. It is composed by a set of sub-activities that transform the input data, which can be of any type and format (e.g., tabular in CSV, text in PDF, etc.), into the knowledge graph and its later validation over a set of constraints. The output of this activity is the implemented knowledge graph (virtual or materialized) and the associated rules for constructing (e.g., RML, SPARQL-Anything, etc.) and validating it (e.g., ShEx or SHACL shapes). The latter may also include a validation report as output.
Knowledge Graph Publication. The high-level KG publication activity is a counterpart to the ontology publication activity. It captures the necessary tasks and steps which are taken to document and make the KG available online. The output of this activity is the documented KG and the online accessible resource. The lower-level activities part of the KG publication are documentation and data publication. During the documentation step, the mappings, RDF data, SHACL shapes and validation report are used to document the process and output of the implementation. The output is the HTML documentation, which can then be published during the data publication step alongside the online KG.
Knowledge Graph Maintenance. This task is modelled along the same lines as the ontology maintenance task. It is specifically aimed at fixing bugs in the already published KG. This step does not capture proper evolution, the changes within the ontology or the data sources. Therefore, the detailed activity is named bug detection. The output of the activity is the issues and bugs to be fixed by backtracking in the process.
Change Detection and Impact Analysis Step. To be able to update an already existing KG, the changes applied to the ontology need to be examined and analysed against the KG. We define three sub-activities: detect delta, refine relevant changes and analyse change impact. However, only the middle activity is mandatory. The main output of this activity is a list of relevant changes, relevant for the update of the KG.
Knowledge Graph Update. This activity, just as the one for updating the ontology, serves as a mirror to the KG implementation activity. Its purpose is to bring the KG up to date with regards to any types of changes in the ontology or source data. Hence, the KG update activity can be triggered from the change detection activities which provide the list of ontology changes, or from changes to the source data, depicted by the arrow connecting the KG maintenance activity with KG update. The KG update activity has four sub-activities that make it possible to update the KG, each of them associated with the corresponding assets of the KG implementation (mappings, RDF graph, constraints and validation report). The high-level output is the updated RDF graph and its associated assets (i.e. mappings, data constraints, and validation report), which are to be published using the KG publishing activity.
Coming soon...
The LOT4KG methodology figures are available for reuse in the LOT Github repository under the Creative Commons Attribution Share Alike license.
If you are using the content of LOT4KG mehtodology you should cite: Romana Pernisch, David Chaves-Fraga, María Poveda-Villalón, Lise Stork, and Diego Conde-Herreros. LOT4KG: A Joint Methodology for the Ontology and Knowledge Graph Lifecycle. Under Review (ISWC2024).
If you are using the content of LOT mehtodology you should cite: Poveda-Villalón, M., Fernández-Izquierdo, A., Fernández-López, M., & García-Castro, R. (2022). LOT: An industrial oriented ontology engineering framework. Engineering Applications of Artificial Intelligence, 111, 104755. https://doi.org/10.1016/j.engappai.2022.104755