Table Of ContentLink¨opingStudiesinScienceandTechnology. ThesisNo. 1644
LicentiateThesis
Integration of Ontology Alignment
and Ontology Debugging for
Taxonomy Networks
by
Valentina Ivanova
DepartmentofComputerandInformationScience
Link¨opingUniversity
SE-58183Link¨oping,Sweden
Link¨oping2014
ThisisaSwedishLicentiate’sThesis
SwedishpostgraduateeducationleadstoaDoctor’sdegreeand/oraLicentiate’sdegree.
ADoctor’sdegreecomprises240ECTScredits(4yearoffull-timestudies).
ALicentiate’sdegreecomprises120ECTScredits.
Copyright(cid:13)c 2014ValentinaIvanova
ISBN978-91-7519-417-2
ISSN0280–7971
PrintedbyLiUTryck2014
URL:http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-102953
Abstract
Semantically-enabled applications, such as ontology-based search and data inte-
gration, take into account the semantics of the input data in their algorithms.
Such applications often use ontologies, which model the application domains in
question,aswellasalignments,whichprovideinformationabouttherelationships
between the terms in the different ontologies.
The quality and reliability of the results of such applications depend directly on
the correctness and completeness of the ontologies and alignments they utilize.
Traditionally, ontology debugging discovers defects in ontologies and alignments
and provides means for improving their correctness and completeness, while on-
tology alignment establishes the relationships between the terms in the different
ontologies, thus addressing completeness of alignments.
This thesis focuses on the integration of ontology alignment and debugging for
taxonomy networks which are formed by taxonomies, the most widely used kind
of ontologies, connected through alignments.
The contributions of this thesis include the following. To the best of our knowl-
edge,wehavedevelopedthefirstapproachandframeworkthatintegrateontology
alignment and debugging, and allow debugging of modelling defects both in the
structureofthetaxonomiesaswellasintheiralignments. Asdebuggingmodelling
defectsrequiresdomainknowledge,wehavedevelopedalgorithmsthatemploythe
domainknowledgeintrinsictothenetworktodetectandrepairmodellingdefects.
Further, a system has been implemented and several experiments with real-world
ontologies have been performed in order to demonstrate the advantages of our
integrated ontology alignment and debugging approach. For instance, in one of
theexperimentswiththewell-knownontologiesandalignmentfromtheAnatomy
track in Ontology Alignment Evaluation Initiative 2010, 203 modelling defects
(concerning incomplete and incorrect information) were discovered and repaired.
This work has been supported by the Swedish National Graduate School in Com-
puterScience(CUGS),theSwedishe-ScienceResearchCenter(SeRC)andVeten-
skapsr˚adet (VR).
v
Acknowledgements
When life brought me to Sweden I had never imagined the wonderful possi-
bilities I would discover. They did not come for granted, though. The path
through the research world is thorny, going up and down, turning at the
most unpredictable moments. I believe I have managed to put those to my
advantage and now I welcome the next challenge.
IamsincerelythankfultomysupervisorProfessorPatrickLambrixwho
has introduced me to the challenging area of ontologies. While working
under his supervision I have improved my calm judgement of circumstances
and, in general, my analytical skills. He provided encouraging and relaxed
workenvironmentandguidedmeduringallstagesofthiswork. Thankyou,
Patrick!
I am especially grateful to Professor Nahid Shahmehri, my second su-
pervisor, who is the main reason for me being at this university. She is the
one who first believed in my research talent and kindly advised me. I am
also thankful to Associate Professor Lena Str¨omb¨ack and David Byers who
made me believe I possess the strength to take this adventure. They have
introduced me to the wonderful world of research.
Thetimeherewouldnothavebeenthatenjoyablewithoutmycolleagues
who make the work environment so friendly. I also thank the people at the
IDA administrative department, and especially Anne, for their timely and
always kind assistance in various administrative issues. I say thank you
to Brittany Shahmehri for proof reading this thesis and providing valuable
remarks.
I am greatly thankful to my family and friends for their unquestion-
ing support and encouragement. Their belief in the successful end of this
adventure has always been driving me forward.
This work would not have been possible without my life partner Pavel.
He shares the sunny and stormy weather with me. Thank you, Pavel, for
your love and for being here!
Valentina Ivanova
January 2014
Link¨oping, Sweden
vii
Contents
1 Introduction 1
1.1 Semantic Web . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Ontologies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2.1 Ontology alignment . . . . . . . . . . . . . . . . . . . 4
1.2.2 Ontology debugging . . . . . . . . . . . . . . . . . . . 4
1.2.3 Ontology networks . . . . . . . . . . . . . . . . . . . . 5
1.2.4 Benefits from the integration of ontology alignment
and ontology debugging . . . . . . . . . . . . . . . . . 5
1.3 Problem formulation . . . . . . . . . . . . . . . . . . . . . . . 6
1.4 Contributions . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.5 Thesis structure. . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.6 List of publications . . . . . . . . . . . . . . . . . . . . . . . . 9
1.6.1 Thesis based on. . . . . . . . . . . . . . . . . . . . . . 9
1.6.2 Related publications . . . . . . . . . . . . . . . . . . . 9
1.6.3 Other publications . . . . . . . . . . . . . . . . . . . . 10
2 Background 11
2.1 Ontologies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.1.1 Components . . . . . . . . . . . . . . . . . . . . . . . 12
2.1.2 Classification . . . . . . . . . . . . . . . . . . . . . . . 15
2.1.3 Applications . . . . . . . . . . . . . . . . . . . . . . . 17
2.2 Ontology alignment. . . . . . . . . . . . . . . . . . . . . . . . 17
2.3 Ontology debugging . . . . . . . . . . . . . . . . . . . . . . . 20
2.3.1 Classification of defects . . . . . . . . . . . . . . . . . 21
2.4 Definitions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
2.4.1 Ontologies and ontology networks . . . . . . . . . . . 23
2.4.2 Knowledge bases . . . . . . . . . . . . . . . . . . . . . 23
3 Framework and Algorithms 25
3.1 Framework and workflow . . . . . . . . . . . . . . . . . . . . 26
3.2 Methods in the framework . . . . . . . . . . . . . . . . . . . . 28
3.2.1 Detect missing and wrong is-a relations and mappings 28
3.2.2 Repair missing and wrong is-a relations and mappings 31
3.3 Algorithms in the debugging component . . . . . . . . . . . . 35
ix
CONTENTS
3.3.1 Detect and validate candidate missing is-a relations
and mappings . . . . . . . . . . . . . . . . . . . . . . . 35
3.3.2 Repair missing and wrong is-a relations and mappings 38
3.4 Algorithms in the alignment component . . . . . . . . . . . . 43
3.4.1 Detect and validate candidate missing mappings . . . 43
3.4.2 Repair missing and wrong mappings . . . . . . . . . . 44
3.5 Interactions between the alignment component and the de-
bugging component . . . . . . . . . . . . . . . . . . . . . . . . 45
4 Implemented System 47
4.1 Detectandvalidatecandidatemissingis-arelationsandmap-
pings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
4.1.1 Detect and validate candidate missing is-a relations . 48
4.1.2 Detect and validate candidate missing mappings . . . 49
4.2 Repair missing and wrong is-a relations and mappings . . . . 51
4.2.1 Repair wrong is-a relations and mappings . . . . . . . 51
4.2.2 Repair missing is-a relations and mappings . . . . . . 52
5 Experiments and Discussions 55
5.1 Ontology debugging . . . . . . . . . . . . . . . . . . . . . . . 55
5.1.1 OAEI Anatomy 2010 . . . . . . . . . . . . . . . . . . . 55
5.2 Integration of ontology debugging and ontology alignment . . 60
5.2.1 OAEI Anatomy 2011 . . . . . . . . . . . . . . . . . . . 60
5.2.2 OAEI Benchmark 2010 . . . . . . . . . . . . . . . . . 64
5.2.3 ToxOntology-MeSH use case . . . . . . . . . . . . . . 70
5.3 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
6 Related work 79
6.1 Ontology debugging . . . . . . . . . . . . . . . . . . . . . . . 79
6.1.1 Debugging modelling defects . . . . . . . . . . . . . . 79
6.1.2 Debugging semantic defects . . . . . . . . . . . . . . . 82
6.2 Ontology alignment. . . . . . . . . . . . . . . . . . . . . . . . 86
6.3 Integration of ontology alignment and ontology debugging . . 88
7 Conclusions and Future Work 91
7.1 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
7.1.1 Debugging of ontologies and alignments . . . . . . . . 92
7.1.2 Benefits from the integration of ontology alignment
and ontology debugging . . . . . . . . . . . . . . . . . 92
7.1.3 Implemented system . . . . . . . . . . . . . . . . . . . 93
7.2 Future work . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
7.2.1 Extending the system . . . . . . . . . . . . . . . . . . 94
7.2.2 Long-term future work . . . . . . . . . . . . . . . . . . 95
x
List of Figures
2.1 (Part of an) Ontology network. . . . . . . . . . . . . . . . . . 13
2.2 Part of the is-a hierarchy in the Wine ontology. . . . . . . . . 14
2.3 Part of the Wine ontology. . . . . . . . . . . . . . . . . . . . . 15
2.4 A general alignment framework. . . . . . . . . . . . . . . . . . 18
2.5 An unsatisfiable concept in the Pizza ontology. . . . . . . . . 22
3.1 Workflow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
3.2 Initialization for detection.. . . . . . . . . . . . . . . . . . . . 35
3.3 Initialization for repairing. . . . . . . . . . . . . . . . . . . . . 38
3.4 Algorithm for generating repairing actions for wrong is-a re-
lations and mappings. . . . . . . . . . . . . . . . . . . . . . . 39
3.5 Algorithm for generating repairing actions for missing is-a
relations and mappings. . . . . . . . . . . . . . . . . . . . . . 41
4.1 Generating and validating CMIs. . . . . . . . . . . . . . . . . 49
4.2 Aligning.. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
4.3 Repairing wrong is-a relations. . . . . . . . . . . . . . . . . . 51
4.4 Repairing missing is-a relations. . . . . . . . . . . . . . . . . . 53
xi
LIST OF FIGURES
xii
Description:Abstract Semantically-enabled applications, such as ontology-based search and data inte-gration, take into account the semantics of the input data in their algorithms.