
Semantic Keyword-Based Search on Structured Data Sources: COST Action IC1302 Second International KEYSTONE Conference, IKC 2016, Cluj-Napoca, Romania, September 8–9, 2016, Revised Selected Papers

201 Pages·2017·15.89 MB·English


Andrea Calì, Dorian Gorgan, Martín Ugarte (Eds.)
Semantic Keyword-Based Search on Structured Data Sources
COST Action IC1302
Second International KEYSTONE Conference, IKC 2016
Cluj-Napoca, Romania, September 8–9, 2016
Revised Selected Papers
LNCS 10151

Lecture Notes in Computer Science 10151
Commenced Publication in 1973
Founding and Former Series Editors: Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen

Editorial Board
David Hutchison, Lancaster University, Lancaster, UK
Takeo Kanade, Carnegie Mellon University, Pittsburgh, PA, USA
Josef Kittler, University of Surrey, Guildford, UK
Jon M. Kleinberg, Cornell University, Ithaca, NY, USA
Friedemann Mattern, ETH Zurich, Zurich, Switzerland
John C. Mitchell, Stanford University, Stanford, CA, USA
Moni Naor, Weizmann Institute of Science, Rehovot, Israel
C. Pandu Rangan, Indian Institute of Technology, Madras, India
Bernhard Steffen, TU Dortmund University, Dortmund, Germany
Demetri Terzopoulos, University of California, Los Angeles, CA, USA
Doug Tygar, University of California, Berkeley, CA, USA
Gerhard Weikum, Max Planck Institute for Informatics, Saarbrücken, Germany

More information about this series at http://www.springer.com/series/7409

Andrea Calì · Dorian Gorgan · Martín Ugarte (Eds.)
Semantic Keyword-Based Search on Structured Data Sources
COST Action IC1302
Second International KEYSTONE Conference, IKC 2016
Cluj-Napoca, Romania, September 8–9, 2016
Revised Selected Papers

Editors
Andrea Calì, Department of Computer Science and Information Systems, Birkbeck University of London, London, UK
Martín Ugarte, Computer and Decision Engineering (CoDE) Department, Université Libre de Bruxelles, Brussels, Belgium
Dorian Gorgan, Computer Science Department, Technical University of Cluj-Napoca, Cluj-Napoca, Romania

ISSN 0302-9743
ISSN 1611-3349 (electronic)
Lecture Notes in Computer Science
ISBN 978-3-319-53639-2
ISBN 978-3-319-53640-8 (eBook)
DOI 10.1007/978-3-319-53640-8
Library of Congress Control Number: 2017931541
LNCS Sublibrary: SL3 – Information Systems and Applications, incl. Internet/Web, and HCI

© Springer International Publishing AG 2017

This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed.

The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.

The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, express or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Printed on acid-free paper

This Springer imprint is published by Springer Nature
The registered company is Springer International Publishing AG
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland

Preface

In data management we face the problem of handling and querying very large datasets with large and partially unknown schemas, possibly containing billions of instances. An important issue in this context is to efficiently perform keyword searches. The size of the datasets poses challenges related to both scalability and semantic analysis. Another challenge is the discovery of suitable data sources for keyword search, given that one would want to process queries on relevant sources.

The Second International KEYSTONE Conference (IKC 2016), organized within the COST Action IC1302 (Semantic Keyword-Based Search on Structured Data Sources), attracted several contributions in the area of keyword and semantic search on large structured data. In all, 14 papers were selected; the topics covered, among others, the areas of keyword extraction, natural language searches, graph databases, information retrieval techniques for keyword search and document retrieval.

The program also included invited talks by experts in the field: Maria-Esther Vidal (University of Bonn, Germany), Dan Olteanu (University of Oxford, UK), Mihai Dinsoreanu (Recognos, Romania), Stefan Dietze (LS3 Research Centre, University of Hannover, Germany), Dragan Ivanovic (University of Novi Sad, Serbia), and Radu Tudoran (Huawei, Germany). An exciting panel moderated by Yannis Velegrakis took place with the participation of Radu Tudoran and Vagan Terziyan. The program was stimulating and managed to keep the participants in the lecture room despite the wonderful sights of Cluj-Napoca.

The success of this conference is the result of the effort of many. We would like to thank the authors, the invited speakers, the conference participants, the members of the Program Committee, and the external referees.
We would also like to thank Springer for providing assistance in the preparation of the proceedings, the University of Cluj-Napoca for providing local facilities, and the local organizers and students who helped run the event.

COST (European Cooperation in Science and Technology) is a pan-European intergovernmental framework. Its mission is to enable breakthrough scientific and technological developments leading to new concepts and products and thereby contribute toward strengthening Europe’s research and innovation capacities. It allows researchers, engineers, and scholars to jointly develop their own ideas and take new initiatives across all fields of science and technology, while promoting multi- and interdisciplinary approaches. COST aims at fostering a better integration of less research-intensive countries to the knowledge hubs of the European research area. The COST Association, an international not-for-profit association under Belgian law, integrates all management, governing, and administrative functions necessary for the operation of the framework. The COST Association currently has 36 member countries.

September 2016
Andrea Calì
Dorian Gorgan
Martín Ugarte

Organization

IKC 2016 was organized within the COST Action IC1302 (Semantic Keyword-Based Search on Structured Data Sources), by the Computer Science Department, Faculty of Automation and Computer Science of the Technical University of Cluj-Napoca.
General Chair
Riccardo Torlone, Università Roma Tre, Italy

Program Chairs
Andrea Calì, Birkbeck University of London, UK
Dorian Gorgan, Technical University of Cluj-Napoca, Romania
Martín Ugarte, Université Libre de Bruxelles, Belgium

Organizing Chairs
Dorian Gorgan, Technical University of Cluj-Napoca, Romania
Victor Bacu, Technical University of Cluj-Napoca, Romania

Invited Speakers
Maria-Esther Vidal, University of Bonn, Germany
Dan Olteanu, University of Oxford, UK
Dragan Ivanovic, University of Novi Sad, Serbia
Radu Tudoran, Huawei Research Engineer, Germany

Additional Reviewers
V. Alexiev, R. Amaro, I. Anagnostopoulos, M. Bielikova, C. Bobed, F. Bobillo, K. Belhajjame, E. Domnori, M. Dragoni, J. Espinoza, F. Guerra, S. Ilarri, E. Ioannou, D. Ivanovic, A. Kovacevic, M. López Nores, J. Lacasta, M. Lupu, F. Mandreoli, A. Mestrovic, P. Missier, A. Nuernberger, F. Pop, V. Stoykova, G. Vargas-Solar, L. Vintan

Sponsoring Institutions
COST: European Cooperation in Science and Technology (www.cost.eu)

Contents

Invited Papers

Retrieval, Crawling and Fusion of Entity-centric Data on the Web
    Stefan Dietze
Data Multiverse: The Uncertainty Challenge of Future Big Data Analytics
    Radu Tudoran, Bogdan Nicolae, and Götz Brasche

Information Extraction and Retrieval

Experiments with Document Retrieval from Small Text Collections Using Latent Semantic Analysis or Term Similarity with Query Coordination and Automatic Relevance Feedback
    Colin Layfield, Joel Azzopardi, and Chris Staff
Unsupervised Extraction of Conceptual Keyphrases from Abstracts
    Philipp Ludwig, Marcus Thiel, and Andreas Nürnberger
Back to the Sketch-Board: Integrating Keyword Search, Semantics, and Information Retrieval
    Joel Azzopardi, Fabio Benedetti, Francesco Guerra, and Mihai Lupu
Topic Detection in Multichannel Italian Newspapers
    Laura Po, Federica Rollo, and Raquel Trillo Lado
Random Walks Analysis on Graph Modelled Multimodal Collections
    Serwah Sabetghadam, Mihai Lupu, and Andreas Rauber

Text and Digital Libraries

A Software Processing Chain for Evaluating Thesaurus Quality
    Javier Lacasta, Gilles Falquet, Javier Nogueras-Iso, and Javier Zarazaga-Soria
Comparison of Collaborative and Content-Based Automatic Recommendation Approaches in a Digital Library of Serbian PhD Dissertations
    Joel Azzopardi, Dragan Ivanovic, and Georgia Kapitsaki
Keyword-Based Search on Bilingual Digital Libraries
    Ranka Stanković, Cvetana Krstev, Duško Vitas, Nikola Vulović, and Olivera Kitanović
