Lecture Notes in Computer Science 6716 CommencedPublicationin1973 FoundingandFormerSeriesEditors: GerhardGoos,JurisHartmanis,andJanvanLeeuwen EditorialBoard DavidHutchison LancasterUniversity,UK TakeoKanade CarnegieMellonUniversity,Pittsburgh,PA,USA JosefKittler UniversityofSurrey,Guildford,UK JonM.Kleinberg CornellUniversity,Ithaca,NY,USA AlfredKobsa UniversityofCalifornia,Irvine,CA,USA FriedemannMattern ETHZurich,Switzerland JohnC.Mitchell StanfordUniversity,CA,USA MoniNaor WeizmannInstituteofScience,Rehovot,Israel OscarNierstrasz UniversityofBern,Switzerland C.PanduRangan IndianInstituteofTechnology,Madras,India BernhardSteffen TUDortmundUniversity,Germany MadhuSudan MicrosoftResearch,Cambridge,MA,USA DemetriTerzopoulos UniversityofCalifornia,LosAngeles,CA,USA DougTygar UniversityofCalifornia,Berkeley,CA,USA GerhardWeikum MaxPlanckInstituteforInformatics,Saarbruecken,Germany Rafael Muñoz Andrés Montoyo Elisabeth Métais (Eds.) Natural Language Processing and Information Systems 16th International Conference onApplications ofNaturalLanguagetoInformationSystems,NLDB2011 Alicante, Spain, June 28-30, 2011 Proceedings 1 3 VolumeEditors RafaelMuñoz AndrésMontoyo UniversityofAlicante,SchoolofComputing,03080Alicante,Spain E-mail:[email protected];[email protected] ElisabethMétais CNAM-LaboratoireCédric,292RueSt.Martin,75141ParisCedex03,France E-mail:[email protected] ISSN0302-9743 e-ISSN1611-3349 ISBN978-3-642-22326-6 e-ISBN978-3-642-22327-3 DOI10.1007/978-3-642-22327-3 SpringerHeidelbergDordrechtLondonNewYork LibraryofCongressControlNumber:2011930837 CRSubjectClassification(1998):I.2.7,H.3,H.2,I.5,J.3,H.2.8,I.2.6 LNCSSublibrary:SL3–InformationSystemsandApplication,incl.Internet/Web andHCI ©Springer-VerlagBerlinHeidelberg2011 Thisworkissubjecttocopyright.Allrightsarereserved,whetherthewholeorpartofthematerialis concerned,specificallytherightsoftranslation,reprinting,re-useofillustrations,recitation,broadcasting, reproductiononmicrofilmsorinanyotherway,andstorageindatabanks.Duplicationofthispublication orpartsthereofispermittedonlyundertheprovisionsoftheGermanCopyrightLawofSeptember9,1965, initscurrentversion,andpermissionforusemustalwaysbeobtainedfromSpringer.Violationsareliable toprosecutionundertheGermanCopyrightLaw. Theuseofgeneraldescriptivenames,registerednames,trademarks,etc.inthispublicationdoesnotimply, evenintheabsenceofaspecificstatement,thatsuchnamesareexemptfromtherelevantprotectivelaws andregulationsandthereforefreeforgeneraluse. Typesetting:Camera-readybyauthor,dataconversionbyScientificPublishingServices,Chennai,India Printedonacid-freepaper SpringerispartofSpringerScience+BusinessMedia(www.springer.com) Preface The 16th International Conference on Applications of Natural Language to In- formationSystems (NLDB 2011)was held during June 28–30,2011,at the Uni- versity of Alicante, Spain. Since the first NLDB conference in 1995, the main goalhasbeentoprovideaforumforthediscussionanddisseminationofresearch on the integration of natural language resources within the field of information system engineering. The development and convergence of computing, telecommunications and information systems has already led to a revolution in the way that we work, communicate with each other, buy goods and use services, and even in the way weentertainandeducateourselves.Astherevolutioncontinues,largevolumesof informationareincreasinglystoredinamannerwhichismorenaturalforusersto exploit than the data presentation formats typical of legacy computer systems. Natural language processing (NLP) is crucial to solving these problems and language technologies are indispensable to the success of information systems. We hope that NLDB 2011 was a modest contribution toward this goal. NLDB 2011contributedtothe increaseinthe numberofgoalsandthe inter- national standing of NLP conferences, largely due to its Program Committee, whose members are renowned researchers in the field of NLP and information system engineering. These high standards which have been set for NLDB can also be measured by the significant number of papers submitted (74) in this edition. Eachpaper was reviewedby three members of the ProgramCommittee or by their recommended subreviewers. As a result of the review process, 11 articles were accepted as regular papers and another 11 were accepted as short papers. Additional contributions were selected for the Poster and the Doctoral Symposium sessions held at the conference. Finally, we would like to thank all the reviewers for their involvement and excellent work. We extend these thanks to our invited speakers, Michael Thel- wall and Horacio Saggion, for their valuable contribution, which undoubtedly increased the interest in the conference. We would also like to express our grat- itude to the people who helped in the organization of the different parts of the conference program. We would especially like to thank Miguel Angel Varo´ for setting up and maintaing all Web services for the conference. June 2011 Rafael Mun˜oz Andr´es Montoyo Elisabeth M´etais Organization Conference Chairs Rafael Mun˜oz University of Alicante, Spain Elisabeth M´etais CEDRIC/CNAM, France Program Chair Andr´es Montoyo University of Alicante, Spain Doctoral Symposium Paloma Moreda University of Alicante, Spain Elena Lloret University of Alicante, Spain Poster Session Alexandra Balahur University of Alicante, Spain Jesu´s M. Hermida University of Alicante, Spain Organizing Committee Alexandra Balahur University of Alicante, Spain Ester Boldrini University of Alicante, Spain Antonio Ferra´ndez University of Alicante, Spain Jesu´s M. Hermida University of Alicante, Spain Fernando Llopis University of Alicante, Spain Elena Lloret University of Alicante, Spain Patricio Mart´ınez-Barco University of Alicante, Spain Andr´es Montoyo University of Alicante, Spain Paloma Moreda University of Alicante, Spain Rafael Mun˜oz University of Alicante, Spain Manuel Palomar University of Alicante, Spain Jesu´s Peral University of Alicante, Spain Program Committee Akhilesh Bajaj University of Tulsa, USA Alexander Hinneburg University of Halle, Germany Alfredo Cuzzocrea University of Calabria,Italy Andreas Hotho University of Kassel, Germany VIII Organization Andr´es Montoyo Universidad de Alicante, Spain Antje Du¨sterh¨oft Hochschule Wismar, Germany Bernhard Thalheim Kiel University, Germany Cedric du Mouza CNAM, France Christian Kop University of Klagenfurt, Austria Christian Winkler University of Klagenfurt, Austria Christina J. Hopfe Cardiff University, UK Deryle Lonsdale Brigham Young Uinversity, USA Elisabeth M´etais CNAM, France Epaminondas Kapetanios University of Westminster, UK Fabio Rinaldi University of Zurich, Switzerland Farid Meziane Salford University, UK Frederic Andres University of Advanced Studies, Japan Georgia Koutrika Stanford University, USA Grigori Sidorov National Researcher of Mexico, Mexico Gu¨nter Neumann DFKI, Germany Gu¨nther Fliedl University of Klagenfurt, Austria Hae-Chang Rim Korea University, Korea Harmain Harmain United Arab Emirates University, UAE Heinrich C. Mayr University of Klagenfurt, Austria Helmut Horacek Saarland University, Germany Hiram Calvo National Polytechnic Institute, Mexico Irena Spasic Manchester Centre for Integrative Systems Biology, UK Isabelle Comyn-Wattiau CNAM, France Jacky Akoka CNAM, France Jana Lewerenz Capgemini Du¨sseldorf, Germany Jian-Yun Nie Universit´e de Montr´eal, Canada Jon Atle Gulla Norwegian University of Science and Technology, Norway Juan Carlos Trujillo Universidad de Alicante, Spain Ju¨rgen Rilling Concordia University, Canada Karin Harbusch Universit¨at Koblenz-Landau, Germany KrishnaprasadThirunarayan Wright State University, USA Leila Kosseim Concordia University, Canada Luis Alfonso Uren˜a Universidad de Ja´en, Spain Luisa Mich University of Trento, Italy Magdalena Wolska Saarland University, Germany Manuel Palomar Universidad de Alicante, Spain Max Silberztein Universit´e de Franche-Comt´e, France Nadira Lammari CNAM, France Odile Piton Universit´e Paris I Panth´e on-Sorbonne, France Panos Vassiliadis University of Ioannina, Greece Paul Johannesson Stockholm University, Sweden Paul McFetridge Simon Fraser University, Canada Organization IX Philipp Cimiano CITEC, University of Bielefeld, Germany Pit Pichappan Annamalai University, India Rafael Mun˜oz Universidad de Alicante, Spain Ren´e Witte Concordia University, Canada Roger Chiang University of Cincinnati, USA Rossi Setchi Cardiff University, UK Samira Si-Said Cherfi CNAM, France St´ephane Lopes Universit´e de Versailles, France Udo Hahn Friedrich-Schiller-Universit¨atJena, Germany Veda Storey Georgia State University, USA Vijay Sugumaran Oakland University Rochester, USA Yacine Rezgui University of Salford, UK Zornitsa Kozareva Information Science Institute, University of South California, USA Zoubida Kedad Universit´e de Versailles, France Additional Reviewers Alexandra Balahur John McCrae Arturo Montejo Maria Bergholtz Doina Tatar Mar´ıa Teresa Mart´ın Valdivia Elena Lloret O´scar Ferra´ndez Jesu´s M. Hermida Sponsoring Institutions Conseller´ıa d’Educacio´, Generalitat Valenciana, Spain University of Alicante, Spain Table of Contents Invited Talks .................................................... 1 Horacio Saggion and Michael Thelwall Full Papers COMPENDIUM: A Text Summarization System for Generating Abstracts of Research Papers...................................... 3 Elena Lloret, Mar´ıa Teresa Roma´-Ferri, and Manuel Palomar Automatic Generation of Semantic Features and Lexical Relations Using OWL Ontologies ........................................... 15 Maha Al-Yahya, Hend Al-Khalifa, Alia Bahanshal, and Iman Al-Oudah EmotiNet: A Knowledge Base for Emotion Detection in Text Built on the Appraisal Theories............................................ 27 Alexandra Balahur, Jesu´s M. Hermida, Andr´es Montoyo, and Rafael Mun˜oz Querying Linked Data Using Semantic Relatedness: A Vocabulary Independent Approach............................................ 40 Andr´e Freitas, Joa˜o Gabriel Oliveira, Sea´n O’Riain, Edward Curry, and Joa˜o Carlos Pereira da Silva Extracting Explicit and Implicit Causal Relations from Sparse, Domain-Specific Texts ............................................ 52 Ashwin Ittoo and Gosse Bouma Topics Inference by Weighted Mutual Information Measures Computed from Structured Corpus........................................... 64 Harry Chang Improving Subtree-Based Question Classification Classifiers with Word-Cluster Models............................................. 76 Le Minh Nguyen and Akira Shimazu Data-Driven Approach Based on Semantic Roles for Recognizing Temporal Expressions and Events in Chinese ........................ 88 Hector Llorens, Estela Saquete, Borja Navarro, Liu Li, and Zhongshi He Information Retrieval Techniques for Corpus Filtering Applied to External PlagiarismDetection ..................................... 100 Daniel Micol, O´scar Ferra´ndez, and Rafael Mun˜oz XII Table of Contents WordSenseDisambiguation:AGraph-BasedApproachUsingN-Cliques Partitioning Technique ........................................... 112 Yoan Guti´errez, Sonia Va´zquez, and Andr´es Montoyo OntoFIS as a NLP Resource in the Drug-Therapy Domain: Design Issues and Solutions Applied ...................................... 125 Mar´ıa Teresa Roma´-Ferri, Jesu´s M. Hermida, and Manuel Palomar Short Papers Exploiting Unlabeled Data for Question Classification ................ 137 David Toma´s and Claudio Giuliano A System for Adaptive Information Extraction from Highly Informal Text............................................................ 145 Laura Alonso i Alemany and Rafael Carrascosa Pythia: Compositional Meaning Construction for Ontology-Based Question Answering on the Semantic Web........................... 153 Christina Unger and Philipp Cimiano ‘twazn me!!! ;(’ Automatic Authorship Analysis of Micro-Blogging Messages........................................................ 161 Rui Sousa Silva, Gustavo Laboreiro, Lu´ıs Sarmento, Tim Grant, Eug´enio Oliveira, and Belinda Maia Opinion Classification Techniques Applied to a Spanish Corpus ........ 169 Eugenio Mart´ınez-Ca´mara, M. Teresa Mart´ın-Valdivia, and L. Alfonso Uren˜a-Lo´pez Prosody Analysis of Thai Emotion Utterances ....................... 177 Sukanya Yimngam, Wichian Premchaisawadi, and Worapoj Kreesuradej Repurposing Social Tagging Data for Extraction of Domain-Level Concepts ....................................................... 185 Sandeep Purao, Veda C. Storey, Vijayan Sugumaran, Jordi Conesa, Julia` Minguillo´n, and Joan Casas Ontology-GuidedApproach to Feature-BasedOpinion Mining ......... 193 Isidro Pen˜alver-Mart´ınez, Rafael Valencia-Garc´ıa, and Francisco Garc´ıa-Sa´nchez A Natural Language Interface for Data Warehouse Question Answering ...................................................... 201 Nicolas Kuchmann-Beauger and Marie-Aude Aufaure