Natural Language Processing for Global and Local Business Fatih Pinarbasi Istanbul Medipol University, Turkey M. Nurdan Taskiran Istanbul Medipol University, Turkey A volume in the Advances in Business Information Systems and Analytics (ABISA) Book Series Published in the United States of America by IGI Global Business Science Reference (an imprint of IGI Global) 701 E. Chocolate Avenue Hershey PA, USA 17033 Tel: 717-533-8845 Fax: 717-533-8661 E-mail: [email protected] Web site: http://www.igi-global.com Copyright © 2021 by IGI Global. All rights reserved. No part of this publication may be reproduced, stored or distributed in any form or by any means, electronic or mechanical, including photocopying, without written permission from the publisher. Product or company names used in this set are for identification purposes only. Inclusion of the names of the products or companies does not indicate a claim of ownership by IGI Global of the trademark or registered trademark. Library of Congress Cataloging-in-Publication Data Names: Pinarbași, Fatih, 1991- editor. | Taşkıran, Nurdan Öncel, 1960- editor. Title: Natural language processing for global and local business / Fatih Pinarbași and M. Nurdan Oncel Taskiran, editors. Description: Hershey, PA : Business Science Reference, [2020] | Includes bibliographical references and index. | Summary: “This book explores the theoretical and practical phenomenon of natural language processing through different languages and platforms in terms of today’s conditions”-- Provided by publisher. Identifiers: LCCN 2019059926 (print) | LCCN 2019059927 (ebook) | ISBN 9781799842408 (hardcover) | ISBN 9781799851349 (paperback) | ISBN 9781799842415 (ebook) Subjects: LCSH: Natural language processing (Computer science) | Computational linguistics. | Discourse analysis--Data processing. | Emotive (Linguistics) Classification: LCC QA76.9.N38 N388 2020 (print) | LCC QA76.9.N38 (ebook) | DDC 006.3/5--dc23 LC record available at https://lccn.loc.gov/2019059926 LC ebook record available at https://lccn.loc.gov/2019059927 This book is published in the IGI Global book series Advances in Business Information Systems and Analytics (ABISA) (ISSN: 2327-3275; eISSN: 2327-3283) British Cataloguing in Publication Data A Cataloguing in Publication record for this book is available from the British Library. All work contributed to this book is new, previously-unpublished material. The views expressed in this book are those of the authors, but not necessarily of the publisher. For electronic access to this publication, please contact: [email protected]. Advances in Business Information Systems and Analytics (ABISA) Book Series Madjid Tavana La Salle University, USA ISSN:2327-3275 EISSN:2327-3283 Mission The successful development and management of information systems and business analytics is crucial to the success of an organization. New technological developments and methods for data analysis have allowed organizations to not only improve their processes and allow for greater productivity, but have also provided businesses with a venue through which to cut costs, plan for the future, and maintain competitive advantage in the information age. The Advances in Business Information Systems and Analytics (ABISA) Book Series aims to present diverse and timely research in the development, deployment, and management of business information systems and business analytics for continued organizational development and improved business value. Coverage • Performance Metrics IGI Global is currently accepting manuscripts • Algorithms for publication within this series. To submit a pro- • Business Models posal for a volume in this series, please contact our • Data Strategy Acquisition Editors at [email protected] • Forecasting or visit: http://www.igi-global.com/publish/. • Legal information systems • Business Information Security • Statistics • Strategic Information Systems • Geo-BIS The Advances in Business Information Systems and Analytics (ABISA) Book Series (ISSN 2327-3275) is published by IGI Global, 701 E. Chocolate Avenue, Hershey, PA 17033-1240, USA, www.igi-global.com. This series is composed of titles available for purchase individu- ally; each title is edited to be contextually exclusive from any other title within the series. For pricing and ordering information please visit http://www.igi-global.com/book-series/advances-business-information-systems-analytics/37155. Postmaster: Send all address changes to above address. © © 2021 IGI Global. All rights, including translation in other languages reserved by the publisher. No part of this series may be reproduced or used in any form or by any means – graphics, electronic, or mechanical, including photocopying, recording, taping, or informa- tion and retrieval systems – without written permission from the publisher, except for non commercial, educational use, including classroom teaching purposes. The views expressed in this series are those of the authors, but not necessarily of IGI Global. Titles in this Series For a list of additional titles in this series, please visit: https://www.igi-global.com/book-series/advances-business-information-systems-analytics/37155 Applications of Big Data and Business Analytics in Management Sneha Kumari (Vaikunth Mehta National Institute of Cooperative Management, India) K. K. Tripathy (Vaikunth Mehta National Institute of Cooperative Management, India) and Vidya Kumbhar (Symbiosis International Uni- versity (Deemed, India) Business Science Reference • © 2020 • 300pp • H/C (ISBN: 9781799832614) • US $225.00 Handbook of Research on Integrating Industry 4.0 in Business and Manufacturing Isak Karabegović (Academy of Sciences and Arts of Bosnia and Herzegovina, Bosnia and Herzegovina) Ahmed Kovačević (City, University London, UK) Lejla Banjanović-Mehmedović (University of Tuzla, Bosnia and Her- zegovina) and Predrag Dašić (High Technical Mechanical School of Professional Studies in Trstenik, Serbia) Business Science Reference • © 2020 • 661pp • H/C (ISBN: 9781799827252) • US $265.00 Internet of Things (IoT) Applications for Enterprise Productivity Erdinç Koç (Bingol University, Turkey) Business Science Reference • © 2020 • 357pp • H/C (ISBN: 9781799831754) • US $215.00 Trends and Issues in International Planning for Businesses Babayemi Adekunle (Arden University, UK) Husam Helmi Alharahsheh (University of Wales Trinity Saint David, UK) and Abraham Pius (Arden University, UK) Business Science Reference • © 2020 • 225pp • H/C (ISBN: 9781799825470) • US $225.00 Institutional Assistance Support for Small and Medium Enterprise Development in Africa Isaac Oluwajoba Abereijo (Obafemi Awolowo University, Nigeria) Business Science Reference • © 2020 • 280pp • H/C (ISBN: 9781522594819) • US $205.00 Role of Regional Development Agencies in Entrepreneurial and Rural Development Emerging Research and Opportunities Milan B. Vemić (Union – Nikola Tesla University, Serbia) Business Science Reference • © 2020 • 246pp • H/C (ISBN: 9781799826415) • US $175.00 Using Applied Mathematical Models for Business Transformation Antoine Trad (IBISTM, France) and Damir Kalpić (University of Zagreb, Croatia) Business Science Reference • © 2020 • 543pp • H/C (ISBN: 9781799810094) • US $265.00 701 East Chocolate Avenue, Hershey, PA 17033, USA Tel: 717-533-8845 x100 • Fax: 717-533-8661 E-Mail: [email protected] • www.igi-global.com Table of Contents Foreword..............................................................................................................................................xv Preface.................................................................................................................................................xvi Acknowledgment................................................................................................................................xxi Section 1 A General Outlook on Natural Language Processing Chapter 1 AcademyandCompanyNeeds:ThePastandFutureofNLP................................................................1 Tiago Martins da Cunha, UNILAB, Brazil Chapter 2 DerivingBusinessValueFromOnlineDataSourcesUsingNaturalLanguageProcessing Techniques............................................................................................................................................17 Stephen Camilleri, University of Malta, Malta Chapter 3 NaturalLanguageProcessinginOnlineReviews.................................................................................40 Gunjan Ansari, JSS Academy of Technical Education, Noida, India Shilpi Gupta, JSS Academy of Technical Education, Noida, India Niraj Singhal, Shobhit Institute of Engineering and Technology (Deemed), Meerut, India Chapter 4 SentimentAnalysisasaRestrictedNLPProblem................................................................................65 Akshi Kumar, Delhi Technological University, India Divya Gupta, Galgotias University, India Chapter 5 DeepLearningforSentimentAnalysis:AnOverviewandPerspectives..............................................97 Vincent Karas, University of Augsburg, Germany Björn W. Schuller, University of Augsburg, Germany Section 2 Natural Language Processing in Business Chapter 6 MetaphorsinBusinessApplications:ModellingSubjectivityThroughEmotionsforMetaphor Comprehension...................................................................................................................................134 Sunny Rai, Mahindra Ecole Centrale, Hyderabad, India Shampa Chakraverty, Netaji Subhas University of Technology, India Devendra Kumar Tayal, Indira Gandhi Delhi Technical University for Women, India Chapter 7 EstimatingImportanceFromWebReviewsThroughTextualDescriptionandMetrics Extraction............................................................................................................................................154 Roney Lira de Sales Santos, University of Sao Paulo, Brazil Carlos Augusto de Sa, Federal University of Piaui, Brazil Rogerio Figueredo de Sousa, University of Sao Paulo, Brazil Rafael Torres Anchiêta, University of Sao Paulo, Brazil Ricardo de Andrade Lira Rabelo, Federal University of Piaui, Brazil Raimundo Santos Moura, Federal University of Piaui, Brazil Chapter 8 DiscoveryofSustainableTransportModesUnderlyingTripAdvisorReviewsWithSentiment Analysis:TransportDomainAdaptationofSentimentLabelledDataSet.........................................180 Ainhoa Serna, University of the Basque Country, Spain Jon Kepa Gerrikagoitia, BRTA Basque Research and Technology Alliance, Spain Chapter 9 ResearchJourneyofHateContentDetectionFromCyberspace........................................................200 Sayani Ghosal, Ambedkar Institute of Advanced Communication Technologies and Research, India Amita Jain, Ambedkar Institute of Advanced Communication Technologies and Research, India Chapter 10 TheUseofNaturalLanguageProcessingforMarketOrientationonRareDiseases.........................226 Matthias Hölscher, Institute for IT Management and Digitization, FOM University, Germany Rudiger Buchkremer, Institute for IT Management and Digitization, FOM University, Germany Chapter 11 QualityAssuranceinComputer-AssistedTranslationinBusinessEnvironments.............................247 Sanja Seljan, Faculty of Humanities and Social Sciences, University of Zagreb, Croatia Nikolina Škof Erdelja, Ciklopea, Croatia Vlasta Kučiš, Faculty of Arts, University of Maribor, Slovenia Ivan Dunđer, Faculty of Humanities and Social Sciences, University of Zagreb, Croatia Mirjana Pejić Bach, Faculty of Economics and Business, University of Zagreb, Croatia Section 3 Diversity Among Languages Over Natural Language Processing Chapter 12 AnExtensiveTextMiningStudyfortheTurkishLanguage:AuthorRecognition,Sentiment Analysis,andTextClassification........................................................................................................272 Durmuş Özkan Şahin, Ondokuz Mayıs University, Turkey Erdal Kılıç, Ondokuz Mayıs University, Turkey Chapter 13 SentimentAnalysisofArabicDocuments:MainChallengesandRecentAdvances.........................307 Hichem Rahab, ICISI Laboratory, University of Khenchela, Algeria Mahieddine Djoudi, TechNE Laboratory, University of Poitiers, France Abdelhafid Zitouni, LIRE Laboratory, University of Constantine 2, Algeria Chapter 14 BuildingLexicalResourcesforDialecticalArabic............................................................................332 Sumaya Sulaiman Al Ameri, Khalifa University of Science and Technology, UAE Abdulhadi Shoufan, Center for Cyber-Physical Systems, Khalifa University of Science and Technology, UAE Chapter 15 ACriticalReviewoftheCurrentStateofNaturalLanguageProcessinginMexicoandChile.........365 César Aguilar, Pontificia Universidad Católica de Chile, Chile Olga Acosta, Singularyta SpA, Chile Compilation of References...............................................................................................................390 About the Contributors....................................................................................................................443 Index...................................................................................................................................................450 Detailed Table of Contents Foreword..............................................................................................................................................xv Preface.................................................................................................................................................xvi Acknowledgment................................................................................................................................xxi Section 1 A General Outlook on Natural Language Processing Chapter 1 AcademyandCompanyNeeds:ThePastandFutureofNLP................................................................1 Tiago Martins da Cunha, UNILAB, Brazil ThischapterpresentsaviewofhowtheuseofNLPknowledgemightchangetherelationbetween universitiesandcompanies.ProductsfromNLPanalysisareexpectedinbothendsofthisattimesnot soreciprocalexchange.Buthistoryhasshowntheproductsdevelopedbyuniversitiesandcompaniesare complementaryforthedevelopmentofNLP.Thegreatvolumeofdatatheworldisproducingisrequiring newerperspectivestoprovideunderstanding.Theseneweraspectsfoundonbigdatamayprovidethe comprehensionofhumanlanguagecategorizationandthereforepossiblyhumanlanguageacquisition. Buttoprocessdatamoredataneedtobeproducedandnotallcompanieshavethetimetodedicatefor thistask.Thischapteraimstopresentthroughsharingliteraturereviewandexperienceinthefieldthat partnershipsarethemostreliableresourceforthecycleofknowledgeproductioninNLP.Companies needtobereceptiveofthetheoreticalknowledgetheuniversitymayprovide,anduniversitiesmustturn theirtheoreticalknowledgeforamoreappliedenvionment. Chapter 2 DerivingBusinessValueFromOnlineDataSourcesUsingNaturalLanguageProcessing Techniques............................................................................................................................................17 Stephen Camilleri, University of Malta, Malta Thewealthofinformationproducedovertheinternetempowersbusinessestobecomedata-driven organizations,increasingtheirabilitytopredictconsumerbehavior,takemoreinformedstrategicdecisions, andremaincompetitiveonthemarket.However,pastresearchdidnotidentifywhichonlinedatasources companiesshouldchoosetoachievesuchanobjective.Thischapteraimstoanalysehowonlinenews articles,socialmediamessages,anduserreviewscanbeexploitedbybusinessesusingnaturallanguage processing(NLP)techniquestobuildbusinessintelligence.NLPtechniquesassistcomputerstounderstand andderiveavaluablemeaningfromhuman(natural)languages.FollowingabriefintroductiontoNLP andadescriptionofhowthesethreetextstreamsdifferfromeachother,thechapterdiscussessixmain factorsthatcanassistbusinessesinchoosingonedatasourcefromanother.Thechapterconcludeswith futuredirectionstowardsimprovingbusinessapplicationsinvolvingNLPtechniques. Chapter 3 NaturalLanguageProcessinginOnlineReviews.................................................................................40 Gunjan Ansari, JSS Academy of Technical Education, Noida, India Shilpi Gupta, JSS Academy of Technical Education, Noida, India Niraj Singhal, Shobhit Institute of Engineering and Technology (Deemed), Meerut, India Theanalysisoftheonlinedatapostedonvariouse-commercesitesisrequiredtoimproveconsumer experienceandthusenhanceglobalbusiness.Theincreaseinthevolumeofsocialmediacontentin therecentyearsledtotheproblemofoverfittinginreviewclassification.Thus,therearisesaneedto selectrelevantfeaturestoreducecomputationalcostandimproveclassifierperformance.Thischapter investigatesvariousstatisticalfeatureselectionmethodsthataretimeefficientbutresultinselectionof fewredundantfeatures.Toovercomethisissue,wrappermethodssuchassequentialfeatureselection (SFS)andrecursivefeatureelimination(RFE)areemployedforselectionofoptimalfeatureset.The empiricalanalysiswasconductedonmoviereviewdatasetusingthreedifferentclassifiersandtheresults depictthatSVMcouldachievef-measureof96%withonly8%selectedfeaturesusingRFEmethod. Chapter 4 SentimentAnalysisasaRestrictedNLPProblem................................................................................65 Akshi Kumar, Delhi Technological University, India Divya Gupta, Galgotias University, India Withtheacceleratedevolutionofsocialnetworks,thereisatremendousincreaseinopinionsbythe peopleaboutproductsorservices.Whilethisuser-generatedcontentinnaturallanguageisintendedto bevaluable,itslargeamountsrequireuseofcontentminingmethodsandNLPtouncovertheknowledge forvarioustasks.Inthisstudy,sentimentanalysisisusedtoanalyzeandunderstandtheopinionsof usersusingstatisticalapproaches,knowledge-basedapproaches,hybridapproaches,andconcept-based ontologies.Unfortunately,sentimentanalysisalsoexperiencesarangeofdifficultieslikecolloquialwords, negationhandling,ambiguityinwordsense,coreferenceresolution,whichhighlightanotherperspective emphasizingthatsentimentanalysisiscertainlyarestrictedNLPproblem.Thepurposeofthischapteris todiscoverhowsentimentanalysisisarestrictedNLPproblem.Thus,thischapterdiscussedtheconcept ofsentimentanalysisinthefieldofNLPandexploredthatsentimentanalysisisarestrictedNLPproblem duetothesophisticatednatureofnaturallanguage. Chapter 5 DeepLearningforSentimentAnalysis:AnOverviewandPerspectives..............................................97 Vincent Karas, University of Augsburg, Germany Björn W. Schuller, University of Augsburg, Germany Sentimentanalysisisanimportantareaofnaturallanguageprocessingthatcanhelpinformbusiness decisionsbyextractingsentimentinformationfromdocuments.Thepurposeofthischapteristointroduce thereadertoselectedconceptsandmethodsofdeeplearningandshowhowdeepmodelscanbeused toincreaseperformanceinsentimentanalysis.Itdiscussesthelatestadvancesinthefieldandcovers