ebook img

Speech and Computer : 19th International Conference, SPECOM 2017, Hatfield, UK, September 12-16, 2017, Proceedings PDF

845 Pages·2017·66.373 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Speech and Computer : 19th International Conference, SPECOM 2017, Hatfield, UK, September 12-16, 2017, Proceedings

Alexey Karpov Rodmonga Potapova Iosif Mporas (Eds.) 8 5 4 0 1 I A Speech and Computer N L 19th International Conference, SPECOM 2017 Hatfield, UK, September 12–16, 2017 Proceedings 123 fi Lecture Notes in Arti cial Intelligence 10458 Subseries of Lecture Notes in Computer Science LNAI Series Editors Randy Goebel University of Alberta, Edmonton, Canada Yuzuru Tanaka Hokkaido University, Sapporo, Japan Wolfgang Wahlster DFKI and Saarland University, Saarbrücken, Germany LNAI Founding Series Editor Joerg Siekmann DFKI and Saarland University, Saarbrücken, Germany More information about this series at http://www.springer.com/series/1244 Alexey Karpov Rodmonga Potapova (cid:129) Iosif Mporas (Eds.) Speech and Computer 19th International Conference, SPECOM 2017 fi – Hat eld, UK, September 12 16, 2017 Proceedings 123 Editors Alexey Karpov Iosif Mporas SPIIRAS University of Hertfordshire Saint Petersburg Hatfield Russia UK Rodmonga Potapova Moscow State Linguistic University Moscow Russia ISSN 0302-9743 ISSN 1611-3349 (electronic) Lecture Notesin Artificial Intelligence ISBN 978-3-319-66428-6 ISBN978-3-319-66429-3 (eBook) DOI 10.1007/978-3-319-66429-3 LibraryofCongressControlNumber:2017949519 LNCSSublibrary:SL7–ArtificialIntelligence ©SpringerInternationalPublishingAG2017 Thisworkissubjecttocopyright.AllrightsarereservedbythePublisher,whetherthewholeorpartofthe material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storageandretrieval,electronicadaptation,computersoftware,orbysimilarordissimilarmethodologynow knownorhereafterdeveloped. Theuseofgeneraldescriptivenames,registerednames,trademarks,servicemarks,etc.inthispublication doesnotimply,evenintheabsenceofaspecificstatement,thatsuchnamesareexemptfromtherelevant protectivelawsandregulationsandthereforefreeforgeneraluse. Thepublisher,theauthorsandtheeditorsaresafetoassumethattheadviceandinformationinthisbookare believedtobetrueandaccurateatthedateofpublication.Neitherthepublishernortheauthorsortheeditors give a warranty, express or implied, with respect to the material contained herein or for any errors or omissionsthatmayhavebeenmade.Thepublisherremainsneutralwithregardtojurisdictionalclaimsin publishedmapsandinstitutionalaffiliations. Printedonacid-freepaper ThisSpringerimprintispublishedbySpringerNature TheregisteredcompanyisSpringerInternationalPublishingAG Theregisteredcompanyaddressis:Gewerbestrasse11,6330Cham,Switzerland Preface The Speechand ComputerInternational Conference(SPECOM) hasbecomearegular eventsincethefirstSPECOM,whichwasheldinSt.Petersburg,RussianFederation,in 1996.Twentyoneyearsago,SPECOM wasestablished bytheSt.PetersburgInstitute for Informatics and Automation of the Russian Academy of Sciences and State Pedagogical University of Russia thanks to the efforts of Prof. Yuri Kosarev and Prof. Rajmund Piotrowski. SPECOMisaconferencewithalongtraditionthatattractsresearchersintheareaof computer speech processing (recognition, synthesis, understanding, etc.) and related domains (including signal processing, language and text processing, computational paralinguistics, multi-modal speech processing, and human–computer interaction). TheSPECOMInternationalConferenceisanidealplatformforknow-howexchange– especially for experts working on Slavic and other highly inflectional languages – including both under-resourced and regular well-resourced languages. In its long history, the SPECOM conference has been organized alternately by the St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences (SPIIRAS) and by the Moscow State Linguistic University (MSLU) in their home cities. Furthermore, in 1997 it was organized by the Cluj-Napoca Subsidiary oftheResearchInstituteforComputerTechnique(Romania),in2005bytheUniversity of Patras (in Patras, Greece), in 2011 by the Kazan Federal University (Russian Fed- eration,RepublicofTatarstan),in2013bytheUniversityofWestBohemia(inPilsen, Czech Republic), and in 2014 by the University of Novi Sad (Serbia), in 2015 by the University of Patras (in Athens, Greece), and in 2016 by the Budapest University of Technology and Economics (in Budapest, Hungary). SPECOM2017wasthe19theventintheseriesandthistimeitwasorganizedbythe University of Hertfordshire, in cooperation with the St. Petersburg Institute for Infor- maticsandAutomationoftheRussianAcademyofSciences(SPIIRAS),MoscowState Linguistic University (MSLU), and St. Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University). The conference washeldjointlywiththeSecondInternationalConferenceonInteractiveCollaborative Robotics (ICR) – where problems and modern solutions of human–robot interaction were discussed – during September 12–16, 2017 at the College Lane campus of the University ofHertfordshire,which islocatedinHatfield, UK20miles(30kilometres) north of London, just 20 minutes by train from London’s King’s Cross station. During the conference two invited talks were given by Prof. Mark J.F. Gales (Engineering Department, University of Cambridge, UK) and Prof. Björn W. Schuller (University of Passau, Germany and Imperial College London, UK) on the latest achievements in speech technology, automatic speech recognition, keyword spotting speakeranalysisandcomputationalparalinguistics.Theinvitedpapersarepublishedas a first part of the SPECOM 2017 proceedings. VI Preface This volume contains a collection of submitted papers presented at the conference, whichwerethoroughlyreviewedbymembersoftheProgramCommitteeconsistingof above 100 top specialists in the conference topic areas. A total of 80 accepted papers outof150submitted forSPECOMandICRwereselectedbytheProgram Committee forpresentationattheconferenceandforinclusioninthisbook.Theoreticalandmore general contributions were presented in common (plenary) sessions. Problem-oriented sessions as well as panel discussions brought together specialists in limited problem areaswiththeaimofexchangingknowledgeandskillsresultingfromresearchprojects ofallkinds.Thisyear,excepttheregulartechnicalsessions,threespecialsessionswere organizedon(i)NaturalLanguageProcessingforSocialMediaAnalysis,(ii)Multilingual andLow-ResourcedLanguagesSpeechProcessinginHuman-ComputerInteraction,and (iii) Real-Life Challenges in Voice and Multimodal Biometrics. We would like to express our gratitude to the authors for providing their papers on time, to the members of the conference Program Committee and the organizers of the special sessions for their careful reviews and paper selection, and to the editors and correctors for their hard work in preparing this volume. Special thanks are due to the members of the Organizing Committee for their tireless effort and enthusiasm during the conference organization. September 2017 Alexey Karpov Rodmonga Potapova Iosif Mporas Organization The conference SPECOM 2017 was organized by the University of Hertfordshire, in cooperation with the St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences (SPIIRAS), Moscow State Linguistic University (MSLU), and St. Petersburg National Research University of Informa- tion Technologies, Mechanics and Optics (ITMO University). SPECOM 2017 was sponsored by ASM Solutions Ltd. (Moscow, Russia). The conference website is: http://specom.nw.ru/sites/2017. General Co-chairs Iosif Mporas University of Hertfordshire, UK Rodmonga Potapova MSLU, Russia Andrey Ronzhin SPIIRAS, Russia Program Committee Chair Alexey Karpov SPIIRAS, Russia Program Committee Shyam Agrawal, India Vasiliki Foufi, Switzerland Tanel Alumae, Estonia Peter French, UK Ebru Arisoy, Turkey Mark Gales, UK Elias Azarov, Belarus Philip Garner, Switzerland Gerard Bailly, France Theodoros Giannakopoulos, Greece Andrey Barabanov, Russia Gabor Gosztolya, Hungary Anton Batliner, Germany Abualsoud Hanani, Palestine Marie-Luce Bourguet, UK Charl van Heerden, South Africa Nick Campbell, Ireland Ivan Himawan, Australia Eric Castelli, Vietnam Ruediger Hoffmann, Germany Vladimiir Chuchupal, Russia Marek Hruz, Czech Republic Dirk van Compernolle, Belgium Alexei Ivanov, USA Marelie Davel, South Africa Kristiina Jokinen, Finland Vlado Delic, Serbia Oliver Jokisch, Germany Olivier Deroo, Belgium Alexey Karpov, Russia Keelan Evanini, USA Heysem Kaya, Turkey Nicholas Evans, France Andreas Kerren, Sweden Vera Evdokimova, Russia Tomi Kinnunen, Finland Nikos Fakotakis, Greece Irina Kipyatkova, Russia Mauro Falcone, Italy Kate Knill, UK VIII Organization Daniil Kocharov, Russia Blaise Potard, UK Liliya Komalova, Russia Fabio Rinaldi, Switzerland Theodoros Kostoulas, Switzerland Andrey Ronzhin, Russia Constantine Kotropoulos, Greece Paolo Rosso, Spain Georgios Kouroupetroglou, Greece Milan Rusko, Slovakia Alexandros Lazaridis, Switzerland Saeid Safavi, UK Benjamin Lecouteux, France Sakriani Sakti, Japan Boris Lobanov, Belarus Albert Ali Salah, Turkey Elena Lyakso, Russia Murat Saraclar, Turkey Fragkiskos Malliaros, USA Björn Schuller, Germany Konstantin Markov, Japan James Scobbie, UK Yuri Matveev, Russia Vasiliki Simaki, Sweden Lily Meng, UK Pavel Skrelin, Russia Roman Meshcheryakov, Russia Victor Sorokin, Russia Peter Mihajlik, Hungary Efstathios Stamatatos, Greece Wolfgang Minker, Germany Stefan Steidl, Germany Bernd Möbius, Germany Mikhail Stolbov, Russia Konstantinos Moustakas, Greece Sebastian Stüker, Germany Iosif Mporas, UK Yannis Stylianou, Greece Hema Murthy, India György Szaszák, Hungary Maryam Najafian, USA Zheng-Hua Tan, Denmark Satoshi Nakamura, Japan Laszlo Toth, Hungary Marina Nastasenko, Russia Isabel Trancoso, Portugal Géza Németh, Hungary Khiet Truong, The Netherlands Thomas Niesler, South Africa Stavros Tsakalidis, USA Stavros Ntalampiras, Italy Vasilisa Verkhodanova, Russia Carita Paradis, Sweden Klara Vicsi, Hungary Hemant Patil, India Wenwu Wang, UK Alexander Petrovsky, Belarus Christian Wellekens, France Alexey Petrovsky, Russia Andreas Wendemuth, Germany Branislav Popović, Serbia Hossein Zeinali, Iran Vsevolod Potapov, Russia Miloš Železný, Czech Republic Rodmonga Potapova, Russia Organizing Committee Iosif Mporas (Chair) Ekaterina Miroshnikova Alexey Karpov Rodmonga Potapova Irina Kipyatkova Andrey Ronzhin Dana Kovach Dmitry Ryumin Yuri Matveev Anton Saveliev Contents Invited Talks Low-Resource Speech Recognition and Keyword-Spotting. . . . . . . . . . . . . . 3 Mark J.F. Gales, Kate M. Knill, and Anton Ragni Big Data, Deep Learning – At the Edge of X-Ray Speaker Analysis. . . . . . . 20 Björn W. Schuller Conference Papers A Comparison of Covariance Matrix and i-vector Based Speaker Recognition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 Nikša Jakovljević, Ivan Jokić, Slobodan Jošić, and Vlado Delić A Trainable Method for the Phonetic Similarity Search in German Proper Names. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 Oliver Jokisch and Horst-Udo Hain Acoustic and Perceptual Correlates of Vowel Articulation in Parkinson’s Disease With and Without Mild Cognitive Impairment: A Pilot Study. . . . . . 56 Michaela Strinzel, Vasilisa Verkhodanova, Fedor Jalvingh, Roel Jonkers, and Matt Coler Acoustic Cues for the Perceptual Assessment of Surround Sound . . . . . . . . . 65 Ingo Siegert, Oliver Jokisch, Alicia Flores Lotz, Franziska Trojahn, Martin Meszaros, and Michael Maruschke Acoustic Modeling in the STC Keyword Search System for OpenKWS 2016 Evaluation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76 Ivan Medennikov, Aleksei Romanenko, Alexey Prudnikov, Valentin Mendelev, Yuri Khokhlov, Maxim Korenevsky, Natalia Tomashenko, and Alexander Zatvornitskiy Adaptation Approaches for Pronunciation Scoring with Sparse Training Data. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87 Federico Landini, Luciana Ferrer, and Horacio Franco AnAlgorithm forDetectionofBreathSoundsinSpontaneousSpeechwith Application to Speaker Recognition. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98 Sri Harsha Dumpala and K.N.R.K. Raju Alluri

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.