ebook img

Recent Advances in Speech Understanding and Dialog Systems PDF

502 Pages·1988·20.993 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Recent Advances in Speech Understanding and Dialog Systems

Recent Advances in Speech Understanding and Dialog Systems NATO ASI Series Advanced Science Institutes Series A series presenting the results of activities sponsored by the NA TO Science Committee, which aims at the dissemination of advanced scientific and technological knowledge, with a view to strengthening links between scientific communities. The Series is published by an international board of publishers in conjunction with the NATO Scientific Affairs Division A Life Sciences Plenum Publishing Corporation B Physics London and New York C Mathematical and Kluwer Academic Publishers Physical Sciences Dordrecht, Boston and London D Behavioural and Social Sciences E Applied Sciences F Computer and Springer-Verlag Systems Sciences Berlin Heidelberg New York G Ecological Sciences London Paris Tokyo H Cell Biology Series F: Computer and Systems Sciences Vol. 46 Recent Advances in Speech Understanding and Dialog Systems Edited by H. Niemann Universitat Erlangen-NOrnberg Martensstr. 3 0-8520 Erlangen, FRG M. Lang Siemens AG, ZT ZTI SYS 5 Otto-Hahn-Ring 6 0-8000 MOnchen 83, FRG G. Sagerer Universitat Erlangen-NOrnberg Martensstr. 3 0-8520 Erlangen, FRG Springer-Verlag Berlin Heidelberg New York London Paris Tokyo Published in cooperation with NATO Scientific Affairs Division Proceedings of the NATO Advanced Study Institute on Recent Advances in Speech Understanding and Dialog Systems, held in Bad Windsheim, FRG, July 5-18, 1987. ISBN-13: 978-3-642-83478-3 e-ISBN-13: 978-3-642-83476-9 001: 10.1007/978-3-642-83476-9 This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concemed, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, reproduction on microfilms or in other ways, and storage in data banks. Duplication of this publication or parts thereof is only permitted under the provisions of the German Copyright Law of September 9, 1965, in its version of June 24, 1985, and a copyright fee must always be paid. Violations fall under the prosecution act of the German Copyright Law. © Springer-Verlag Berlin Heidelberg 1988 Softcover reprint of the hardcover 1st edition 1988 2145/3140-543210 PREFACE This volume contains invited and contributed papers presented at the NATO Advanced study Instit ute on "Recent Advances in Speech Understanding and Dialog systems" held in Bad Windsheim, Federal Republic of Germany, July 5 to July 18, 1987. It is divided into the three parts Speech coding and Segmentation, Word Recognition, and Linguistic Processing. Although this can only be a rough organization showing some overlap, the editors felt that it most naturally represents the bottom-up strategy of speech understanding and, therefore, should be useful for the reader. Part 1, SPEECH CODING AND SEGMENTATION, contains 4 invited and 14 contributed papers. The first invited paper summarizes basic properties of speech signals, reviews coding schemes, and describes a particular solution which guarantees high speech quality at low data rates. The second and third invited papers are concerned with acoustic-phonetic decoding. Techniques to integrate knowledge sources into speech recognition systems are presented and demonstrated by experimental systems. The fourth invited paper gives an overview of approaches for using prosodic knowledge in automatic speech recogni tion systems, and a method for assigning a stress score to every syllable in an utterance of German speech is reported in a contributed paper. A set of contributed papers treats the problem of automatic segmentation, and several authors successfully apply knowledge-based methods for interpreting speech signals and spectrograms. The last three papers investigate phonetic models, Markov models and fuzzy quantization techniques and provide a transit ion to Part 2 . Part 2, WORD RECOGNITION, contains 3 invited and 16 contributed papers. This part starts with an invited paper presenting a full introduction to and overview of all aspects of hidden Markov models and their application to isolated word recognition and continuous speech understanding. The second invited paper gives an approach to the unification of cognitive and information-theoretic approaches to speech recognition and its application to letter and digit recognition. The third invited paper is devoted to the problem of isolated word recognition in very large vocabularies. The first set of contributed papers concerns the problem of isolated word VI recognition, and the second set concerns word recognition in continuous speech. The last set of papers treats the problem of adaptation or learning in order to achieve speaker independence. One paper suggests an approach to the evaluation of speech recognizer performance. Part 3, LINGUISTIC PROCESSING, contains 3 invited and 7 contributed papers. This section makes explicit that linguistic processing has become an integral part of speech understanding systems. It covers both the continuous speech recognition task with the help of linguistic models and the speech understanding task based on explicitly represented linguistic knowledge. Syntactic, semantic, and pragmatic aspects in the design of on-line language understanding systems are addressed by the first invited paper. The second one evaluates the state of the art in semantic representation and linguistic analysis techniques as far as they are used in speech understanding systems. system architectures for knowledge-based speech understanding and the problem of knowledge representation for such systems are discussed in the third paper. Both papers also concern the integration and cooperation of linguistic and acoustic analysis for speech understanding. The problems of continuous speech recognition are treated in the first set of contributed papers. A second part deals with the understanding task, and a third one suggests models for oral dialog systems. The editors would like to thank the authors for their effort to provide papers of high scientific quality, the participants of the NATO ASI for their active support of a stimulating meeting, the publisher for preparing this volume, and last but not least the N4TO Office of Scientific Affairs in Brussels, Siemens AG Miinchen, and the University of Erlangen-Niirnberg for support of the ASI. H. Niemann, M. Lang, andG. Sagerer Erlangen and Miinchen Spring 1988 TABLE OF CONTENTS PART 1: Speech Coding and Segmentation INVITED PAPERS Recent Advances in Speech Coding 1 D. ~lf, H. Reininger, Universitat Frankfurt, F. R. Germany Acoustic-Phonetic Decoding of Speech 25 R.M. Schwartz, Y. Chow, M. Dunham, O. Kimball, M. Krasner, F. Rilbala, J. Makhoul, P. Price, S. Roucos Bolt Beranek and Newman Laboratories cambridge MA, USA Knowledge-Based Approaches in Acoustic-Phonetic Decoding of Speech 51 J. P. Haton, INRIA/CRIN Nancy, France The Use of Prosodic Parameters in Automatic Speech Recognition 71 J. Vaissiere, mET Lannion, France CONTRIBUTED PAPERS Prosodic Features in German Speech: Stress Assignment by Man and Machine 101 E. Neth, H. Niemann, S. schmelz universitat Erlangen, F .R. Germany Recognition of Speech Using '!emporal Decomposition 107 G. Chollet, G. Ahlbom, F. Bimbot, mRS Paris, France Long 'Darm Analysis-Synthesis of Speech by Non- Stationary AR Methods 111 S. Ragazzini, FondazioneUgo Bordoni Roma, Italy Using Contextual Information in View of Formant Speech Analysis Improvement 117 O. Al-Dakkak, G. Murillo, G. Bailly, B. Guerin Laboratoire de la connnunication Parlee Grenoble, France A Speech Recognition Strategy Based on Making Acoustic Evidence and Phonetic Knowledge Explicit 123 P.D. Green, M.P. Cooke, H.H. Lafferty, A.J.H. Simons. University of Sheffield, England On Finding Objects in Spectrograms: A Multiscale Relaxation Labelling Approach 129 M.P. Cooke, P.D. Green, University of Sheffield, England Phonetic Segmentation using psychoacoustic Speech Parameters 135 J .-P. Martens Electronics Laboratory, University of Gent, Belgium VIII Morphological Representation of Speech Knowledge for Automatic Speech Recognition Systems 141 M.J. Palakal, Concordia University Montreal, Canada Speaker-Independent Automatic Recognition of Plosive Sound in Letters and Digits 147 R. Cardin, McGill University Montreal, Canada A Real-Time Auditory Model: Description and Applications in an ASR Acoustic-Phonetic Front End 153 F. Gooding, I. Shaw, H. Mahdi University College of North Wales Bangor, UK A New Phonematic Approach to Speech Recognition 159 A.J. Rubio-Ayuso, J .M. Herrera-Garrido Univ ersidad de Granada, Spain Primary Perceptual units in Word Recognition 165 W. F. Sendlmeier Max-Planck-Inst. fur Psycholinguistik Nijmegen, Netherlands Context-Dependent Phone Markov Models for Speech Recognition 171 A. -M. Derouault , IBM France Scientific Center Paris, France Speech Recognition Based on speech Units 177 G. Zanellato, Facult e Polyt echnique de Mons, Belgium PART 2: Word Recognition INVITED PAPERS Mathematical Foundations of Hidden Markov Models 183 L.R. Rabiner, AT&T Bell Laboratories Murray Hill, USA Computer Recognition of Spoken Letters and Digits 207 R. de Mori, McGill University Montreal, Canada Recognition of Words in Very Large Vocabulary 235 P. Laface, Politecnico di '!brino, Italy G. Micca, R. Pieraccini, CSELT '!brino, Italy CONTRIBUTED PAPERS Isolated Word Recognition Using Hidden Markov Models 255 S. Euler, D. Wolf, Universitat Frankfurt, F.R. Germany Isolated Digit Recognition Using the Multi-Layer Perceptron 261 S.M. Peeling, R.K. Moore, A.P. Varga Royal Signals and Radar Establishment Malvern, UK Use of Procedural Knowledge for Spoken Letters and Digits Recognition 267 E. Merlo, McGill University Montreal, canada IX Real-Time Large Vocabulary Word Recognition via Diphone Spotting and Multiprocessor Implementation 273 C. Scagliola, A. Carossino, A.M. Colla, C. Favareto, P. Pedrazzi, D. Sciarra, C. Vicenzi Elsag S.p.A. Genova, Italy Speech Recognition With Difficult Dictionaries 279 F. casacuberta, E. Vidal Universidad Politecnica de valencia, Spain Recent Results on the Application of a Metric-Space Search Algorithm (AESA) to Multispeaker Data 285 E. Vidal, M.J. Lloret Universidad Politecnica de valencia, Spain Robust Features for Word Recognition 291 E.G. Schukat-Talamazzini, Universitat Erlangen, F.R. Germany statistical Analysis of Left-to-Right Parser for Word-Hypothesing 297 H. Hoge, E. Marschall, Siemens AGMiinchen, F.R. Germany overview of Speech Recognition in the 'SPICOS' System 305 H. Ney, D. Mergel, A. Noll, A. Paeseler Philips Forschungslaboratorium Hamburg, F .R. Germany An Experimental Environment for Generating Word Hypotheses in continuous Speech 311 S. Rilnzmann, T. Rilhn, H. Niemann Universitat Erlangen, F .R. Germany Application of the Error Correcting Grammatical Inference Method (ECGI)to Multi-Speaker Isolated Word Recognition 317 E. Vidal, N. Prieto, E. Sanchis, H. Rulot universidad Politecnica de valencia, Spain Multi-Speaker Experiments with the Morphic Generator Grammatical Inference Methodology 323 E. Vidal, E. Segarra, P. Garcia, I. Galiano Universidad Politecnica de Valencia, Spain A New Approach to '!emplate Selection for Speaker Independent Word Recognition 329 N. Yalabik, F. Yarman-Vural, A. Mansur Middle East 'lechnical University Ankara, '1\.1rkey Dynamic Spectral Adaptation of Automatic Speech Recognizers to New Speakers 335 G. Chollet, K. Choukri, Qi/RS Paris, France 'lbwards Speaker-Independent continuous Speech Recognition 339 K.F. Lee, Carnegie-Mellon University pittsburgh, USA Evaluating Speech Recognizers and Data Bases 345 G. Chollet, C. Montacie, CNRS Paris, France x PART 3: Linguistic Processing INVITED PAPERS On-Line Interpretation in Speech Understanding and Dialogue Systems 349 H. Bunt, Tilburg University, The Netherlands Semantic Processing in Speech Understanding 397 G. Thurmair, Siemens At:. MUnchen, F .R. Germany Knowledge Based Systems for Speech Understanding 421 G. Sagerer, F. KUnunert, Universitat Erlangen, F.R. Germany CONTRIBUTED PAPERS Recognition of Speaker-Dependent continuous Speech with KEAL-NEVEZH 459 G. Mercier, A. Cozannet, J. vaissiere, (NET Lannion, France Modification of Earley's Algorithm for Speech Recognition 465 A. Paeseler Philips Forschungslaboratorium Hamburg, F .R. Germany Expectation-Based Speech Recognition 473 J. Mudler, E. Paulus 'l.echnische Universitat Braunschweig, F .R. Germany Merging Acoustics and Linguistics in speech Understanding 479 G.T. Niedermair, Siemens At:. MUnchen, F.R. Germany Using Semantic and Pragmatic Knowledge for the Interpretation of Syntactic constituents 485 U. Ehrlich, H. Niemann, Universitat Erlangen, F .R. Germany Task-Oriented Dialogue Processing in Human-Computer Voice communication 491 N. Carbonell, J .M. Pierrel CRIN/INRIA-Lorraine, Vandoeuvre cedex, France Experimentation in the Specification of an Oral Dialogue 497 M. Guyomard, J. Siroux, (NET Lannion, France List of Participants 503 Author Index 507 Subject Index 508

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.