ebook img

Implementation and Application of Automata: 6th International Conference, CIAA 2001 Pretoria, South Africa, July 23–25, 2001 Revised Papers PDF

298 Pages·2002·5.334 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Implementation and Application of Automata: 6th International Conference, CIAA 2001 Pretoria, South Africa, July 23–25, 2001 Revised Papers

Lecture Notes in Computer Science 2494 EditedbyG.Goos,J.Hartmanis,andJ.vanLeeuwen 3 Berlin Heidelberg NewYork Barcelona HongKong London Milan Paris Tokyo Bruce W. Watson Derick Wood (Eds.) Implementation and Application of Automata 6th International Conference, CIAA 2001 Pretoria, South Africa, July 23-25, 2001 Revised Papers 1 3 SeriesEditors GerhardGoos,KarlsruheUniversity,Germany JurisHartmanis,CornellUniversity,NY,USA JanvanLeeuwen,UtrechtUniversity,TheNetherlands VolumeEditors BruceW.Watson UniversityofPretoria,DepartmentofComputerScience LynwoodRoad,Pretoria0002,SouthAfrica E-mail:[email protected] DerickWood HongKongUniversityofScienceandTechnology DepartmentofComputerScience ClearwaterBay,Kowloon,HongKong E-mail:[email protected] Cataloging-in-PublicationDataappliedfor AcatalogrecordforthisbookisavailablefromtheLibraryofCongress BibliographicinformationpublishedbyDieDeutscheBibliothek DieDeutscheBibliothekliststhispublicationintheDeutscheNationalbibliographie; detailedbibliographicdataisavailableintheInternetat<http://dnb.ddb.de>. CRSubjectClassification(1998):F.1.1,F.4.3,F.3,F.2 ISSN0302-9743 ISBN3-540-00400-9Springer-VerlagBerlinHeidelbergNewYork Thisworkissubjecttocopyright.Allrightsarereserved,whetherthewholeorpartofthematerialis concerned,specificallytherightsoftranslation,reprinting,re-useofillustrations,recitation,broadcasting, reproductiononmicrofilmsorinanyotherway,andstorageindatabanks.Duplicationofthispublication orpartsthereofispermittedonlyundertheprovisionsoftheGermanCopyrightLawofSeptember9,1965, initscurrentversion,andpermissionforusemustalwaysbeobtainedfromSpringer-Verlag.Violationsare liableforprosecutionundertheGermanCopyrightLaw. Springer-VerlagBerlinHeidelbergNewYork amemberofBertelsmannSpringerScience+BusinessMediaGmbH http://www.springer.de ©Springer-VerlagBerlinHeidelberg2002 PrintedinGermany Typesetting:Camera-readybyauthor,dataconversionbyDA-TeXGerdBlumenstein Printedonacid-freepaper SPIN:10870693 06/3142 543210 Foreword The Sixth International Conference on Implementation and Application of Au- tomata (CIAA 2001) — the first one held in the southern hemisphere — was heldatthe UniversityofPretoriainPretoria,SouthAfrica,on23–25July2001. This volume of Springer’s Lecture Notes in Computer Science contains all the papers (including the invited talk by Gregor v. Bochmann) that were pre- sentedatCIAA2001,aswellasanexpandedversionofoneofthe posterpapers displayed during the conference. Theconferenceaddressedtheissuesinautomataapplicationandimplementa- tion.Thetopicsofthepaperspresentedinthisconferencerangedfromautomata applications in software engineering, natural language and speech recognition, andimage processing,to new representationsandalgorithmsforefficient imple- mentation of automata and related structures. Automata theory is one of the oldest areas in computer science. Research in automata theory has been motivated by its applications since its early stages of development. In the 1960sand 1970s,automata researchwas motivated heavily by problems arisingfromcompiler construction,circuitdesign, stringmatching, etc. In recent years, many new applications of automata have been found in variousareasofcomputerscienceaswellasinotherdisciplines.Examplesofthe new applications include statecharts in object-oriented modeling, finite trans- ducers in natural language processing, and nondeterministic finite-state models incommunicationprotocols.Manyofthenewapplicationscannotsimplyutilize the existing models and algorithms in automata theory to solve their problems. New models, or modifications of the existing models, are needed to satisfy their requirements. Also, the sizes of the typical problems in many of the new appli- cationsareastronomicallylargerthanthoseusedinthetraditionalapplications. New algorithmsandnewrepresentationsofautomataarerequiredtoreducethe time and space requirements of the computation. The CIAA conference series provides a forum for the new problems and challenges.Intheseconferences,boththeoreticalandpracticalresultsrelatedto the application and implementation of automata were presented and discussed, and software packages and toolkits were demonstrated. The participants of the conference series were from both research institutions and industry. Wethankalloftheprogramcommitteemembersandrefereesfortheirefforts in refereeing andselecting papers.This volume was edited with muchhelp from NanetteSaesandHannekeDriever,whiletheconferenceitselfwasrunsmoothly with the help of Elmarie Willemse, Nanette Saes, and Theo Koopman. VI Foreword We also wishto thank the South AfricanNRF (for funding airfares)and the Department of Computer Science, University of Pretoria,for their financialand logisticsupportoftheconference.WealsothanktheeditorsoftheLectureNotes inComputer Science seriesandSpringer-Verlag,inparticularAnna Kramer,for their help in publishing this volume. October 2002 Bruce W. Watson Derick Wood CIAA 2001 Program Committee Bernard Boigelot Universit´e de Liege, Belgium Jean-Marc Champarnaud Universit´e de Rouen, France Maxime Crochemore University of Marne-la-Vall´ee,France Oscar Ibarra University of California at Santa Barbara,USA Lauri Karttunen Xerox Palo Alto Research Center, USA Nils Klarlund AT&T Laboratories,USA Denis Maurel Universit´e de Tours, France Mehryar Mohri AT&T Laboratories,USA Jean-Eric Pin Universit´e Paris 7, France Kai Salomaa Queen’s University, Canada Helmut Seidl Trier University, Germany Bruce Watson (Chair) University of Pretoria,South Africa Eindhoven University, The Netherlands Derick Wood (Co-chair) Hong Kong University of Science and Technology, China Sheng Yu University of Western Ontario, Canada Table of Contents Using Finite State Technology in Natural Language Processing of Basque ...1 In˜aki Alegria, Maxux Aranzabe, Nerea Ezeiza, Aitzol Ezeiza, and Ruben Urizar Cascade Decompositions are Bit-Vector Algorithms ........................13 Anne Bergeron and Sylvie Hamel Submodule Construction and Supervisory Control: A Generalization .......27 Gregor v. Bochmann Counting the Solutions of Presburger Equations without Enumerating Them ...............................................40 Bernard Boigelot and Louis Latour Brzozowski’s Derivatives Extended to Multiplicities ........................52 Jean-Marc Champarnaud and G´erard Duchamp Finite Automata for Compact Representation of Language Models in NLP ...............................................65 Jan Daciuk and Gertjan van Noord Past Pushdown Timed Automata ..........................................74 Zhe Dang, Tevfik Bultan, Oscar H. Ibarra, and Richard A. Kemmerer Scheduling Hard Sporadic Tasks by Means of Finite Automata and Generating Functions .............................87 Jean-Philippe Dubernard and Dominique Geniet Bounded-Graph Construction for Noncanonical Discriminating-Reverse Parsers ..........................101 Jacques Farr´e and Jos´e Fortes Ga´lvez Finite-State Transducer Cascade to Extract Proper Names in Texts .......115 Nathalie Friburger and Denis Maurel Is this Finite-State Transducer Sequentiable? .............................125 Tama´s Gaa´l Compilation Methods of Minimal Acyclic Finite-State Automata for Large Dictionaries .....................................................135 Jorge Gran˜a, Fco. Mario Barcala, and Miguel A. Alonso Bit Parallelism– NFA Simulation .........................................149 Jan Holub Improving Raster Image Run-Length Encoding Using Data Order .........161 Markus Holzer and Martin Kutrib X Table of Contents Enhancements of Partitioning Techniques for Image Compression Using Weighted Finite Automata .................177 Frank Katritzke, Wolfgang Merzenich, and Michael Thomas Extraction of (cid:1)-Cycles from Finite-State Transducers ......................190 Andr´e Kempe On the Size of Deterministic Finite Automata ............................202 Boˇrivoj Melichar and Jan Skryja Crystal Lattice Automata ................................................214 Jim Morey, Kamran Sedig, Robert E. Mercer, and Wayne Wilson Minimal Adaptive Pattern-Matching Automata for Efficient Term Rewriting ..............................................221 Nadia Nedjah and Luiza de Macedo Mourelle Adaptive Rule-Driven Devices - General Formulation and Case Study .....234 Joa˜o Jos´e Neto TypographicalNearest-Neighbor Search in a Finite-State Lexicon and Its Application to Spelling Correction ................................251 Agata Savary On the Software Design of Cellular Automata Simulators for EcologicalModeling ...................................................261 Yuri Velinov Random Number Generation with ⊕-NFAs ...............................263 Lynette van Zijl Supernondeterministic Finite Automata ..................................274 Lynette van Zijl Author Index ...........................................................289 Using Finite State Technology in Natural Language Processing of Basque Iæaki Alegria, Maxux Aranzabe, Nerea Ezeiza, Aitzol Ezeiza, and Ruben Urizar Ixa taldea, University of the Basque Country, Spain [email protected] Abstract. This paper describes the components used in the design and implementation of NLP tools for Basque. These components are based on finite state technology and are devoted to the morphological analysis of Basque, an agglutinative pre-Indo-European language. We think that our design can be interesting for the treatment of other languages. The main components developed are a general and robust morphological analyser/generator and a spelling checker/corrector for Basque named Xuxen. The analyser is a basic tool for current and future work on NLP of Basque, such as the lemmatiser/tagger Euslem, an Intranet search engine or an assistant for verse-making. 1 Introduction This paper describes the components used in the design and implementation of NLP tools for Basque. These components are based on finite state technology and are devoted to the morphological analysis of Basque, an agglutinative pre-Indo-European language. We think that our design can be interesting for the treatment of other languages. The main components developed are a general and robust morphological analyser/generator (Alegria et al. 1996) and a spelling checker/corrector for Basque named Xuxen (Aldezabal et al. 1999). The analyser is a basic tool for current and future work on NLP of Basque, for example the lemmatiser/tagger Euslem (Ezeiza et al. 1998), an Intranet search engine (Aizpurua et al. 2000) or an assistant for verse- making (Arrieta et al. 2000) These tools are implemented using lexical transducers. A lexical transducer (Karttunen 1994) is a finite-state automaton that maps inflected surface forms to lexical forms, and can be seen as an evolution of two-level morphology (Koskenniemi 1983; Sproat 1992) where the use of diacritics and homographs can be avoided and the intersection and composition of transducers is possible. In addition, the process is very fast and the transducer for the whole morphological description can be compacted in less than one Mbyte. The tool used for the implementation is the fst library of Inxight1 (Karttunen and Bessley 1992; Karttunen 1993; Karttunen et al. 1996). Similar compilers have been developed by other groups (Mohri 1997; Daciuk et al. 1998). 1 Inxight Software, Inc., a Xerox Enterprise Company (www.inxight.com) B.W. Watson and D. Wood (Eds.): CIAA 2001, LNCS 2494, pp. 1-12, 2002.  Springer-Verlag Berlin Heidelberg 2002

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.