
Document Analysis Systems: Theory and Practice: Third IAPR Workshop, DAS'98, Nagano, Japan, November 1998: Selected Papers

386 Pages·1999·11.2 MB·English


Lecture Notes in Computer Science 1655
Edited by G. Goos, J. Hartmanis and J. van Leeuwen

Berlin Heidelberg New York Barcelona Hong Kong London Milan Paris Singapore Tokyo

Seong-Whan Lee, Yasuaki Nakano (Eds.)
Document Analysis Systems: Theory and Practice
Third IAPR Workshop, DAS’98
Nagano, Japan, November 4-6, 1998
Selected Papers

Series Editors
Gerhard Goos, Karlsruhe University, Germany
Juris Hartmanis, Cornell University, NY, USA
Jan van Leeuwen, Utrecht University, The Netherlands

Volume Editors
Seong-Whan Lee
Korea University, Center for Artificial Vision Research
Anam-dong, Seongbuk-ku, 136-701 Seoul, Korea
E-mail: [email protected]

Yasuaki Nakano
Shinshu University, Department of Information Engineering
500 Wakasato, 380-8553 Nagano, Japan
E-mail: [email protected]

Cataloging-in-Publication data applied for

Die Deutsche Bibliothek - CIP-Einheitsaufnahme
Document analysis systems: theory and practice; third IAPR workshop; selected papers / DAS’98, Nagano, Japan, November 4-6, 1998. Seong-Whan Lee; Yasuaki Nakano (ed.). - Berlin; Heidelberg; New York; Barcelona; Hong Kong; London; Milan; Paris; Singapore; Tokyo: Springer, 1999
(Lecture notes in computer science; Vol. 1655)
ISBN 3-540-66507-2

CR Subject Classification (1998): I.7.5, I.5, I.4, I.7
ISSN 0302-9743
ISBN 3-540-66507-2 Springer-Verlag Berlin Heidelberg New York

This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer-Verlag. Violations are liable for prosecution under the German Copyright Law.
© Springer-Verlag Berlin Heidelberg 1999
Printed in Germany
Typesetting: Camera-ready by author
SPIN 10704062 06/3142 – 543210
Printed on acid-free paper

Preface

Recently, there has been an increased interest in the research and development of techniques for components of complete document analysis systems. In recognition of this trend, a series of workshops on Document Analysis Systems commenced in 1994, under the leadership of Henry Baird. The first workshop, held in Kaiserslautern, Germany, in October, 1994, was chaired by Andreas Dengel and Larry Spitz. The second workshop on Document Analysis Systems was held in Malvern, PA, USA, in October, 1996, chaired by Jonathan J. Hull and Suzanne Liebowitz Taylor.

The DAS workshop has been one of the most prestigious technical meetings, bringing together a large number of scientists and engineers from all over the world to express their innovative ideas and report on their latest achievements in the area of document analysis systems.

The papers in this special book edition were rigorously selected from the Third IAPR Workshop on Document Analysis Systems (DAS’98), held in Nagano, Japan, on 4-6 November 1998. It is worth mentioning that the papers were chosen for their original and substantial contributions to the workshop theme and this special book edition. From among the 53 papers that were presented by authors from 11 countries at the DAS’98 after critical reviews by at least three experts, we carefully selected 29 papers for this special book edition. Most of the contributions in this edition have been expanded or extensively revised to include helpful discussions, suggestions, or comments made during the workshop.

The papers deal with a wide range of research on document analysis systems, such as design principles, theoretical analysis, implementation techniques, and experimental results.
In keeping with the main topics of document analysis systems research, the papers contributed to this special book edition are organized into five sections:

Part I: Document Image Compression and Retrieval
Part II: Document Structure Analysis
Part III: Handwriting Recognition
Part IV: Document Image Analysis
Part V: Document Analysis Systems

This book is primarily addressed to researchers involved in document analysis systems who wish to increase their knowledge and expertise regarding these topics. It furthermore aims at the audience of those scientists who work in the various disciplines represented in the sections of the book.

We would like to express our sincere appreciation to all the contributors and reviewers; this special edition would not have been publishable without them. The reviewers contributed generous amounts of time in the review process. They include H.S. Baird (Xerox Palo Alto Research Center, USA), A. Belaïd (UMR LORIA, France), H. Bunke (University of Bern, Switzerland), Y. Choi (Sookmyung Women's University, Korea), A. Dengel (German Research Center for AI, Germany), D. Doermann (University of Maryland, USA), T. Ejima (Kyushu Institute of Technology, Japan), H. Fujisawa (Central Research Laboratory, Hitachi, Ltd., Japan), J.J. Hull (Ricoh California Research Center, USA), Y. Ishitani (Toshiba Corporation, Japan), J. Kanai (Panasonic Information and Networking Technologies Laboratory, USA), G. Kim (Sogang University, Korea), J. Kim (Yonsei University, Korea), S.H. Kim (Chonnam National University, Korea), F. Kimura (Mie University, Japan), S.-W. Lee (Korea University, Korea), Y. Lee (Yonsei University, Korea), G. Maderlechner (Siemens AG, Germany), Y. Nakano (Shinshu University, Japan), H. Nishida (Ricoh, Japan), A.L. Spitz (Document Recognition Technologies, USA), C.Y. Suen (Concordia University, Canada), M. Suzuki (Kyushu University, Japan), Y.Y. Tang (Hong Kong Baptist University, Hong Kong), and K. Yamamoto (Gifu University, Japan).

Finally, we would also like to acknowledge all authors and participants of the workshop who facilitated the communication between investigators and created an insightful and stimulating environment. Our appreciation also goes to A. Hofmann, Editor at Springer-Verlag, for his patience and help in guiding us through this task.

We hope that this book will promote further research in document analysis systems.

August 1999
Seong-Whan Lee and Yasuaki Nakano

Table of Contents

Part I: Document Image Compression and Retrieval

Measuring the Robustness of Character Shape Coding
    A. L. Spitz, P. Marks
Group 4 Compressed Document Matching
    D.-S. Lee, J. J. Hull
Restoration of Decorative Headline Images for Document Retrieval
    T. Amano
Document Image Analysis Using a New Compression Algorithm
    S. Deng, S. Latifi, J. Kanai

Part II: Document Structure Analysis

A General Approach to Quality Evaluation of Document Segmentation Results
    M. Thulke, V. Märgner, A. Dengel
Form Analysis by Neural Classification of Cells
    Y. Belaïd, A. Belaïd
A Formal Approach to Textons and Its Application to Font Style Detection
    A. Schreyer, P. Suda, G. Maderlechner
A Statistical Method for an Automatic Detection of Form Types
    S. Kebairi, B. Taconet, A. Zahour, S. Ramdane
Structure Analysis of Low Resolution Fax Cover Pages
    Y.-K. Lim, H.-J. Kang, C. Ahn, S.-W. Lee

Part III: Handwriting Recognition

Lexical Search Approach for Character-String Recognition
    M. Koga, R. Mine, H. Sako, H. Fujisawa
A Segmentation Method for Touching Handwritten Japanese Characters
    H. Nishimura, H. Ikeda, Y. Nakano
Cursive Handwritten Word Recognition by Integrating Multiple Classifiers
    K. Maruyama, M. Kobayashi, H. Yamada, Y. Nakano
Segmentation of Touching Characters in Formulas
    M. Okamoto, S. Sakaguchi, T. Suzuki
The AddressScript™ Recognition System for Handwritten Envelopes
    A. Filatov, V. Nikitin, A. Volgunin, P. Zelinsky

Part IV: Document Image Analysis

Sorting and Recognizing Cheques and Financial Documents
    C. Y. Suen, K. Liu, N. W. Strathy
A System for the Automated Reading of Check Amounts - Some Key Ideas
    G. Kaufmann, H. Bunke
A Fast Japanese Word Extraction with Classification to Similarly-Shaped Character Categories and Morphological Analysis
    M. Ozaki, K. Itonori
A Layout-Free Method for Extracting Elements from Document Images
    T. Kochi, T. Saitoh
Text-Line Extraction as Selection of Paths in the Neighbor Graph
    K. Kise, M. Iwata, A. Dengel, K. Matsumoto
Table Structure Extraction from Form Documents Based on Gradient-Wavelet Scheme
    D. Xi, S.-W. Lee
The T-Recs Table Recognition and Analysis System
    T. Kieninger, A. Dengel

Part V: Document Analysis Systems

Document Analysis Systems Development and Representation through the Object-Process Methodology
    D. Dori
Precise Table Recognition by Making Use of Reference Tables
    C. Wenzel, W. Tersteegen
