ebook img

From Data to Knowledge: Theoretical and Practical Aspects of Classification, Data Analysis, and Knowledge Organization PDF

471 Pages·1996·18.326 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview From Data to Knowledge: Theoretical and Practical Aspects of Classification, Data Analysis, and Knowledge Organization

Studies in Classification, Data Analysis, and Knowledge Organization Managing Editors Editorial Board H. H. Bock, Aachen W. H. E. Day, St. John's o. Opitz, Augsburg E. Diday, Paris M. Schader, Mannheim A. Ferligoj, Ljubljana W. Gaul, Karlsruhe J. C. Gower, Harpenden D. J. Hand, Milton Keynes P. Ihm, Marburg J. Meulmann, Leiden S. Nishisato, Toronto F. J. Radermacher, Ulm R. Wille, Darmstadt Springer Berlin Heidelberg New York Barcelona Budapest Hong Kong London Milan Paris Santa Clara Singapore Tokyo Titles in the Series H.-H. Bock and P. Ihm (Eds.) Classification, Data Analysis, and Knowledge Organization M. Schader (Ed.) Analyzing and Modeling Data and Knowledge o. Opitz, B. Lausen, and R. Klar (Eds.) Information and Classification H.-H. Bock, W. Lenski, and M. M. Richter (Eds.) Information Systems and Data Analysis E. Diday, Y. Lechevallier, M. Schader, P. Bertrand, and B. Burtschy (Eds.) New Approaches in Classification and Data Analysis Wolfgang Gaul · Dietmar Pfeifer (Editors) From Data to Knowledge Theoretical and Practical Aspects of Classification, Data Analysis, and Knowledge Organization With 123 Figures and 57 Tables , Springer Professor Dr. Wolfgang Gaul Universitat Karlsruhe (TH) Institut fUr Entscheidungstheorie und Unternehmensforschung Postfach 6980 76128 Karlsruhe, Germany Professor Dr. Dietmar Pfeifer Universitat Oldenburg FB6 (Mathematik) Ammerlander HeerstraBe 114-118 26129 Oldenburg, Germany Cataloging-in-Publication Data applied for Die Deutsche Bibliothek -CIP-Einheitsaufnahme From data to knowledse : theoretical and practical aspects of classification, data analysis, and knowledge organization; with 57 tables I Wolfgang Gaul; Dietmar Pfeifer (ed.). -Berlin; Heidelberg; New York; Barcelona; Budapest; Hong Kong; London; Milan; Paris; Santa Clara; Singapore; Tokyo: Springer, 1995 (Studies in classification. data analysis. and knowledge orglUlization) ISBN -13: 978-3-540-60354-2 e-ISBN -13: 978-3-642-79999-0 DOl: 10.1007/978-3-642-79999-0 This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in other ways, and storage in data banks. Duplication of this publication or parts thereof is only permitted under the provisions of the German Copyright Law of September 9, 1965, in its version of June 24, 1985, and a copyright fee must always be paid. Violations fall under the prosecution act of the German Copyright Law. © Springer-Verlag Berlin· Heidelberg 1996 The use of registered names, trademarks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. 43/2202-543210-Printed on acid-free paper PREFACE Selected papers presented at the 18th Annual Conference of th~ German Classification Society GfKl (Gesellschaft fUr Klassifikation) are contained in this volume of 'Studies in Classification, Data Analysis, and Knowledge Or ganization'. The conference took place at the University of Oldenburg in 1994 under the general subject "From Data to Knowledge" and provided an international forum for participants from theory and practice. The interdisciplinary character of GfKI - although aspects concerning clas sification and related areas of data analysis are in the center of interest of the activities of the society - is one of the reasons that within a process entitled "From Data to Knowledge" emphasis was laid as well on theoret ical contributions as on applications in various fields including information systems and knowledge organization. More than 330 participants could choose their individual program from more than 130 presentations included in the scientific program of the conference. The selected papers of this volume are divided into the following major sec tions: Plenary and Semi Plenary Presentations Classification Related Results and Other Aspects of Data Analysis Spatial Data Analysis Applications in Economics Applications in Linguistics Applications in Medicine and Biology - Information Systems and Knowledge Organization Within the sections the contributions are listed in alphabetical order with respect to the authors' names. Besides the plenary and semi plenary presen tations which demonstrate the magnitude of the different scientific directions originating in classification and related areas the arrangement of the papers in the sections mentioned gives an overview of the main topics tackled at the conference. As most contributions contain aspects from different areas an unambiguous assignment of papers to sections is not possible in all cases. However, we appreciate that many contributions show in their application parts that the activities which GfKl supports make an important impact on various practical fields. This time, spatial data analysis was intentionally separated from other as pects of data analysis as the conference attracted a major group of re searchers from this area. In this context, we gratefully take the opportunity to acknowledge support by - Deutsche Forschungsgemeinschaft (DFG) Land Niedersachsen Carl von Ossietzky Universitat Oldenburg Universitatsgesellschaft Oldenburg e.V. VI which made it possible to hold the 18th Annual Conference of GfKl in the way described. The final version of this volume was put together at the University of Karls ruhe by Frank Wartenberg who did an extremely good job in organizing and supervising typesetting and reproduction of figures. From the students who helped, at least, Frau Marzena Gajowa and Lars Bjorner should be men tioned. Last but not least we thank Dr. Schuster from Springer-Verlag for excellent cooperation. Karlsruhe and Oldenburg, July 1995 W. Gaul and D. Pfeifer Contents Plenary and Semi Plenary Presentations Advances in Cluster Analysis Relevant to Marketing Research P. Arabie, L. Hubert ... . . . . . . . . . . . . . . . . 3 Representation of Statistical Structures, Classification and Prediction Using Multidimensional Scaling C. M. Cuadras, J. Fortiana, F. Oliva .............. 20 Null Models in Cluster Validation A. D. Gordon ............ . 32 Classifying Space and Analysing the Consequences: Spatial Analysis of Health Data R. Haining ......................... 45 An Ordinal Model for Cluster Analysis - 15 Years in Retrospect M. F. Janowitz ....................... 58 An Overview and Recent Developments in Dual Scaling S. Nishisato . . . . . . . . . . . . . . . . . . . . 73 Gibbs Sampling in AR Models with Random Walk Priors W. Polasek, S. Jin ......... . ...... . 86 Finding the Edge of a Poisson Fores't with Inside and Outside Observations: The Discriminant Analysis Point of View J. P. Rasson, M. Remon, FI. Henry . . . . . . . . . . 94 Spatial Fibre and Surface Processes - Siereological Estimations and Applications K. Sandau . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 Robustizing Mixture Analysis Using Model Weighting M. P. Windham ............... . 116 Evaluation of the First Life Table Published in 1888 in Japan K. Yajima ....................... . 124 Classification Related Results and Other Aspects of Data Analysis Incomplete Data Matrices and Tests on Randomly Missing Data U. Bankhofer ............................ 133 Vlll Valuations and Hierarchizations K. Biedermann, R. Wille .............. 141 Recent Developments in Multimode Clustering T. Eckes . ............... . . . 151 Gaussian Neural Networks Applied to the Cluster Analysis Problem C. Firmin, D. Hamad .................. . . 159 Graph-Theoretic Models for Testing the Homogeneity of Data E. Godehardt, A. Horsch ................ . . 167 Least Squares Multidimensional Scaling with Transformed Distances P. J. F. Groenen, J. de Leeuw, R. Mathar . . . . . . . . . . . . 177 Alternatives to Configural Frequency Analysis P. Ihm, I. Kuchler .......... . 186 Clustering Methods in Fuzzy Control F. Klawonn, R. Kruse 195 The Excess Mass Approach and the Analysis of Multi-Modality G. Sawitzki ............................. 203 Mode Extraction by Multivalue Morphology for Cluster Analysis A. Sbihi, J.-G. Postaire ...................... 212 On the Longest Edge of the Minimal Spanning Tree E. Tabakis ................. . . .... 222 Detection of Blocks in a Binary Matrix - A Bayesian Approach W. Vach, K. W. Alt ........................ 231 Spatial Data Analysis Detection of Spatial Discontinuities in Vegetation Data by a Moving Window Algorithm H. Balzter, P. Braun, W. Kohler ................. 243 Spatial Clustering of Neurons by Hypergeometric Disjoint Statistics J. Krauth .......................... .. 253 A New Approach of Regionalisation by Classifying Hydrological, Quantities K.-P. Nieschulz, O. Richter, B. Diekkriiger, A. Liicke . . . . . . 262 The Index-of-Dispersion Test Revisited D. Pfeifer, H. Ortleb, U. Schleier-Langer, H.-P. Baumer 270 Comparing Spatio-Temporal Patterns from Defaunization Experiments in Marine Ecology R. Wilhelm, A. Tecklenborg . . . . . . . . . . . . . . . . . . . . 278 IX Applications in Economics A Unifying Approach to Benefit Segmentation and Product Line Design Based on Rank Order Conjoint Data E. Aust, W. Gaul . . . . . . . . . . . . . . . . . . . . . . . . . . 289 Classification and Representation Using Conjoint Data D. Baier, W. Gaul ............... . 298 Overlapping Clustering of Statistical Software Packages for PC R. Lasch . ........................ . 308 Scenario Analysis with BASICS - Testing the Representativity of the Results of the Dynamic Probability Adjustment of Scenario Components with the Help of Classification Methods M. Missler-Behr .......................... 318 Analysis of Sales Data: A Neural Net Approach F. Wartenberg, R. Decker ....... . 326 Applications in Linguistics On the Definition of Inflection P. ten Hacken . . . . . . . . 337 Computer-Aided Analysis of Vocabulary Acquisition J. Liedtke .................. . 345 Features and Tags S. Naumann 353 Semantically Based Universal Definitions of Grammatical Agreement and Agreement Domain Universals: A Critical Evaluation P. Schmidt ............................ 360 Towards a Hypermedia, Multilingual, On-Line Resource System for LSP Users /Learners W. Wieden, K. Ronacher, A. Weiss, H. Goebl, K. Miiller . . . . 367 Applications in Medicine and Biology ANew Methodologic Look at Describing the Performance of Diagnostic Classification Procedures in Medicine O. Gefeller, H. Brenner ................... 379 Xmed-DD: From Document Processing to Systematic Information Storage W. Giere, A. Gregori, C. Luz ............... ... 387 Ribosomal RNA Phylogeny Derived from a Correlation Model of Sequence Evolution A. von Haeseler, M. SchOniger . . . . . . . . . . . . . . . . . . . 395 x SALBIDH2 - Modifications of the LBI-Method for Automated Lexicon-Based Indexing of Diagnoses K. Hofmann, B. Brigl, E. GlUck, R. Haux .... 404 Record Linkage of Anonymous Data by Control Numbers W. Thoben, H.-J. Appelrath, S. Sauer ..... . 412 Information Systems and Knowledge Organization Processing Partial Information in Decision Support Systems F. Dellmann .................. . 423 Consistency Conditions for the Classification in LIS j CI W. Lenski, M. M. Richter, E. Wette-Roch ............ 433 Using Hypertext for Information Retrieval in STEP jEXPRESS Schemata H. Liihrsen, H. Wedekind ..................... 442 Two Software Tools Supporting Enduser Oriented Information Retrieval in Physics ' L. Weisel, B. Diekmann ...................... 450 From Verbal Data to Practical Knowledge J. ZeIger . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 458 Index ....................................... 467

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.