ebook img

Data Mining: Practical Machine Learning Tools and Techniques PDF

534 Pages·2016·33.738 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Data Mining: Practical Machine Learning Tools and Techniques

Data Mining Practical Machine Learning Tools and Techniques FOURTH EDITION Ian H. Witten University of Waikato, Hamilton, New Zealand Eibe Frank University of Waikato, Hamilton, New Zealand Mark A. Hall University of Waikato, Hamilton, New Zealand Christopher J. Pal Polytechnique Montréal, Montreal, Quebec, Canada 2 Table of Contents Cover image Title page Copyright List of Figures List of Tables Preface Updated and Revised Content Acknowledgments Part I: Introduction to data mining Chapter 1. What’s it all about? Abstract 1.1 Data Mining and Machine Learning 1.2 Simple Examples: The Weather Problem and Others 1.3 Fielded Applications 1.4 The Data Mining Process 1.5 Machine Learning and Statistics 1.6 Generalization as Search 1.7 Data Mining and Ethics 1.8 Further Reading and Bibliographic Notes Chapter 2. Input: Concepts, instances, attributes Abstract 2.1 What’s a Concept? 2.2 What’s in an Example? 2.3 What’s in an Attribute? 2.4 Preparing the Input 2.5 Further Reading and Bibliographic Notes Chapter 3. Output: Knowledge representation Abstract 3 3.1 Tables 3.2 Linear Models 3.3 Trees 3.4 Rules 3.5 Instance-Based Representation 3.6 Clusters 3.7 Further Reading and Bibliographic Notes Chapter 4. Algorithms: The basic methods Abstracts 4.1 Inferring Rudimentary Rules 4.2 Simple Probabilistic Modeling 4.3 Divide-and-Conquer: Constructing Decision Trees 4.4 Covering Algorithms: Constructing Rules 4.5 Mining Association Rules 4.6 Linear Models 4.7 Instance-Based Learning 4.8 Clustering 4.9 Multi-instance Learning 4.10 Further Reading and Bibliographic Notes 4.11 Weka Implementations Chapter 5. Credibility: Evaluating what’s been learned Abstract 5.1 Training and Testing 5.2 Predicting Performance 5.3 Cross-Validation 5.4 Other Estimates 5.5 Hyperparameter Selection 5.6 Comparing Data Mining Schemes 5.7 Predicting Probabilities 5.8 Counting the Cost 5.9 Evaluating Numeric Prediction 5.10 The MDL Principle 5.11 Applying the MDL Principle to Clustering 5.12 Using a Validation Set for Model Selection 5.13 Further Reading and Bibliographic Notes Part II: More advanced machine learning schemes Part II. More advanced machine learning schemes Chapter 6. Trees and rules 4 Abstract 6.1 Decision Trees 6.2 Classification Rules 6.3 Association Rules 6.4 Weka Implementations Chapter 7. Extending instance-based and linear models Abstract 7.1 Instance-Based Learning 7.2 Extending Linear Models 7.3 Numeric Prediction With Local Linear Models 7.4 Weka Implementations Chapter 8. Data transformations Abstracts 8.1 Attribute Selection 8.2 Discretizing Numeric Attributes 8.3 Projections 8.4 Sampling 8.5 Cleansing 8.6 Transforming Multiple Classes to Binary Ones 8.7 Calibrating Class Probabilities 8.8 Further Reading and Bibliographic Notes 8.9 Weka Implementations Chapter 9. Probabilistic methods Abstract 9.1 Foundations 9.2 Bayesian Networks 9.3 Clustering and Probability Density Estimation 9.4 Hidden Variable Models 9.5 Bayesian Estimation and Prediction 9.6 Graphical Models and Factor Graphs 9.7 Conditional Probability Models 9.8 Sequential and Temporal Models 9.9 Further Reading and Bibliographic Notes 9.10 Weka Implementations Chapter 10. Deep learning Abstract 10.1 Deep Feedforward Networks 10.2 Training and Evaluating Deep Networks 5 10.3 Convolutional Neural Networks 10.4 Autoencoders 10.5 Stochastic Deep Networks 10.6 Recurrent Neural Networks 10.7 Further Reading and Bibliographic Notes 10.8 Deep Learning Software and Network Implementations 10.9 WEKA Implementations Chapter 11. Beyond supervised and unsupervised learning Abstract 11.1 Semisupervised Learning 11.2 Multi-instance Learning 11.3 Further Reading and Bibliographic Notes 11.4 WEKA Implementations Chapter 12. Ensemble learning Abstract 12.1 Combining Multiple Models 12.2 Bagging 12.3 Randomization 12.4 Boosting 12.5 Additive Regression 12.6 Interpretable Ensembles 12.7 Stacking 12.8 Further Reading and Bibliographic Notes 12.9 WEKA Implementations Chapter 13. Moving on: applications and beyond Abstract 13.1 Applying Machine Learning 13.2 Learning From Massive Datasets 13.3 Data Stream Learning 13.4 Incorporating Domain Knowledge 13.5 Text Mining 13.6 Web Mining 13.7 Images and Speech 13.8 Adversarial Situations 13.9 Ubiquitous Data Mining 13.10 Further Reading and Bibliographic Notes 13.11 WEKA Implementations Appendix A. Theoretical foundations 6 A.1 Matrix Algebra A.2 Fundamental Elements of Probabilistic Methods Appendix B. The WEKA workbench B.1 What’s in WEKA? B.2 The package management system B.3 The Explorer B.4 The Knowledge Flow Interface B.5 The Experimenter References Index 7 Copyright Morgan Kaufmann is an imprint of Elsevier 50 Hampshire Street, 5th Floor, Cambridge, MA 02139, United States Copyright © 2017, 2011, 2005, 2000 Elsevier Inc. All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or any information storage and retrieval system, without permission in writing from the publisher. Details on how to seek permission, further information about the Publisher’s permissions policies and our arrangements with organizations such as the Copyright Clearance Center and the Copyright Licensing Agency, can be found at our website: www.elsevier.com/permissions. This book and the individual contributions contained in it are protected under copyright by the Publisher (other than as may be noted herein). Notices Knowledge and best practice in this field are constantly changing. As new research and experience broaden our understanding, changes in research methods, professional practices, or medical treatment may become necessary. Practitioners and researchers must always rely on their own experience and knowledge in evaluating and using any information, methods, compounds, or experiments described herein. In using such information or methods they should be mindful of their own safety and the safety of others, including parties for whom they have a professional responsibility. To the fullest extent of the law, neither the Publisher nor the authors, contributors, or editors, assume any liability for any injury and/or damage to persons or property as a matter of products liability, negligence or otherwise, or from any use or operation of any methods, products, instructions, or ideas contained in the material herein. British Library Cataloguing-in-Publication Data A catalogue record for this book is available from the British Library Library of Congress Cataloging-in-Publication Data A catalog record for this book is available from the Library of Congress ISBN: 978-0-12-804291-5 For Information on all Morgan Kaufmann publications visit our website at https://www.elsevier.com 8 Publisher: Todd Green Acquisition Editor: Tim Pitts Editorial Project Manager: Charlotte Kent Production Project Manager: Nicky Carter Designer: Matthew Limbert Typeset by MPS Limited, Chennai, India 9 List of Figures Figure 1.1 Rules for the contact lens data. 13 Figure 1.2 Decision tree for the contact lens data. 14 Figure 1.3 Decision trees for the labor negotiations data. 18 Figure 1.4 Life cycle of a data mining project. 29 Figure 2.1 A family tree and two ways of expressing the sister-of relation. 48 Figure 2.2 ARFF file for the weather data. 58 Figure 2.3 Multi-instance ARFF file for the weather data. 60 Figure 3.1 A linear regression function for the CPU performance data. 69 Figure 3.2 A linear decision boundary separating Iris setosas from Iris versicolors. 70 Figure 3.3 Constructing a decision tree interactively: (A) creating a rectangular test involving petallength and petalwidth; (B) the resulting 73 (unfinished) decision tree. Figure 3.4 Models for the CPU performance data: (A) linear regression; (B) regression tree; (C) model tree. 74 Figure 3.5 Decision tree for a simple disjunction. 76 Figure 3.6 The exclusive-or problem. 77 Figure 3.7 Decision tree with a replicated subtree. 77 Figure 3.8 Rules for the iris data. 81 Figure 3.9 The shapes problem. 82 Figure Different ways of partitioning the instance space. 86 3.10 Figure Different ways of representing clusters. 88 3.11 Figure 4.1 Pseudocode for 1R. 93 Figure 4.2 Tree stumps for the weather data. 106 Figure 4.3 Expanded tree stumps for the weather data. 108 Figure 4.4 Decision tree for the weather data. 109 Figure 4.5 Tree stump for the ID code attribute. 111 Figure 4.6 Covering algorithm: (A) covering the instances; (B) decision tree for the same problem. 113 Figure 4.7 The instance space during operation of a covering algorithm. 115 Figure 4.8 Pseudocode for a basic rule learner. 118 Figure 4.9 (A) Finding all item sets with sufficient coverage; (B) finding all sufficiently accurate association rules for a k-item set. 127 Figure Logistic regression: (A) the logit transform; (B) example logistic regression function. 130 4.10 Figure The perceptron: (A) learning rule; (B) representation as a neural network. 132 4.11 Figure The Winnow algorithm: (A) unbalanced version; (B) balanced version. 134 4.12 Figure A kD-tree for four training instances: (A) the tree; (B) instances and splits. 137 4.13 Figure Using a kD-tree to find the nearest neighbor of the star. 137 4.14 Figure Ball tree for 16 training instances: (A) instances and balls; (B) the tree. 139 4.15 Figure Ruling out an entire ball (gray) based on a target point (star) and its current nearest neighbor. 140 4.16 Figure Iterative distance-based clustering. 143 4.17 Figure A ball tree: (A) two cluster centers and their dividing line; (B) corresponding tree. 145 4.18 Figure Hierarchical clustering displays. 149 10

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.