
Lecture 2: Classification and Decision Trees – Sanjeev Arora, Elad Hazan

COS 402 – Machine Learning and Artificial Intelligence, Fall 2016
Lecture 2: Classification and Decision Trees
Sanjeev Arora, Elad Hazan
This lecture contains material from Tom Mitchell's text "Machine Learning", and slides adapted from David Sontag, Luke Zettlemoyer, Carlos Guestrin, and Andrew Moore.

Admin
• Enrolling in the course – priorities
• Prerequisites reminder (calculus, linear algebra, discrete math, probability, data structures & graph algorithms)
• Movie tickets
• Slides
• Exercise 1 – theory – due in one week, in class

Agenda
This course: basic principles of how to design machines/programs that act "intelligently."
• Recall the crow/fox example.
• Start with a simpler task – classification, learning from examples.
• Today: still naive AI. Next week: statistical learning theory.

Classification
Goal: find the best mapping from a domain (features) to an output (labels).
• Given a document (an email), classify it as spam or ham. Features = words, labels = {spam, ham}.
• Given a picture, classify whether it contains a chair. Features = bits in a bitmap image, labels = {chair, no chair}.
GOAL: an automatic machine that learns from examples.
Terminology for learning from examples:
• Set aside a "training set" of examples and train a classification machine on it.
• Test on a "test set" to see how well the machine performs on unseen examples.
(A minimal code sketch of this workflow follows the examples slide below.)

Classifying fuel efficiency
mpg    cylinders  displacement  horsepower  weight  acceleration  modelyear  maker
good   4          low           low         low     high          75to78     asia
bad    6          medium        medium      medium  medium        70to74     america
bad    4          medium        medium      medium  low           75to78     europe
bad    8          high          high        high    low           70to74     america
bad    6          medium        medium      medium  medium        70to74     america
bad    4          low           medium      low     medium        70to74     asia
bad    4          low           medium      low     low           70to74     asia
bad    8          high          high        high    low           75to78     america
...    ...        ...           ...         ...     ...           ...        ...
bad    8          high          high        high    low           70to74     america
good   8          high          medium      high    high          79to83     america
bad    8          high          high        high    low           75to78     america
good   4          low           low         low     low           79to83     america
bad    6          medium        medium      medium  high          75to78     america
good   4          medium        low         low     low           79to83     america
good   4          low           low         medium  high          79to83     america
bad    8          high          high        high    low           70to74     america
good   4          low           medium      low     medium        75to78     europe
bad    5          medium        medium      medium  medium        75to78     europe
• 40 data points
• Goal: predict MPG
• Need to find f : X → Y (Y = the mpg column, X = the remaining attributes)
• Discrete data (for now)
From the UCI repository (thanks to Ross Quinlan)

Decision trees for classification
• Why use decision trees?
• What is their expressive power?
• Can they be constructed automatically?
• How accurately can they classify?
• What makes a good decision tree besides accuracy on the given examples?

Decision trees for classification
Some real examples (from Russell & Norvig, Mitchell):
• BP's GasOIL system for separating gas and oil on offshore platforms – decision trees replaced a hand-designed rule system with 2500 rules. The C4.5-based system outperformed human experts and saved BP millions. (1986)
• Learning to fly a Cessna on a flight simulator by watching human experts fly the simulator. (1992)
• Can also learn to play tennis, analyze C-section risk, etc.
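To make the train/test terminology above concrete, here is a minimal sketch of the workflow on discrete MPG-style data. It uses scikit-learn's DecisionTreeClassifier, which the slides do not specify; the rows, the choice of three attribute columns, and the ordinal encoding are illustrative assumptions, not the actual 40-point UCI sample.

```python
# Minimal sketch of the train/test workflow on discrete MPG-style data.
# Rows and column choices are illustrative stand-ins for the UCI auto-mpg sample.
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import OrdinalEncoder
from sklearn.tree import DecisionTreeClassifier

# A few rows in the spirit of the table above: (cylinders, horsepower, maker) -> mpg label
X = [
    ["4", "low",    "asia"],
    ["6", "medium", "america"],
    ["8", "high",   "america"],
    ["4", "medium", "europe"],
    ["8", "high",   "america"],
    ["4", "low",    "america"],
]
y = ["good", "bad", "bad", "bad", "bad", "good"]

# scikit-learn trees need numeric inputs, so encode the discrete attributes.
enc = OrdinalEncoder()
X_enc = enc.fit_transform(X)

# Set aside a training set; hold out a test set of unseen examples.
X_train, X_test, y_train, y_test = train_test_split(
    X_enc, y, test_size=0.33, random_state=0)

clf = DecisionTreeClassifier()   # constructs the tree automatically from examples
clf.fit(X_train, y_train)        # "train a classification machine"
print("test accuracy:", clf.score(X_test, y_test))  # performance on unseen examples
```

One caveat on the design: scikit-learn trees split on numeric thresholds over the ordinal codes, rather than making one branch per attribute value as in the slides' multiway trees, so the learned structure will look slightly different even on identical data.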
Decision trees for classification
• Interpretable/intuitive – popular in medical applications because they mimic the way a doctor thinks
• Model discrete outcomes nicely
• Can be very powerful (expressive)
• C4.5 and CART are among the "top 10 data mining methods" – very popular
• This Thursday: we'll see why not…

Decision trees: f : X → Y
• Each internal node tests an attribute x_i
• One branch for each possible attribute value x_i = v
• Each leaf assigns a class y
• To classify an input x: traverse the tree from root to leaf, and output the label y at that leaf
• Human interpretable!
• Can we construct a tree automatically?
(A code sketch of this traversal appears after the next slide.)

[Figure: an example tree for the MPG data. The root tests Cylinders, with one branch per value 3, 4, 5, 6, 8: 3 → good, 5 → bad, 6 → bad; the 4-branch tests Maker (america → bad, asia → good, europe → good); the 8-branch tests Horsepower (low → bad, med → good, high → bad).]

Expressive power of DT
What kind of functions can they potentially represent?
• Boolean functions? F : {0,1}^3 → {0,1}

[Figure: a truth table listing F(X1, X2, X3) for all eight inputs, next to the complete decision tree that computes it – x1 at the root, x2 at the second level, x3 at the third, with one leaf per truth-table row.]
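As promised above, a minimal sketch of the root-to-leaf traversal from the "Decision trees: f : X → Y" slide, applied to the example MPG tree. The tree structure and leaf labels are read off the figure reconstruction above and should be treated as illustrative; the nested-dict encoding and the classify helper are assumptions made for this sketch, not part of the lecture.

```python
# A hand-built version of the example MPG tree, encoded as nested dicts.
# Structure and leaf labels follow the figure reconstruction above (illustrative).
tree = {
    "attribute": "cylinders",
    "branches": {
        "3": "good",
        "4": {"attribute": "maker",
              "branches": {"america": "bad", "asia": "good", "europe": "good"}},
        "5": "bad",
        "6": "bad",
        "8": {"attribute": "horsepower",
              "branches": {"low": "bad", "med": "good", "high": "bad"}},
    },
}

def classify(node, x):
    """Traverse from root to leaf; output the label stored at the leaf."""
    while isinstance(node, dict):                      # internal node: test an attribute x_i
        node = node["branches"][x[node["attribute"]]]  # follow the branch where x_i = v
    return node                                        # leaf: the assigned class y

print(classify(tree, {"cylinders": "4", "maker": "asia"}))       # -> good
print(classify(tree, {"cylinders": "8", "horsepower": "high"}))  # -> bad
```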

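For the expressive-power slide: a short sketch checking that a complete depth-3 tree, with one truth-table entry stored at each of its 8 leaves, represents an arbitrary Boolean function F : {0,1}^3 → {0,1}. The particular F below is made up for illustration (the slide's own truth-table entries are not reproduced here), and build_full_tree and tree_eval are hypothetical helpers invented for this sketch.

```python
from itertools import product

# Any F : {0,1}^3 -> {0,1} can be written as a complete decision tree:
# test x1 at the root, x2 at depth two, x3 at depth three, and store one
# truth-table entry per leaf. The F here is an arbitrary example.
F = {bits: (bits[0] ^ bits[1]) & bits[2] for bits in product((0, 1), repeat=3)}

def build_full_tree(depth=0, prefix=()):
    """Complete tree as nested (zero_branch, one_branch) pairs; a leaf is a label."""
    if depth == 3:
        return F[prefix]                                  # leaf: truth-table entry
    return (build_full_tree(depth + 1, prefix + (0,)),    # branch x_{depth+1} = 0
            build_full_tree(depth + 1, prefix + (1,)))    # branch x_{depth+1} = 1

def tree_eval(node, x):
    for bit in x:          # follow the branches for x1, then x2, then x3
        node = node[bit]
    return node            # label at the leaf reached

tree = build_full_tree()
# The tree agrees with F on all 8 inputs, i.e. it represents F exactly.
assert all(tree_eval(tree, x) == F[x] for x in product((0, 1), repeat=3))
```

The same construction works for any number of variables n, which is the point of the slide: with up to 2^n leaves, decision trees can represent every Boolean function, though the tree may be exponentially large.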