
Introduction to Tree Methods
Reading Assignment

Chapter 8 of Introduction to Statistical Learning, by Gareth James et al.
Tree Methods

Let’s start off with a thought experiment to give some motivation behind using a decision tree method.
Tree Methods

Imagine that I play tennis every Saturday, and I always invite a friend to come with me.

Sometimes my friend shows up, sometimes not.

For him it depends on a variety of factors, such as weather, temperature, humidity, wind, etc.

I start keeping track of these features and whether or not he showed up to play with me.
Tree Methods

I want to use this data to predict whether or not he will show up to play.

An intuitive way to do this is through a Decision Tree.
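As a sketch of what that looks like in practice (not from the slides; the feature names and data values below are invented), scikit-learn can fit a decision tree on records like these:

```python
# A minimal sketch: fitting a decision tree on invented "will my friend
# show up?" records. Feature names and values are assumptions.
import pandas as pd
from sklearn.tree import DecisionTreeClassifier

data = pd.DataFrame({
    "outlook":   ["sunny", "sunny", "overcast", "rain", "rain"],
    "humidity":  ["high", "normal", "high", "high", "normal"],
    "windy":     [False, True, False, False, True],
    "showed_up": ["no", "yes", "yes", "yes", "no"],
})

# One-hot encode the categorical features for scikit-learn.
X = pd.get_dummies(data[["outlook", "humidity", "windy"]])
y = data["showed_up"]

tree = DecisionTreeClassifier(criterion="entropy", random_state=0).fit(X, y)

# Predict for a new Saturday: sunny, normal humidity, not windy.
new_day = pd.get_dummies(
    pd.DataFrame([{"outlook": "sunny", "humidity": "normal", "windy": False}])
).reindex(columns=X.columns, fill_value=0)
print(tree.predict(new_day))
```

With only a handful of rows the tree memorizes the data; the point is just the workflow of encoding the features and asking the fitted tree about a new day.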
Tree Methods

In this tree we have:


● Nodes
  ○ Split for the value of a certain attribute

● Edges
  ○ Outcome of a split to the next node
Tree Methods

In this tree we have:


● Root
  ○ The node that performs the first split

● Leaves
  ○ Terminal nodes that predict the outcome
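To make these terms concrete, scikit-learn's `export_text` can print a fitted tree as text, with the root split at the top, internal nodes and their edges below it, and the class-predicting leaves at the ends (a sketch using the bundled iris data rather than the tennis example):

```python
# Sketch: printing the root, internal nodes/edges, and leaves of a tree.
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

iris = load_iris()
tree = DecisionTreeClassifier(max_depth=2, random_state=0).fit(iris.data, iris.target)

# Each "|---" line is an edge leading to a node; "class: ..." lines are leaves.
report = export_text(tree, feature_names=list(iris.feature_names))
print(report)
```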
Intuition Behind Splits

Imaginary data with 3 features (X, Y, and Z) and two possible classes.
Intuition Behind Splits

Splitting on Y gives us a clear separation between classes.
Intuition Behind Splits

We could have also tried splitting on other features first.
Intuition Behind Splits

Entropy and information gain are the mathematical methods of choosing the best split. Refer to the reading assignment.
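As a rough sketch of those two quantities (standard definitions, not taken from the slides): entropy measures how mixed a set of labels is, and information gain is the entropy removed by a split.

```python
# Entropy H = -sum(p * log2(p)); information gain = parent entropy minus
# the size-weighted entropy of the child groups after a split.
from collections import Counter
from math import log2

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

def information_gain(parent, children):
    n = len(parent)
    weighted = sum(len(child) / n * entropy(child) for child in children)
    return entropy(parent) - weighted

parent = ["A", "A", "B", "B"]
# A perfectly separating split (like splitting on Y above) gains a full bit;
# a split that leaves both groups equally mixed gains nothing.
print(information_gain(parent, [["A", "A"], ["B", "B"]]))  # 1.0
print(information_gain(parent, [["A", "B"], ["A", "B"]]))  # 0.0
```

The tree-growing algorithm evaluates candidate splits this way and greedily picks the one with the highest gain.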
Random Forests

To improve performance, we can use many trees, each splitting on a random sample of the features.

● A new random sample of features is chosen for every single tree at every single split.
● For classification, the number of features sampled at each split, m, is typically chosen to be the square root of p, the total number of features.
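In scikit-learn this corresponds to `RandomForestClassifier` with `max_features="sqrt"` (a sketch on synthetic data; the dataset here is invented):

```python
# Sketch: a random forest where each split considers sqrt(p) features.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Synthetic data with p = 16 features, so m = sqrt(16) = 4 per split.
X, y = make_classification(n_samples=200, n_features=16, random_state=0)

forest = RandomForestClassifier(n_estimators=100, max_features="sqrt",
                                random_state=0).fit(X, y)
print(forest.score(X, y))
```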
Random Forests

What's the point?


● Suppose there is one very strong feature in the data set. When using “bagged” trees, most of the trees will use that feature as the top split, resulting in an ensemble of similar trees that are highly correlated.
Random Forests

What's the point?


● Averaging highly correlated quantities does not significantly reduce variance.
● By randomly leaving out candidate features at each split, random forests “decorrelate” the trees, so that the averaging process can reduce the variance of the resulting model.
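A quick back-of-the-envelope sketch of that claim, using the standard identity for the variance of an average of n equicorrelated variables (the numbers below are assumed for illustration, not from the slides):

```python
# Var(average of n variables, each with variance sigma^2 and pairwise
# correlation rho) = rho * sigma^2 + (1 - rho) * sigma^2 / n.
sigma2, n = 1.0, 100  # assumed values for illustration

for rho in (0.9, 0.0):
    var_mean = rho * sigma2 + (1 - rho) * sigma2 / n
    print(f"rho = {rho}: variance of the average = {var_mean:.3f}")

# With rho = 0.9 (highly correlated bagged trees), averaging 100 trees still
# leaves ~90% of the variance; with rho = 0, it shrinks to sigma^2 / n.
```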
