FDA - Data Mining Methods, week 4

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/13

There's no tags or description

Looks like no tags are added yet.

Last updated 1:51 PM on 1/13/26

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

14 Terms

New cards

Data mining categories - Supervised method

Uses labeled data to predict the output for new, unseen data

→ Example: data= customer financial info, whether they defaulted on a loan

task= predict if new applicant is likely to default

New cards

Data mining categories - Unsupervised method

Uses unlabeled data to find hidden patterns, groupings or structures.

→ Example: data= shopping transactions

task= identify patterns like: ‘people who buy bread often buy butter’

New cards

Data mining categories - Global method

Creates information applicable to all data.

New cards

Data mining categories - Local method

Creates information only applicable to some of the data.

New cards

Data mining method - Linear regression, goal

Supervised and global
Goal= to create a linear model to relate an output variable to one or more input variables

New cards

Data mining method - Linear regression, SSD

SSD= Sum of Squared Deviations

The lower the SSD of a model the better the model.

→ SSD expresses variation relative to the model. Small SSD shows you that the prediction is close to the real data

Always look at the outcome of SSD in relation to the y-value to see if it is large or small

→ small: SSD= 0,8 and y-value= around 10

→ large: SSD= 3,6 and y-value= ranges 0-5

New cards

Data mining method - Linear regression, R²

Explains how well the regression line fits the data.

→ - R²= a slope down from left to right

→ + R²= a slope going up from left to right

The closer the R² to 0, the regression line has little to no predictive power.

→ -1 = perfect predictive power

→ -0.2 = little predictive power

→ 1 = perfect predictive power

→ 0.3 = little predictive power

New cards

Data mining method - Linear regression, residual plot

Used to check the linear regression plot.

→ Good plot= residual plot centered around 0, no pattern

→ Bad plot= residual plot with patterns, clusters and extreme plots(outliers)

New cards

Data mining method - Clustering, goal

Unsupervised and global
Goal= partition all locations in geographically meaningful groups.

<ul><li><p>Unsupervised and global</p></li><li><p>Goal= partition all locations in geographically meaningful groups.</p></li></ul><p></p>

New cards

Data mining method - Clustering, k-means

This is clustering by algorithms.

Pick random centroids of clusters.
Assign points to nearest centroid.
Recompute centroids until clusters are stable

→ The smaller the distance between centroids the better

New cards

Data mining method - Decision tree, goal

Supervised and global
Goal= Learn a tree to separate ‘quit’ and ‘non-quit’ cases.

<ul><li><p>Supervised and global</p></li><li><p>Goal= Learn a tree to separate ‘quit’ and ‘non-quit’ cases.</p></li></ul><p></p>

New cards

Data mining method - Decision tree, structure

Branches= the lines in between the ovals and cubes

Nodes= the ovals, the ‘topics’

Tree’s leaves= the cubes, the eventual outcome

→ Branches connect the nodes and the tree’s leaves

New cards

Data mining method - Decision tree, confusion matrix

Used to check the quality of the decision tree.

New cards

Data mining method - Association rules, goal

Unsupervised and local
Goal= to find high-confidence associations between frequently occurring (exam outcomes)

<ul><li><p>Unsupervised and local</p></li><li><p>Goal= to find high-confidence associations between frequently occurring (exam outcomes)</p></li></ul><p></p>

Explore top notes

Jewish Holidays and Practices

Updated 1113d ago

Note

Osmosis

Updated 426d ago

Note

Chapter 5: Newton's Laws of Motion

Updated 125d ago

Note

Unité 2- Comprendre le Christ

Updated 529d ago

Note

Notes of The Great Gatsby Intro

Updated 992d ago

Note

AP WORLD HISTORY REVIEW

Updated 635d ago

Note

RELIGION EXAM

Updated 965d ago

Note

Key Operant Conditioning Concepts to Know for AP Psychology (AP)

Updated 74d ago

Note

Jewish Holidays and Practices

Updated 1113d ago

Note

Osmosis

Updated 426d ago

Note

Chapter 5: Newton's Laws of Motion

Updated 125d ago

Note

Unité 2- Comprendre le Christ

Updated 529d ago

Note

Notes of The Great Gatsby Intro

Updated 992d ago

Note

AP WORLD HISTORY REVIEW

Updated 635d ago

Note

RELIGION EXAM

Updated 965d ago

Note

Key Operant Conditioning Concepts to Know for AP Psychology (AP)

Updated 74d ago

Note

Explore top flashcards

LS PRJ1

Updated 1001d ago

Flashcards (345)

Unit 7: Developmental Psychology

Updated 718d ago

Flashcards (71)

Exam Idiom

Updated 973d ago

Flashcards (165)

musculature of the shoulder joint

Updated 474d ago

Flashcards (20)

VIBRIO, AEROMONAS, HELICOBACTER SPECIES (MODULE 6)

Updated 838d ago

Flashcards (131)

Air Cargo Test 1

Updated 1090d ago

Flashcards (44)

06. Điền dấu hỏi - dấu ngã (Written)

Flashcards (40)

Flashcards (42)

Flashcards (345)

Unit 7: Developmental Psychology

Updated 718d ago

Flashcards (71)

Exam Idiom

Updated 973d ago

Flashcards (165)

musculature of the shoulder joint

Updated 474d ago

Flashcards (20)

VIBRIO, AEROMONAS, HELICOBACTER SPECIES (MODULE 6)

Updated 838d ago

Flashcards (131)

Air Cargo Test 1

Updated 1090d ago

Flashcards (44)

06. Điền dấu hỏi - dấu ngã (Written)

Updated 259d ago

Flashcards (40)

OMM Final Exam Terms

Updated 1170d ago

Flashcards (42)