CONCEPTS IN MACHINE LEARNINGPPT1

Branch of computer science aiming to create machines capable of mimicking human intelligence.
- Targets thinking, learning, problem-solving, language understanding, perception, etc.
Serves as the umbrella field that contains Machine Learning (ML) as a subset.

Subset of AI that provides systems the ability to learn and improve automatically from experience without explicit programming.
Arthur Samuel (IBM, 1959) coined the term “Machine Learning” and defined the field as “the study that gives computers the ability to learn without being explicitly programmed.”
Modern wording: “Programming computers to optimize a performance criterion using example data or past experience.”
- Model specified up to parameters; learning = algorithmic optimization of those parameters with training data.
- Models can be predictive (future estimates), descriptive (knowledge extraction), or both.

We assume a model $f(x,\Theta)$ .
Learning = running a computer program that adjusts $\Theta$ to minimize error per a performance measure.

Aspect	Traditional (Rule-Based)	Machine Learning
Source of logic	Human-written rules	Patterns inferred from data
Example	`if number % 2==0:` → “Even”	Model learns parity pattern from examples
Workflow diagram	Data + Program → Output	Data + Output → Program (learned model)

“A computer program is said to learn from experience E with respect to some tasks T and performance measure P if its performance at tasks T, as measured by P, improves with experience E.”

Hand-written word recognition
- T: Classify words in images.
- P: % of correctly classified words.
- E: Labeled dataset of word images.
Autonomous highway driving
- T: Drive using vision sensors.
- P: Avg. distance before an error.
- E: Video + steering data from human driver.
Chess playing
- T: Win chess games.
- P: % games won.
- E: Self-play practice games.

Data Storage
- Ability to store/retrieve vast data (brain vs. HDD/SSD/RAM).
Abstraction
- Extracting knowledge; fitting a model to data (training) ⇒ abstract representation.
Generalization
- Converting learned knowledge into a form usable for future/unseen instances.
- Key property: algorithm’s accuracy on new data.
Evaluation
- Providing feedback on model utility; drives iterative improvement.

Visualization: Data → Storage → Abstraction (concepts) → Generalization (inferences) → Evaluation (feedback).

Manufacturing, Healthcare, Insurance, Transportation, Automotive, E-commerce, Customer Service.
Data-mining: applying ML to large databases to derive high-accuracy yet simple models.
Sample domain uses:
1. Retail – consumer-behavior analysis.
2. Finance – credit scoring, fraud detection, stock modeling.
3. Manufacturing – process optimization & troubleshooting.
4. Medicine – diagnostic support.
5. Science – physics/astronomy/biology data analysis, web search.
6. AI research – adaptability in unforeseen scenarios (vision, speech, robotics).
7. Autonomous vehicles – steering on diverse roads.
8. Games – chess, backgammon, Go AI.

Smallest entity whose properties are measured (person, object, time point, region, measurement, person-years, …).

Rows = examples (cars).
Columns = features: year, model, price, mileage, color, transmission.
- “year”, “price”, “mileage” → numeric.
- “model”, “color”, “transmission” → categorical.

Numeric (Quantitative)
- Continuous (infinite values): age, weight, blood pressure.
- Discrete (finite counts): shoe size, number of children.
Categorical (Qualitative)
- Nominal (no intrinsic order): eye color, dog breed.
- Ordinal (ordered categories): clothing sizes, pain severity.

Goal: discover interesting relations (association rules) between variables in large datasets.
Example rule: {onion, potato} ⇒ {burger}
- Strength quantified by conditional probability $P({\text{burger}}\mid{\text{onion},\ \text{potato}})$ .
- If $P=0.8$ → “80 % of customers who buy onion & potato also buy burger.”
Marketing use: customers who bought X but not Y are potential Y targets (cross-sell, promo pricing, product placement).
Algorithms: Apriori, FP-Growth (Frequency Pattern), …

Problem: assign new observations to one of predefined categories using labeled data.
Requires discriminant rule/function.

Score1 Score2 Result
29     43      Pass
22     29      Fail
10     47      Fail
31     55      Pass
…

Finance loan-risk rule: IF income $x<em>1 > \theta</em>1$ AND savings $x<em>2 > \theta</em>2$ THEN “low-risk” ELSE “high-risk”.