CAP4770 - Final

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/54

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

55 Terms

1
New cards

What is data mining?

The field that addresses the need of making sense of the staggering number of data values produced by helping us find hidden relationships, connections, or patterns.

2
New cards

What is the role of data mining, as depicted in a flowchart?

Unstructured/structured data → data mining → knowledge (relationships, patterns, or models)

3
New cards

What is the application of data mining in business intelligence?

In helping organizations to make better decisions through basket analysis, sales forecasting, and inventory planning

4
New cards

What is the application of data mining in retail?

In studying customer preferences, shopping patterns, and purchasing habits

5
New cards

What is the application of data mining in banking?

In marketing, risk management, and money laundering detection

6
New cards

What is the application of data mining in bioinformatics?

in protein modeling, drug discovering, and biomarker identification

7
New cards

What is the application of data mining in healthcare?

In understanding complex mechanisms and their interactions, identifying people who are at risk for diseases, and evaluating diagnostics.

8
New cards

What is the application of data mining in education?

In improving learning outcomes, curriculum improvement, and finding out reasons behind dropouts

9
New cards

What is the application of data mining in television and radio?

in making personalized recommendations to radio listeners and TV viewers and making informed decisions on content creation and distribution

10
New cards

What is the application of data mining in crime prevention?

in fraud detection and prevention, modeling criminal behavior, and predictions of future crimes

11
New cards

What is the application of data mining in social media analysis?

In drawing conclusions about the social media users for targeted marketing campaigns and in the study of human behavior and human interaction

12
New cards

What is the application of data mining in supply chain management?

In improving customer satisfaction and loyalty and making decisions regarding supplier relationship, production processes, and distribution channels

13
New cards

What is step 1 of the process of knowledge discovery?

Data preparation

14
New cards

What is step 2 of the process of knowledge discovery?

Data mining

15
New cards

What is step 3 of the process of knowledge discovery?

Pattern and model evaluation

16
New cards

What is step 4 of the process of knowledge discovery?

Knowledge representation

17
New cards

What happens during the data preparation step of knowledge discovery?

A process which includes aspects such as removing noise, integrating multiple data sources, transforming data into forms suited for mining, and selecting data relevant to the problem at hand

18
New cards

What happens during the data mining step of the process of knowledge discovery?

The actual extraction of patterns and construction of models

19
New cards

What happens during the pattern and model evaluation step of knowledge discovery?

The identification of patterns and models relevant to the application

20
New cards

What happens during the knowledge representation step of knowledge discovery?

The generation of ways to present the knowledge obtained

21
New cards

What is multidimensional data summarization?

Consists of obtaining a concise description of data to facilitate gaining useful information, and can be offered in the form of pie charts, bar charts, data cubes and other forms

22
New cards

what is mining frequent patterns, associations, and correlations?

consists of discovering relationships among the items in a given dataset

23
New cards

what is classification?

consists of constructing a model that describes data classes and that can be used to determine, given a new object, its class

24
New cards

what is cluster analysis?

consists of forming groups (or clusters) of objects based on their properties, so that objects in the same cluster have high similarity but are dissimilar to objects in other clusters

25
New cards

What is deep learning?

uses techniques to identify meaningful features in datasets

26
New cards

what is outlier analysis?

the process of identifying data objects that significantly differ from the rest of the dataset

27
New cards

What is descriptive data mining?

is concerned with unconvering relationships that provide insight into the underlying structure of the data.

28
New cards

What is predictive data mining?

is concerned with finding models that help making predictions based on known data. For example, decision trees.

29
New cards

What concepts are the stepstones of data mining?

data objects, data attributes, and attribute types

30
New cards

What are data objects?

The entities our applications deal with: persons, car models, patients, items, etc.

31
New cards

What are data attributes?

provide the description of objects, that is, a data field representing a feature or characteristic of the object.

32
New cards

What is an attribute type?

an attribute’s values. These include nominal, ordinal, interval, and ratio.

33
New cards

What is the nominal attribute type?

values consist of names or labels, for example, ‘single’ or ‘married’.

34
New cards

What is the ordinal attribute type?

values can be arranged in order, that is, they can be sorted, but differences are not meaningful. For example, values in the 5-point Likert scale, 1-5 = strongly disagree-strongly agree.

35
New cards

What is the interval attribute type?

values have order, differences are meaningful, and there is not a value of inherent-zero (meaning ‘none’). Here, ratios are not meaningful. For example, the years in the last 4 decades that california has had at least one big earthquake: 1984, 1992, 1994, 1999, 2010, 2019.

36
New cards

What is the ratio attribute type?

Similar to the interval type, with the added property that an inherent-zero is possible. Here, it is meaningful to say that a value is a multiple of another one or calculate ratios. For example, the length in bytes of documents.

37
New cards

What are summary statistics?

Also known as statistical parameters, these are used to describe the information given by datasets in a manner as simple as possible.

38
New cards

What is the measure of central tendency?

A value that represents the center of a data set, or in other words, a typical value of the data set. Among the most used are the mean, the median, the mode, and the midrange.

39
New cards

What is measure of dispersion?

A value that helps us understand the variability or spread of the data. Among the most used are the range, the variance, and the standard deviation

40
New cards

What is variance?

Variance is the average squared deviations from the mean.

41
New cards

What is standard deviation?

The square root of the variance

42
New cards

What are proximity measures?

Used to be able to determine how similar or dissimilar two objects are. Examples are the Minkowski sum and the cosine similarity.

43
New cards

What is normalization?

A method to attempt to give different attributes an equal weight. This is used to make it easier for patterns to be understood from data. Examples are Min-Max normalization, decimal scaling, and z-score.

44
New cards
45
New cards
46
New cards
47
New cards
48
New cards
49
New cards
50
New cards
51
New cards
52
New cards
53
New cards
54
New cards
55
New cards