KWARA STATE UNIVERSITY, MALETE - Fundamentals of Data Analysis

0.0(0)
studied byStudied by 0 people
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
Card Sorting

1/43

flashcard set

Earn XP

Description and Tags

Flashcards on Data Science and Big Data Fundamentals.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

44 Terms

1
New cards

Data Science

Domain of study that deals with vast volumes of data using modern tools and techniques to find unseen patterns, derive meaningful information, and make business decisions.

2
New cards

Capture (Data Science Lifecycle)

Data Acquisition, Data Entry, Signal Reception, Data Extraction.

3
New cards

Maintain (Data Science Lifecycle)

Data Warehousing, Data Cleansing, Data Staging, Data Processing, Data Architecture.

4
New cards

Process (Data Science Lifecycle)

Data Mining, Clustering/Classification, Data Modeling, Data Summarization.

5
New cards

Analyze (Data Science Lifecycle)

Exploratory/Confirmatory, Predictive Analysis, Regression, Text Mining, Qualitative Analysis.

6
New cards

Communicate (Data Science Lifecycle)

Data Reporting, Data Visualization, Business Intelligence, Decision Making.

7
New cards

Big Data

Extremely large and diverse collections of structured, unstructured, and semi-structured data that continues to grow exponentially over time.

8
New cards

Big Data

Massive amount of organized and unstructured data that a company encounters on a daily basis, studied for insights.

9
New cards

Volume (Big Data)

The amount of data

10
New cards

Velocity (Big Data)

The rapid collection of data.

11
New cards

Structured Data

Data that has been arranged.

12
New cards

Semi-structured Data

A type of data that is semi-organized and doesn't follow the traditional data structure.

13
New cards

Unstructured Data

Data that has not been arranged and doesn't fit cleanly into a relational database's standard row and column structure.

14
New cards

Cost Savings (Big Data)

Helps in providing business intelligence that can reduce costs and improve the efficiency of operations.

15
New cards

Data

Collection of facts and figures used for analysis or a survey; a series of representations of various values of that quantity.

16
New cards

Quantitative Data

Numerical values are assigned to the characteristics or properties of objects or events, according to logically accepted rules.

17
New cards

Qualitative Data

The researcher takes into consideration the phenomenon as a whole and does not attempt to analyze it in measurable or quantifiable terms

18
New cards

Nominal Scale

Used when a set of objects among two or more categories is to be differentiated on the basis of certain clearly known characteristics

19
New cards

Ordinal Scale

Corresponds to quantitative classification of a set of objects done with the help of ranking on a continuum.

20
New cards

Interval Scale

Based on equal units of measurement; includes how much or how little of a given characteristic or attribute is present.

21
New cards

Ratio Scale

The highest level of measurement; assumes the existence of absolute zero.

22
New cards

Data Characteristics

Attributes include accuracy, completeness, consistency, timeliness, relevance, validity, and reliability.

23
New cards

Accuracy

Data should be free from errors and accurately reflect the real-world situation.

24
New cards

Completeness

Data should include all necessary information fields without missing values.

25
New cards

Timeliness

Data should be current and available when needed for decision-making.

26
New cards

Relevance

Data should be relevant to the specific task or analysis at hand, avoiding unnecessary or irrelevant information.

27
New cards

Reliability

Data should produce consistent results over time and be trustworthy.

28
New cards

Data collection

The process of collecting, measuring and analyzing different types of information using a set of standard validated techniques.

29
New cards

Primary Data

Data collected from first-hand experience directly from the main source.

30
New cards

Secondary data

Information that is been collected or gathered by some other researchers.

31
New cards

Observation Method

Data from the field is collected with the help of observation by the observer or by personally going to the field.

32
New cards

Participant Observation

The researcher actively engages in the daily activities of the subjects being studied.

33
New cards

Non-Participant Observation

The researcher observes subjects without becoming involved in their activities.

34
New cards

Questionnaire Method

A set of questions arranged logically, divided into groups, with the object of collecting information for research.

35
New cards

Schedule Method

The questionnaire but it filled by enumerator.

36
New cards

Survey method

A detailed inspection or investigation.

37
New cards

Panel Method

Data is collected from the same sample respondents at some interval either by mail or by personal interview.

38
New cards

Case Study Method

An intensive investigation of the particular unit under consideration..

39
New cards

Raw Data

Unprocessed data that has been collected and recorded directly from a source without any manipulation, organization, or analysis.

40
New cards

Contextualized Data

Refers to raw data that has been processed, organized, and enriched with additional context or meaning.

41
New cards

Metadata

Data about data

42
New cards

Descriptive Metadata

Aids users in finding, identifying, and selecting resources by describing them for search and discovery.

43
New cards

Structural Metadata

Describes the structure, type, and relationships of data.

44
New cards

Administrative Metadata

Carries technical details about a file or resource and is crucial for its identification, presentation, and preservation.