CIS 328 midterm

0.0(0)
studied byStudied by 0 people
0.0(0)
full-widthCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/38

encourage image

There's no tags or description

Looks like no tags are added yet.

Study Analytics
Name
Mastery
Learn
Test
Matching
Spaced

No study sessions yet.

39 Terms

1
New cards

Data

Facts and statistics collected together for reference or analysis

2
New cards

What do we do with data?

collect, store, measure, analyze, visualize

3
New cards

What skill sets do Data analysts need?

R or SAS, SQL, statistical analysis, database management & reporting, and data analysis

4
New cards

Goal of data analysis

to organize and summarize information in order to make evidence-based inferences about priority population. Take raw data and use it solve problems.

5
New cards

data analysis cycle: Ask

Ask a question (stats, case studies)

6
New cards

data analysis cycle: Obtain

Gather data from relevant sources (database, files, web api)

7
New cards

data analysis cycle: Scrub

Clean data to appropriate formats (Split, merge or extract columns)

8
New cards

data analysis cycle: Explore

Find patterns and trends via statistcs (Type of data, descriptive stats, correlation)

9
New cards

data analysis cycle: Model

Construct models for prediction (Regression for predication)

10
New cards

data analysis cycle: Interpret

Use the results for decision making (visualize results with domain knowledge)

11
New cards

Statistics

the collection and classification of data that are in the form of numbers. It uses data collection, analysis, interpretation, presentation.

12
New cards

Data Collection

capturing and gathering all data necessary to complete the processing of transactions

13
New cards

Data Analysis

The process of compiling, analyzing, and interpreting the results of primary and secondary data collection.

14
New cards

data interpretation

Making sense of and analyzing data to find patterns and trends.

15
New cards

Data presentation

an organized display of data.

16
New cards

Population

the entire collection of people, objects, or events that needs to be analyzed

17
New cards

Sample

a subset of the population

18
New cards

descriptive statistics

Uses data to describe the population parameters through numbers and graphics

19
New cards

inferential statistics

use data collected from a sample of a population to make inference and predications about the entire population

20
New cards

Probability

likelihood that a particular event will occur

21
New cards

random experiment

An experiment or process for which the outcome can not be predicted with certainty

22
New cards

sample space

the set of all possible outcomes

23
New cards

event

One or more outcomes of an experiment

24
New cards

confidence interval

a range of values so defined that there is a specified probability that the value of a parameter lies within it.

25
New cards

Types of Investigations

Descriptive, Relational, experimental

26
New cards

descriptive investigation (field studies or interviews)

constructing an accurate description of what is happening.

27
New cards

Relational investigations (Observations or surveys)

used identify relations between multiple factors. Rarely determines the causal relationship between multiple factors.

28
New cards

experimental research (controlled experiements)

the establishment of causal relationship

29
New cards

hypotheses

A precise problem statement that can be directly tested through an empirical investigation.

30
New cards

null hypothesis

there is no difference between experimental treatments.

31
New cards

alternative hypothesis

lstatement that is mutually exclusive with the null hypothesis

32
New cards

Goal of an experiment

to find statistical evidence to refute the null hypothesis in order to support the alternative hypothesis.

33
New cards

independent variable

the variable that is manipulated in an experiment, modify subjects conditions.

34
New cards

Type 1 error (worse than type 2)

Rejecting null hypothesis when it is true. Ex. False alarm

35
New cards

type 2 error

failing to reject a false null hypothesis. Ex. Missed dectection

36
New cards

Low threshold

many type 1 errors

37
New cards

high threshold

many type 2 errors

38
New cards

Dependent variable

variable affected by change

39
New cards

factorial design (between/within, split plot)

a study in which there are two or more independent variables, or factors. Researchers identify interactions between variables.