Descriptive Analytics in Information Systems and Supply Chain Management

Description and Tags

This set of flashcards covers key concepts and commands related to descriptive analytics, focusing on measures of central tendency, dispersion, and visualization in R.

20 Terms

1

What is the purpose of descriptive statistics?

To describe, summarize, and visualize data.

2

What are the three measures of central tendency?

Mean, Median, Mode.

3

How is the Mean calculated?

By dividing the sum of the values by the number of observations.

4

What is the Median?

The middle value in a sorted dataset (with an even number of observations, the average of the two middle values).

5

What does the Mode represent?

The most frequently occurring value in the dataset.
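A minimal sketch of the three measures in base R. The vector x is made up for illustration. Base R has no built-in function for the statistical mode (mode() reports the storage type), so the table-based approach below is one common workaround rather than a standard command.

```r
# Hypothetical sample values, chosen only for illustration
x <- c(2, 4, 4, 5, 7, 9, 4)

x_mean   <- mean(x)     # same as sum(x) / length(x)
x_median <- median(x)   # middle value of sort(x)

# Tabulate the values and take the most frequent one.
# which.max() returns only the first maximum, so ties are not reported.
counts <- table(x)
x_mode <- as.numeric(names(counts)[which.max(counts)])

print(c(mean = x_mean, median = x_median, mode = x_mode))
```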

6

How is the Range calculated?

As the difference between the maximum and minimum values.

7

What is the Interquartile Range?

The difference between the third quartile (Q3) and the first quartile (Q1): IQR = Q3 - Q1.
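A short sketch of the range and the interquartile range in base R, using a made-up vector. Note that quantile() uses R's default quantile method, so other software or hand calculations may give slightly different quartiles.

```r
# Hypothetical sample values, chosen only for illustration
x <- c(1, 3, 5, 7, 9, 11, 13)

# Range as a single number: maximum minus minimum
# (range(x) itself returns the pair c(min, max))
x_range <- max(x) - min(x)

# Interquartile range from the quartiles, and via the built-in IQR()
q     <- quantile(x, probs = c(0.25, 0.75))
x_iqr <- unname(q[2] - q[1])

print(c(range = x_range, iqr = x_iqr, builtin_iqr = IQR(x)))
```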

8

What does standard deviation measure?

The spread of the data points around the mean.
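As an illustration, the sketch below compares R's sd() with a manual calculation on made-up values. sd() computes the sample standard deviation, which divides by n - 1 rather than n.

```r
# Hypothetical sample values, chosen only for illustration
x <- c(2, 4, 4, 4, 5, 5, 7, 9)

# Built-in sample standard deviation
x_sd <- sd(x)

# Manual version: square root of the summed squared deviations
# from the mean, divided by n - 1
manual_sd <- sqrt(sum((x - mean(x))^2) / (length(x) - 1))

print(c(sd = x_sd, manual = manual_sd))
```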

9

What is a frequency distribution?

It describes how often values occur in a dataset.
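One way to build a frequency distribution in base R is table(), sketched here on a made-up vector of shipping modes. prop.table() converts the counts to relative frequencies.

```r
# Hypothetical categorical data, chosen only for illustration
modes <- c("truck", "rail", "truck", "air", "truck", "rail")

# Absolute frequencies: how often each value occurs
freq <- table(modes)

# Relative frequencies: share of each value in the total
rel_freq <- prop.table(freq)

print(freq)
print(rel_freq)
```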

10

What does Skewness indicate?

The asymmetry in the distribution of data.
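Base R has no skewness function, so the sketch below defines a hypothetical helper using one common definition (the third central moment divided by the second central moment raised to the power 1.5). Other definitions apply small-sample corrections and give slightly different numbers. A positive result suggests a longer right tail, a negative result a longer left tail.

```r
# Hypothetical helper, not part of base R: moment-based skewness
skewness <- function(x) {
  x <- x[!is.na(x)]          # drop missing values
  d <- x - mean(x)           # deviations from the mean
  mean(d^3) / mean(d^2)^1.5  # third moment scaled by the spread
}

# Made-up data: one with a long right tail, one symmetric
right_skewed <- c(1, 2, 2, 3, 3, 3, 4, 20)
symmetric    <- c(1, 2, 3, 4, 5)

print(skewness(right_skewed))  # positive
print(skewness(symmetric))     # zero for perfectly symmetric data
```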

11

How do you create a histogram in R?

Use hist(dataframe$column, main="title", xlab="x-axis label", ylab="y-axis label").
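A concrete version of the command, assuming R's built-in mtcars dataset as a stand-in for your own dataframe. The title and axis labels are illustrative.

```r
# mtcars ships with R; mpg is miles per gallon for 32 cars
h <- hist(mtcars$mpg,
          main = "Distribution of fuel economy",
          xlab = "Miles per gallon",
          ylab = "Number of cars")

# hist() also returns the bin breaks and counts it plotted
print(h$breaks)
print(h$counts)
```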

12

What is a boxplot based on?

The 5 Number Summary (Min, Q1, Median, Q3, Max).

13

What do outliers represent in a boxplot?

Values that fall below Q1 - 1.5 × IQR or above Q3 + 1.5 × IQR.
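The rule can be sketched by computing the two fences directly, here on made-up values. Note that boxplot() itself derives its quartiles from hinges (see fivenum()), which can differ slightly from quantile(), so the flagged points may not always match exactly.

```r
# Hypothetical sample values, chosen only for illustration
x <- c(10, 12, 13, 14, 15, 16, 18, 40)

q1  <- unname(quantile(x, 0.25))
q3  <- unname(quantile(x, 0.75))
iqr <- q3 - q1

# Fences from the 1.5 x IQR rule
lower_fence <- q1 - 1.5 * iqr
upper_fence <- q3 + 1.5 * iqr

# Values outside the fences are flagged as outliers
outliers <- x[x < lower_fence | x > upper_fence]
print(outliers)
```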

14

What command in R displays the first n observations of a dataframe?

head(dataframe, n).

15

What function returns the distinct values of a column in R?

unique(dataframe$column).

16

What is the summary() function in R used for?

To provide summary statistics for a column: Min, Q1, Median, Mean, Q3, and Max.
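A brief sketch of head(), unique(), and summary() together, assuming the built-in mtcars dataset in place of your own dataframe.

```r
# First 3 observations of the dataframe
first_rows <- head(mtcars, 3)
print(first_rows)

# Distinct values of one column (number of cylinders)
cyl_values <- unique(mtcars$cyl)
print(cyl_values)

# Min, Q1, Median, Mean, Q3, Max for a numeric column
mpg_summary <- summary(mtcars$mpg)
print(mpg_summary)
```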

17

How do you handle missing values when calculating the mean in R?

Use mean(dataframe$column, na.rm=TRUE) to ignore NA values.
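To see why na.rm matters, the sketch below uses a made-up vector with one missing value. Without na.rm = TRUE, a single NA makes the whole mean NA.

```r
# Hypothetical values with one missing observation
x <- c(4, 8, NA, 6)

n_missing  <- sum(is.na(x))         # count the NA values first
mean_raw   <- mean(x)               # NA, because of the missing value
mean_clean <- mean(x, na.rm = TRUE) # mean of the non-missing values only

print(c(missing = n_missing, raw = mean_raw, clean = mean_clean))
```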

18

What is tapply used for in R?

To apply a function to subsets of a vector, categorized by another vector.
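For example, assuming the built-in mtcars dataset, tapply() can compute the mean fuel economy separately for each cylinder count. The first argument is the vector to summarize, the second is the grouping vector, and the third is the function.

```r
# Mean mpg for each distinct value of cyl
mpg_by_cyl <- tapply(mtcars$mpg, mtcars$cyl, mean)
print(mpg_by_cyl)

# The same result for one group, computed by hand
mean_4cyl <- mean(mtcars$mpg[mtcars$cyl == 4])
print(mean_4cyl)
```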

19

How do you visualize the relationship between two numerical variables?

By using a scatterplot.

20

What is a scatterplot meant to illustrate?

The potential relationship between two numerical variables.
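A minimal scatterplot sketch in base R, again assuming the built-in mtcars dataset. The choice of variables, title, and labels is illustrative only.

```r
# Two numerical variables: car weight and fuel economy
wt  <- mtcars$wt
mpg <- mtcars$mpg

# plot(x, y) draws one point per observation
plot(wt, mpg,
     main = "Fuel economy vs. weight",
     xlab = "Weight (1000 lbs)",
     ylab = "Miles per gallon")
```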