the process of getting data from an outside source into an Excel worksheet.
2
New cards
Cleanse Data
fix obvious errors and convert to useful format
3
New cards
Transform Data
convert data to anther format
4
New cards
consolidate data
combine data from different sources
5
New cards
Power Query Editor
-use to transform data -tools to groups rows, filter, replace values, remove duplicates, edit columns
6
New cards
Data in Editor
-A local copy: not original data -editing data does not change original data source -local data becomes a table/query
7
New cards
What is a connection (when you click get external data group on the data tab, excel establishes a connection)?
-an identifier and a link for data that originated outside the workbook -the connection can be edited, refreshed, named or removed.
8
New cards
Web Query:
-a way of importing data into a spreadsheet direct from a web page -links spreadsheet to data on a webpage (query will update if website updates)
9
New cards
What is HTML?
Hypertext Markup Language, used to display web data
10
New cards
What is XML?
Extensible Markup Language, helps give structure to web page data so it can be searched and processed more efficiently
11
New cards
.csv mean
comma separated
12
New cards
.txt mean
tab delimited
13
New cards
.prn
space delimited
14
New cards
What is a Relational database?
stores data in tables linked by shared fields
15
New cards
What is Primary Key?
unique identifier for each row or record used to link to other tables
16
New cards
What is Foreign key?
Common field of a table linked to a primary key field in another table
17
New cards
What is a Microsoft query used for?
use to import individual data fields into excel instead of importing a complete table
18
New cards
Data Cleansing
fixes obvious data errors and converts data into a useful format
19
New cards
Data verification
validates that data is correct and accurate
20
New cards
Flash Fill
recognizes simple patterns in data as you type and automatically fills in values
21
New cards
Why is using text functions more preferable than using flash fill?
flash fill can make mistakes and its results are not dynamic (won't update if the original data isn't updated)
22
New cards
Lower text function
changes uppercase characters to lowercase
23
New cards
Upper text function
changes all characters in a cell to uppercase
24
New cards
Proper text function
capitalizes first letter of each word in the text string
25
New cards
TRIM text function
removes extra spaces from a string of text
26
New cards
clean
removes nonprintable characters
27
New cards
Nesting
Using a function as an argument within another function.
28
New cards
FIND function
function finds where a specific string of text is located within a larger string
29
New cards
LEFT function
extracts a specific number of characters from a text string beginning at the left of the string
30
New cards
RIGHT function
extracts a specific number of characters from a text string beginning at the right of the string =RIGHT(text,[num_chars])
31
New cards
Mid Function
extracts characters from left to right based on a starting location and the number of characters =MID(text,start_num,num_chars)
32
New cards
len function
calculates the length of a string =LEN(text)
33
New cards
EXACT function
compares two text strings, values, or characters to determine if they are identical
34
New cards
TEXT Function
Can display numeric data as text using special formatting strings to display the text
35
New cards
TEXTJOIN
Combines strings of text, values, or characters
36
New cards
DATEVALUE function
Converts a date in the form of text to a serial number
37
New cards
What-if analysis (also called sensitivity analysis)
use several different values in one or more formulas to explore all the various results
38
New cards
Variable
A value that can be changed to see how the change affects other values
39
New cards
break-even analysis
-a type of what-if analysis -calculates the break-even point in sales volume or dollars -can be used to evaluate the impact of changes in sales volume on profits
40
New cards
break-even point
sales level at which revenue equals total costs
41
New cards
Scroll Bar Form Control
- Use a Scroll Bar to quickly enter or change a range of values - Scrolls through a range of values when you click the scroll arrows or drag the scroll box - Useful for setting or adjusting a large range of values, or for cases when precision is not important - Link the scroll bar to the cell containing the value you want to change.
42
New cards
Excel What-if Analysis Tools: Data Tools
-take sets of input values, determine possible result, and display results in a table
43
New cards
One-variable data table
Uses one set of input values, substitutes those values in one or more formulas and displays results for all the formulas
44
New cards
Two-variable data table
Uses two sets of input values, one in a column, the other in a row, and displays results for only one formula
45
New cards
Goal Seek
Uses iteration to find the values needed to achieve a goal or objective
46
New cards
Backsolving
knowing the results and determining the value needed to reach those results
47
New cards
Scenario
-A saved set of values in a ws -user can vary numbers and see potential results -often used to review best and worst possibilities -ws can have up to 32 scenarios
48
New cards
Scenario summary report
A generated worksheet that displays values for changing and result cells for each scenario in a workbook
49
New cards
Solver
data analysis tool that optimizes a problem by manipulating the values of selected variables but bound by constraints
50
New cards
Simplex LP
Used when objective function and all constraints are linear
51
New cards
GRG Nonlinear method
-Default method -used for more complex models -used when the objective or at least one of the constraints is nonlinear and smooth
52
New cards
Evolutionary
-most complex model -used when worksheet model is nonlinear and not smooth -"Nonsmooth"-line has sharp bends and no slope
53
New cards
Median
the middle value of a set of values
54
New cards
Mode
the value that appears most often in a set of values
55
New cards
MODE.SNGL
Returns only one mode value. Returns the first mode value that it finds. If there are no duplicate values, returns the error #N/A.
56
New cards
MODE.MULTI
will return multiple values if the data has more than one mode. an array function*
57
New cards
RANK.EQ and RANK.AVG Functions
-calculate where a particular value appears in the data -when duplicated values exist in the range, RANK.EQ shows the lower rank -RANK.AVG calculates duplicate values with and average rank
58
New cards
RAND Function
return a random number greater than or equal to 0 or less than 1
59
New cards
RANDBETWEEN function
Returns a random number between the numbers you specify
60
New cards
Measuring dispersion
-tells us how spread out (or variable) a set of data is -measures: range, variance, standard deviation
61
New cards
Variance
measures how far the data varies from the mean
62
New cards
standard deviation
calculates average spread of data set from the mean
63
New cards
Histogram Chart
-like a column or bar chart except each column represents a range of values instead of a specific data series -each column is called a bin -the range of values in a bin is called the class interval
64
New cards
Pareto Chart
A type of histogram chart where the columns are sorted from largest to smallest with a line chart on a secondary axis showing the cumulative total percentage.
65
New cards
Frequency Function
an array function that calculates how often values occur in a bin
66
New cards
box and whisker chart
another way to represent the range of values for each category based on dividing the values into quartiles
67
New cards
waterfall chart
- Shows a running total as values are added or subtracted - Useful for understanding how a value is affected by a series of positive and negative values - Columns are color coded to easily differentiate between positive and negative numbers - The initial and the final value columns often start on the horizontal axis, while the intermediate values are floating columns
68
New cards
Analysis ToolPak
-An Excel add-in with statistical and engineering functions -Generates descriptive statistics
69
New cards
moving average
a series of averages that is recalculated through time based on a window of observations.
70
New cards
Business Intelligence (BI)
Software applications used to analyze an organizations data to improve: decision making, cut costs, identify new opportunities
71
New cards
BI tools
-gather, process, analyze and visualize large volumes of past, current, and future data -create interactive reports
72
New cards
Digital Dashboards
-deliver BI in graphical form -some features include: interactive tools, integration of different types of data from different sources, user interface that is easy to navigate and understand
73
New cards
Key performance indicators (KPIs)
Quantifiable measures of performance used to gauge progress toward strategic objectives or agreed standards of performance.
74
New cards
Data Modeling
The process of creating a small scale representation of data and the relationships among pieces of data
75
New cards
data model
an abstract model that organizes elements of data and standardizes how they relate to one another and to the properties of real-world entities
76
New cards
Relational database
-collection of tables that can be joined through a common field -data can be accessed or reassembled without changing the tables
77
New cards
Get & Transform (Power Query)
Use to extract, connect, clean and transform large amounts of data. Can modify data types and extract PivotTables.
78
New cards
Power Pivot
use to import and compare large amounts of data from multiple sources to analyze relationships between tables of data
79
New cards
Power View
an interactive visualization tool used to provide a drag-and-drop interface for rapid model building. can connect to different data models within the same workbook
80
New cards
3D Maps
-Can plot and visualize geographic or temporal data on a three-dimensional map. -Can build cinematic time tours or animations.
81
New cards
data analysis expressions (DAX)
A library of functions and operators that can be combined to build formulas and expressions in power pivot in excel data models (focus on performing calculations on columns-not cells
82
New cards
KPI
-a base value and status are evaluated against a target value
83
New cards
Status threshold
-defined by the range between a high value and a low value -displays with a graphic to help users easily determine the status of the base value compared to the target value.
84
New cards
base value
a calculated field that resolves to a value as part of a KPI.
85
New cards
target value
a calculated field that results in a value or an absolute value
86
New cards
Positive KPI
-the greater the value the better the KPI -Example: Company profit
87
New cards
Negative KPI
-The greater the value, the worse the KPI -Example: Employee turnover rate
88
New cards
Bidirectional KPI
-Value becomes worse the farther it deviates from target value in either direction -Example : inventory on hand must not be too high or too low
89
New cards
Dynamic Array formulas
returns arrays of variable size which can spill into adjacent cells
90
New cards
spilling
when formulas return values into neighboring cells
91
New cards
unique function
returns a unique list of values from an array and updates dynamically
92
New cards
sort function
customizes the sorting of a range or array
93
New cards
TRANSPOSE function
converts a vertical array of cells to a horizontal array or vice versa
94
New cards
Data Types
connections to an online data source that return information related to the selected data
95
New cards
algorithms
a sequence of steps or set of instructions for doing something
96
New cards
information processing
transforming data to produce new information
97
New cards
features of an algorithm
consists of a finite number of instructions, each individual instruction is well defined, describes a process that eventually halts after arriving at a solution to a problem
98
New cards
Hardware
physical devices required to execute algorithms
99
New cards
software
set of the algorithms, represented as programs in particular programming languages