1/69
Looks like no tags are added yet.
Name | Mastery | Learn | Test | Matching | Spaced |
---|
No study sessions yet.
Which task is an analyst performing in a spreadsheet when they isolate data that meets specific, exclusive criteria?
Filtering data
Which command should an analyst use to identify which table to access in a database using a Structured Query Language (SQL)?
FROM
How can an analyst sort all areas of Microsoft Excel and Google Sheets spreadsheets to organize data for analysis?
“Sort Sheet”
Which method can an analyst use to categorize data based on a specific metric in Microsoft Excel and Google Sheets spreadsheets for data analysis?
Write a sort function
An analyst needs to manipulate and categorize data using a spreadsheet and Structured Query Language (SQL).
How can they format and adjust data for analysis?
Sort and filter data
How does an analyst reorganize the layout of data in Microsoft Excel or Google Sheets spreadsheets based on product type while keeping all rows visible?
SORT function
How does a pivot table custom list sort by default for analysis?
In ascending order first
An analyst is working with data in a Google Sheet spreadsheet that lists temperatures in Fahrenheit but needs the data to be in Celsius.
How can the analyst change the unit of measurement?
Use the CONVERT function
An analyst is working with a Microsoft Excel spreadsheet that will later be shared with a project team.
Why would the analyst use the validation function on this sheet?
To control the type of data entered
An analyst wants to format all data in a column to YYYY-MM-DD format to be represented by dates and times with values ranging from 00:00:00 to 11:59:59 using Structured Query Language (SQL).
The analyst uses the DATETIME command.
What does this command do to the data?
Data will be converted to a datetime value.
Why does an analyst use the LEN function in Microsoft Excel and Google Sheets spreadsheets?
To determine the length of a string
Why does an analyst use the CONCAT_WS function in Structured Query Language (SQL)?
To add two or more strings together with a separator
Which programming language can handle trillions of rows of data and is now a standard language for working with database programs?
Structured Query Language (SQL)
An analyst is working on a project with a lot of milestones to track and wants to create a spreadsheet that everyone can use to track their individual progress.
How could the analyst sort and filter progress milestones?
Create a data validation menu in cells with predetermined options
An analyst working with data in two different Google Sheets needs information from both spreadsheets to connect together in a matching column on a single sheet.
Which function should the analyst use?
VLOOKUP
An analyst working to aggregate data in databases uses a Structured Query Language (SQL) command that returns all matching records in two tables and the key values in each.
Which command has the analyst used?
INNER JOIN
An analyst working to aggregate data in databases wants to create a temporary name for a table with a complex name to make Structured Query Language (SQL) queries simpler.
Which command should the analyst use?
AS
An analyst working to aggregate data in databases uses a Structured Query Language (SQL) command to create a temporary name for a table and receives an ERROR message when the command is used.
Which command does the analyst use to create a temporary name for a table?
FROM
An analyst working in a spreadsheet needs to determine the total quantity of numerical values within a specific range that are unique.
Which function should the analyst use?
COUNT DISTINCT
An analyst is working in a database and needs to use a function that will return all the records from one table and only the matching records from a second table.
How does the analyst determine the order of the tables using a JOIN function?
The table mentioned first is on the left.
An analyst is working on a spreadsheet and notices that rows and columns that should remain unchanged are changing when they are copied. This data set is generally problematic, and the analyst is trying to solve many problems under time constraints.
What does the analyst do to correct this issue with the data-aggregation process as quickly as possible?
Lock the table array part of the function
An analyst working in Google Sheets wants to find the lowest possible average of monthly sales in a range of values.
Which function should the analyst use?
MIN
An analyst working in Google Sheets wants to calculate the total revenue of each transaction recorded. Each transaction can have exactly one item type and the data is organized by a category other than quantity.
Which function should the analyst use?
SUMIF
An analyst working with data in Google Sheets wants to count the number of times a record appears in a column based on several conditions.
Which syntax should the analyst use?
=COUNTIFS(criteria_range1, criterion1, [criteria_range2, criterion2, ...])
An analyst working in BigQuery applies a clause to a query creating a specific set of rows and columns that executes the same process as a temporary table.
Which clause does the analyst apply to the query?
WITH
An analyst working in Google Sheets wants to multiply individual revenue amounts by profit margin percentages and then calculate overall profit margins for a given range.
Which function should the analyst use to save time while performing the calculation?
SUMPRODUCT
An analyst working with data in Google Sheets wants to calculate the total sales for each department, the number of products each department sold, and the department that generated the most revenue, but the data in the original spreadsheet must be maintained.
Which approach should the analyst take?
Use a pivot table to make calculations
Which best practice helps to ensure the integrity of data before beginning to work on a data set?
Data validation
An analyst is comparing the common features of R and Python programming languages.
Which feature do both programming languages share?
They both offer communities for code development and support
What does an analyst use to identify how words and symbols should be used in R programming language?
Syntax
What is a unique challenge an analyst faces when using R programming language as opposed to Python?
Users may have difficulty using R programming language naming conventions.
An analyst wants to record the steps of analysis so they can double-check their work and solve problems.
How do general-purpose programming languages such as R support this?
By automatically storing code to reproduce and share work at any time
A web designer is responsible for the layout and styling of web pages that feature text, graphics, and video. The web designer needs to use a programming language that provides structure for web pages and is used to connect to hosting platforms.
Which programming language does the web designer use?
Hypertext Markup Language v5
A web-application developer designs network applications used all across the web. They need to use a programming language to create enterprise web applications that can run on multiple clients.
Which programming language should the web application developer use?
Java
An application developer who specializes in video games needs to use a programming language that is an extension of the C programming language that is also used to create console games such as those for Xbox.
Which programming language should the game developer use?
C++
Which tab does an analyst access to see the results of a diagram that represents values for two variables in RStudio?
Plots
An analyst is working in RStudio and wants to visit an active online community on the social media platform Reddit, where R users go to discuss R, ask questions, and share tips.
Which resource does the analyst use for this?
r/RLanguage
An analyst is working with a package in the RStudio tidyverse used for cleansing wide and long data to ensure that every part of a data table or data frame is the correct type located in the correct place.
Which package in the tidyverse is the analyst using?
Tidyr
An analyst is working in RStudio with a tool in R programming language (R) used for expressing a sequence of multiple operations by taking the output of one statement and making it the input of the next statement.
Which tool is the analyst using?
Pipe
An analyst working in RStudio wants to ensure that they are not hard coding any values within the code that do not change when they need to.
Which best practice does the analyst apply to increase readability?
Make the code dynamic
An analyst is working in RStudio with code that performs a particular function and is contained within code that performs a broader function.
Which tool is the analyst working with?
Nested function
An analyst is working with atomic vectors to organize and store data in R programming language (R). The analyst needs to determine the properties of the vector they are working with.
Which function should the analyst use?
typeof()
A data analyst wants to obtain the standard deviation statistical summary of a data set using R.
Which function should be used?
sd()
Which tidyverse package can transfer data from Microsoft Excel files to R?
readxl
An analyst runs the clean names() function on a data set.
What happens to the column names after completing this task?
Special characters are removed.
Which function can be used to create a single column from two separate columns in R?
unite()
A data analyst wants to set values for variables in R using operators.
Which type of operator should be used?
Assignment
What is a best practice for file naming when working in R?
Do not start or end file names with symbols.
Which tidyverse package is primarily used for creating data visualizations?
ggplot2
Which layer should be used to control the type of a ggplot 2 plot?
Geoms layer
Which symbol must be added to the code to add a new layer to a ggplot2 plot?
Plus (+)
Which function should be used to create the data frame for a plot in ggplot2?
ggplot()
A data analyst wants to create a bar chart using the ggplot2 package in R.
Which function should be used?
geom_bar()
Which layer can be added to a ggplot2 plot to help explain its purpose or to highlight important data?
Annotate
Which function can be used to draw arrows or add shapes to a plot that was built using the ggplot2 package?
annotate()
What does an analyst add to indicate the beginning or end of a data item in R Markdown?
Delimiter
An analyst reviewing code in R Markdown notices output warnings that are unrelated to the impact on the review's findings.
How does the analyst create a final report for stakeholders without unrelated warnings?
Change code chunk options
An analyst working in R Markdown needs to convert a file that is currently in the dot RMD format to another file type for a report that needs to be shared with stakeholders.
How should the analyst do this?
Use the knit option
An analyst wants to create a monthly report using a simple format that runs one line of code to update the data each month rather than creating the report from scratch each time.
Which best practice does the analyst apply to create the report with the most recent information?
Make a template and update code
An analyst who is working with a dot RMD file sets off the knit with two asterisks on either side of a word.
What does the analyst achieve with this action?
Adding a bold font
An analyst needs to use R Markdown to document an analysis.
How does the analyst set up the R Markdown environment?
Through the install packages function
An analyst working in R Markdown needs to begin a code chunk to load two packages.
What should the analyst add to begin the code chunk?
An “r” in braces
The analyst wants to use a tool with files that execute everything as code sequentially using R or Python programming languages.
Which tool should the analyst use?
Jupyter Notebook
An analyst wants to convert an R Markdown file to an output that publishes a group of related data visualizations using sidebars, tab sets, value boxes, and gauges.
Which output format should the analyst create?
Dashboard
An analyst needs to edit the structured reference data that sorts and identifies aspects of the information it describes.
What helps the analyst to do this?
YAML
An analyst needs to show stakeholders the code embedded in a file, its output, evidence for the findings, and the sources.
What does the analyst use to do this?
Code chunk
An analyst adds a code chunk into an R Markdown file.
What is the value of adding a code chunk into the file?
It allows the analyst to check the code for errors.
Which input is needed to generate a new R Markdown file?
Metadata
Which types of content make up an R Markdown file?
Code chunks and text
Which outcome does the bias function compare to the predicted outcome to determine whether the model is biased or unbiased?
Actual outcome