D465 - Pre Test

70 Terms

1

Which task is an analyst performing in a spreadsheet when they isolate data that meets specific, exclusive criteria? 

Filtering data

2

Which command should an analyst use to identify which table to access in a database using a Structured Query Language (SQL)?

FROM
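
As a minimal sketch of the card above, the snippet below runs a query against an in-memory SQLite database. The `orders` table and its columns are hypothetical and exist only for illustration; the point is that `FROM` names the table the query reads.

```python
import sqlite3

# In-memory database with a hypothetical "orders" table (illustrative names).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, product TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "pen", 2.5), (2, "book", 12.0), (3, "pen", 3.0)],
)

# FROM identifies which table the SELECT pulls its rows from.
rows = conn.execute("SELECT id, product FROM orders ORDER BY id").fetchall()
print(rows)
conn.close()
```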

3

How can an analyst sort all areas of Microsoft Excel and Google Sheets spreadsheets to organize data for analysis? 

“Sort Sheet”

4

Which method can an analyst use to categorize data based on a specific metric in Microsoft Excel and Google Sheets spreadsheets for data analysis? 

Write a sort function

5

An analyst needs to manipulate and categorize data using a spreadsheet and Structured Query Language (SQL).

How can they format and adjust data for analysis? 

Sort and filter data
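
On the SQL side, sorting and filtering can be sketched with `WHERE` and `ORDER BY`. The `sales` table and its values below are made up for the example and run against an in-memory SQLite database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table used only for this illustration.
conn.execute("CREATE TABLE sales (region TEXT, units INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("East", 40), ("West", 15), ("North", 60), ("South", 25)],
)

# WHERE filters out rows; ORDER BY sorts the rows that remain.
filtered_sorted = conn.execute(
    "SELECT region, units FROM sales WHERE units >= 25 ORDER BY units DESC"
).fetchall()
print(filtered_sorted)
conn.close()
```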

6

How does an analyst reorganize the layout of data in Microsoft Excel or Google Sheets spreadsheets based on product type while keeping all rows visible? 

SORT function

7

How does a pivot table custom list sort by default for analysis? 

In ascending order first

8

An analyst is working with data in a Google Sheets spreadsheet that lists temperatures in Fahrenheit but needs the data to be in Celsius.

How can the analyst change the unit of measurement?

Use the CONVERT function
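
In a sheet this would look something like `=CONVERT(A2, "F", "C")`, where `A2` is a hypothetical cell holding the Fahrenheit value. The arithmetic behind that conversion can be sketched in Python as follows; the function name is an assumption made for the example.

```python
def fahrenheit_to_celsius(temp_f):
    """Convert a Fahrenheit temperature to Celsius: (F - 32) * 5 / 9."""
    return (temp_f - 32) * 5 / 9

# Freezing and boiling points of water as a quick sanity check.
freezing_c = fahrenheit_to_celsius(32)
boiling_c = fahrenheit_to_celsius(212)
print(freezing_c, boiling_c)
```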

9

An analyst is working with a Microsoft Excel spreadsheet that will later be shared with a project team.

Why would the analyst use the validation function on this sheet? 

To control the type of data entered

10

An analyst wants to format all data in a column to YYYY-MM-DD format to be represented by dates and times with values ranging from 00:00:00 to 11:59:59 using Structured Query Language (SQL).
 
The analyst uses the DATETIME command.
 
What does this command do to the data?

Data will be converted to a datetime value.

11

Why does an analyst use the LEN function in Microsoft Excel and Google Sheets spreadsheets? 

To determine the length of a string

12

Why does an analyst use the CONCAT_WS function in Structured Query Language (SQL)?

To add two or more strings together with a separator
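
The two string cards above (LEN and CONCAT_WS) can be imitated in Python. This is only an analogy, not the spreadsheet or SQL functions themselves: `concat_ws` here is a hypothetical helper, and skipping missing values is modeled on how MySQL's CONCAT_WS treats NULL arguments.

```python
def concat_ws(separator, *parts):
    """Join the parts with the separator, skipping missing (None) values."""
    return separator.join(str(p) for p in parts if p is not None)

# LEN analog: the number of characters in a string.
name_length = len("Fahrenheit")

# CONCAT_WS analog: strings joined with a separator between them.
full_date = concat_ws("-", "2024", "01", "15")
with_gap = concat_ws(", ", "Boise", None, "ID")
print(name_length, full_date, with_gap)
```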

13

Which programming language can handle trillions of rows of data and is now a standard language for working with database programs? 

Structured Query Language (SQL)

14

An analyst is working on a project with a lot of milestones to track and wants to create a spreadsheet that everyone can use to track their individual progress.

How could the analyst sort and filter progress milestones?  

Create a data validation menu in cells with predetermined options

15

An analyst working with data in two different Google Sheets needs information from both spreadsheets to connect together in a matching column on a single sheet.

Which function should the analyst use?

VLOOKUP

16

An analyst working to aggregate data in databases uses a Structured Query Language (SQL) command that returns all matching records in two tables and the key values in each.

Which command has the analyst used?

INNER JOIN
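
A small sketch of an inner join, using two hypothetical tables (`customers` and `orders`) in an in-memory SQLite database. Only rows whose key appears in both tables come back: customer 2 has no order and order key 4 has no customer, so neither shows up.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical tables for illustration only.
conn.execute("CREATE TABLE customers (id INTEGER, name TEXT)")
conn.execute("CREATE TABLE orders (customer_id INTEGER, item TEXT)")
conn.executemany("INSERT INTO customers VALUES (?, ?)",
                 [(1, "Ana"), (2, "Ben"), (3, "Cy")])
conn.executemany("INSERT INTO orders VALUES (?, ?)",
                 [(1, "pen"), (1, "book"), (3, "ink"), (4, "cap")])

# INNER JOIN keeps only the records that match in both tables.
inner_rows = conn.execute(
    "SELECT customers.name, orders.item "
    "FROM customers INNER JOIN orders ON customers.id = orders.customer_id "
    "ORDER BY customers.id, orders.item"
).fetchall()
print(inner_rows)
conn.close()
```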

17

An analyst working to aggregate data in databases wants to create a temporary name for a table with a complex name to make Structured Query Language (SQL) queries simpler.
 
Which command should the analyst use? 

AS
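
To illustrate aliasing, the sketch below gives a deliberately long, made-up table name the short alias `s` with `AS`, again using an in-memory SQLite database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table with a deliberately long name.
conn.execute(
    "CREATE TABLE quarterly_regional_sales_summary (region TEXT, total REAL)"
)
conn.executemany(
    "INSERT INTO quarterly_regional_sales_summary VALUES (?, ?)",
    [("East", 120.0), ("West", 80.0)],
)

# AS assigns the temporary name "s", so columns can be written as s.column.
alias_rows = conn.execute(
    "SELECT s.region, s.total "
    "FROM quarterly_regional_sales_summary AS s "
    "ORDER BY s.region"
).fetchall()
print(alias_rows)
conn.close()
```

The alias lasts only for the duration of the query; the table itself keeps its original name.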

18

An analyst working to aggregate data in databases uses a Structured Query Language (SQL) command to create a temporary name for a table and receives an ERROR message when the command is used.

Which command does the analyst use to create a temporary name for a table?

FROM

19

An analyst working in a spreadsheet needs to determine the total quantity of numerical values within a specific range that are unique.

Which function should the analyst use?

COUNT DISTINCT

20

An analyst is working in a database and needs to use a function that will return all the records from one table and only the matching records from a second table.
 
How does the analyst determine the order of the tables using a JOIN function? 

The table mentioned first is on the left.
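
A brief sketch of that ordering with a LEFT JOIN in SQLite: the table named first (after FROM) is the left table, so all of its rows are returned, and unmatched columns from the right table come back as NULL (`None` in Python). The `employees` and `badges` tables are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical tables for illustration only.
conn.execute("CREATE TABLE employees (id INTEGER, name TEXT)")
conn.execute("CREATE TABLE badges (employee_id INTEGER, badge TEXT)")
conn.executemany("INSERT INTO employees VALUES (?, ?)", [(1, "Dee"), (2, "Eli")])
conn.executemany("INSERT INTO badges VALUES (?, ?)", [(1, "B-100")])

# employees is mentioned first, so it is the left table and keeps every row.
left_rows = conn.execute(
    "SELECT employees.name, badges.badge "
    "FROM employees LEFT JOIN badges ON employees.id = badges.employee_id "
    "ORDER BY employees.id"
).fetchall()
print(left_rows)
conn.close()
```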

21

An analyst is working on a spreadsheet and notices that rows and columns that should remain unchanged are changing when they are copied. This data set is generally problematic, and the analyst is trying to solve many problems under time constraints.
 
What does the analyst do to correct this issue with the data-aggregation process as quickly as possible?

Lock the table array part of the function

22

An analyst working in Google Sheets wants to find the lowest possible average of monthly sales in a range of values.
 
Which function should the analyst use? 

MIN

23

An analyst working in Google Sheets wants to calculate the total revenue of each transaction recorded. Each transaction can have exactly one item type and the data is organized by a category other than quantity.
 
Which function should the analyst use? 

SUMIF
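
The logic of a conditional sum can be sketched in Python. The helper below is a hypothetical analog whose arguments follow the order of the spreadsheet form `SUMIF(range, criterion, sum_range)`; the sample data is invented.

```python
def sum_if(criteria_range, criterion, sum_range):
    """Add the values in sum_range whose matching criteria cell equals criterion."""
    return sum(
        value
        for cell, value in zip(criteria_range, sum_range)
        if cell == criterion
    )

item_types = ["pen", "book", "pen", "ink"]
revenue = [5, 20, 7, 9]

pen_total = sum_if(item_types, "pen", revenue)
print(pen_total)
```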

24

An analyst working with data in Google Sheets wants to count the number of times a record appears in a column based on several conditions.
 
Which syntax should the analyst use? 

=COUNTIFS(criteria_range1, criterion1, [criteria_range2, criterion2, ...])
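
As a rough Python analog of that syntax, the hypothetical helper below takes (range, condition) pairs and counts the positions where every condition holds. Conditions are passed as functions here, which is an assumption of the sketch; the spreadsheet function takes criterion strings or values instead.

```python
def count_ifs(*range_condition_pairs):
    """Count row positions where every (range, condition) pair is satisfied."""
    ranges = [values for values, _ in range_condition_pairs]
    conditions = [condition for _, condition in range_condition_pairs]
    return sum(
        all(condition(cell) for condition, cell in zip(conditions, row))
        for row in zip(*ranges)
    )

regions = ["East", "West", "East", "East"]
units = [10, 30, 25, 40]

# Count records that are in the East region AND have more than 20 units.
matches = count_ifs(
    (regions, lambda r: r == "East"),
    (units, lambda u: u > 20),
)
print(matches)
```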

25

An analyst working in BigQuery applies a clause to a query creating a specific set of rows and columns that executes the same process as a temporary table.
 
Which clause does the analyst apply to the query? 

WITH
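
The same idea can be tried outside BigQuery. The sketch below uses SQLite, which also supports `WITH`, and a hypothetical `trips` table: the clause defines a named result set (`station_totals`) that the main query then reads as if it were a temporary table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table for illustration only.
conn.execute("CREATE TABLE trips (station TEXT, minutes INTEGER)")
conn.executemany(
    "INSERT INTO trips VALUES (?, ?)",
    [("A", 10), ("A", 30), ("B", 5), ("B", 7), ("C", 50)],
)

query = """
WITH station_totals AS (
    SELECT station, SUM(minutes) AS total_minutes
    FROM trips
    GROUP BY station
)
SELECT station, total_minutes
FROM station_totals
WHERE total_minutes > 20
ORDER BY total_minutes DESC
"""
cte_rows = conn.execute(query).fetchall()
print(cte_rows)
conn.close()
```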

26

An analyst working in Google Sheets wants to multiply individual revenue amounts by profit margin percentages and then calculate overall profit margins for a given range.
 
Which function should the analyst use to save time while performing the calculation?

SUMPRODUCT
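
What SUMPRODUCT computes can be sketched in a few lines of Python: multiply the two ranges element by element, then add the products. The function name and the sample figures are assumptions made for the example.

```python
def sum_product(first_range, second_range):
    """Multiply paired values from two equal-length ranges and sum the products."""
    if len(first_range) != len(second_range):
        raise ValueError("ranges must be the same length")
    return sum(a * b for a, b in zip(first_range, second_range))

revenue = [100, 200, 300]
profit_margin = [0.25, 0.5, 0.75]

total_profit = sum_product(revenue, profit_margin)
overall_margin = total_profit / sum(revenue)
print(total_profit, overall_margin)
```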

27

An analyst working with data in Google Sheets wants to calculate the total sales for each department, the number of products each department sold, and the department that generated the most revenue, but the data in the original spreadsheet must be maintained.

Which approach should the analyst take?

Use a pivot table to make calculations

28

Which best practice helps to ensure the integrity of data before beginning to work on a data set? 

Data validation

29

An analyst is comparing the common features of R and Python programming languages.
 
Which feature do both programming languages share? 

They both offer communities for code development and support

30

What does an analyst use to identify how words and symbols should be used in R programming language?

Syntax

31

What is a unique challenge an analyst faces when using R programming language as opposed to Python? 

Users may have difficulty using R programming language naming conventions.

32

An analyst wants to record the steps of analysis so they can double-check their work and solve problems.
 
How do general-purpose programming languages such as R support this? 

By automatically storing code to reproduce and share work at any time

33

A web designer is responsible for the layout and styling of web pages that feature text, graphics, and video. The web designer needs to use a programming language that provides structure for web pages and is used to connect to hosting platforms.
 
Which programming language does the web designer use? 

Hypertext Markup Language v5

34

A web-application developer designs network applications used all across the web. They need to use a programming language to create enterprise web applications that can run on multiple clients.
 
Which programming language should the web application developer use?

Java

35

An application developer who specializes in video games needs to use a programming language that is an extension of the C programming language that is also used to create console games such as those for Xbox.
 
Which programming language should the game developer use?

C++

36

Which tab does an analyst access to see the results of a diagram that represents values for two variables in RStudio? 

Plots

37

An analyst is working in RStudio and wants to visit an active online community on the social media platform Reddit, where R users go to discuss R, ask questions, and share tips.
 
Which resource does the analyst use for this? 

r/RLanguage

38

An analyst is working with a package in the RStudio tidyverse used for cleansing wide and long data to ensure that every part of a data table or data frame is the correct type located in the correct place.
 
Which package in the tidyverse is the analyst using? 

tidyr

39

An analyst is working in RStudio with a tool in R programming language (R) used for expressing a sequence of multiple operations by taking the output of one statement and making it the input of the next statement.
 
Which tool is the analyst using? 

Pipe
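
Python has no built-in pipe operator, but the idea can be approximated. In the sketch below, `pipe` is a hypothetical helper that feeds the output of each step into the next; the nested call underneath computes the same result and shows what the pipe style avoids.

```python
from functools import reduce

def pipe(value, *steps):
    """Pass value through each step in order; each output is the next input."""
    return reduce(lambda result, step: step(result), steps, value)

temps = [3.2, -1.5, 7.8]

# Piped style: read left to right as "absolute values, then max, then round".
piped = pipe(temps, lambda xs: [abs(x) for x in xs], max, round)

# Equivalent nested style: read from the inside out.
nested = round(max([abs(x) for x in temps]))
print(piped, nested)
```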

40

An analyst working in RStudio wants to ensure that they are not hard coding any values within the code that do not change when they need to.
 
Which best practice does the analyst apply to increase readability?

Make the code dynamic

41

An analyst is working in RStudio with code that performs a particular function and is contained within code that performs a broader function.
 
Which tool is the analyst working with? 

Nested function

42

An analyst is working with atomic vectors to organize and store data in R programming language (R). The analyst needs to determine the properties of the vector they are working with.
 
Which function should the analyst use? 

typeof()

43

A data analyst wants to obtain the standard deviation statistical summary of a data set using R.
 
Which function should be used? 

sd()
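
R's `sd()` returns the sample standard deviation (dividing by n - 1). For comparison, Python's standard library has the same calculation in `statistics.stdev`; the data below is invented for the example.

```python
import math
import statistics

values = [2, 4, 4, 4, 5, 5, 7, 9]

# Sample standard deviation (n - 1 denominator), the same definition as sd().
sample_sd = statistics.stdev(values)

# The same quantity computed by hand, to show what the function does.
mean = sum(values) / len(values)
manual_sd = math.sqrt(sum((v - mean) ** 2 for v in values) / (len(values) - 1))
print(sample_sd, manual_sd)
```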

44

Which tidyverse package can transfer data from Microsoft Excel files to R? 

readxl

45

An analyst runs the clean_names() function on a data set.
 
What happens to the column names after completing this task? 

Special characters are removed.

46

Which function can be used to create a single column from two separate columns in R? 

unite()
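
To show what uniting columns does, here is a loose Python analog that works on a list of dictionaries rather than a data frame. The helper mimics tidyr's defaults as an assumption of the sketch: an underscore separator, with the two input columns dropped after merging.

```python
def unite(rows, new_col, col_a, col_b, sep="_"):
    """Merge col_a and col_b into new_col for every row, dropping the originals."""
    united = []
    for row in rows:
        merged = {k: v for k, v in row.items() if k not in (col_a, col_b)}
        merged[new_col] = f"{row[col_a]}{sep}{row[col_b]}"
        united.append(merged)
    return united

people = [
    {"first": "Ada", "last": "Lovelace", "age": 36},
    {"first": "Alan", "last": "Turing", "age": 41},
]
result = unite(people, "full_name", "first", "last", sep=" ")
print(result)
```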

47

A data analyst wants to set values for variables in R using operators.
 
Which type of operator should be used? 

Assignment

48

What is a best practice for file naming when working in R? 

Do not start or end file names with symbols.

49

Which tidyverse package is primarily used for creating data visualizations? 

ggplot2

50

Which layer should be used to control the type of a ggplot2 plot?

Geoms layer

51

Which symbol must be added to the code to add a new layer to a ggplot2 plot? 

Plus (+)

52

Which function should be used to create the data frame for a plot in ggplot2? 

ggplot()

53

A data analyst wants to create a bar chart using the ggplot2 package in R.

Which function should be used? 

geom_bar()

54

Which layer can be added to a ggplot2 plot to help explain its purpose or to highlight important data? 

Annotate

55

Which function can be used to draw arrows or add shapes to a plot that was built using the ggplot2 package? 

annotate()

56

What does an analyst add to indicate the beginning or end of a data item in R Markdown? 

Delimiter

57

An analyst reviewing code in R Markdown notices output warnings that are unrelated to the impact on the review's findings.
 
How does the analyst create a final report for stakeholders without unrelated warnings? 

Change code chunk options

58

An analyst working in R Markdown needs to convert a file that is currently in the .Rmd format to another file type for a report that needs to be shared with stakeholders.
 
How should the analyst do this? 

Use the knit option

59

An analyst wants to create a monthly report using a simple format that runs one line of code to update the data each month rather than creating the report from scratch each time.
 
Which best practice does the analyst apply to create the report with the most recent information? 

Make a template and update code

60

An analyst who is working with a .Rmd file sets off a word with two asterisks on either side before knitting.
 
What does the analyst achieve with this action? 

Adding a bold font

61

An analyst needs to use R Markdown to document an analysis.
 
How does the analyst set up the R Markdown environment? 

Through the install packages function

62

An analyst working in R Markdown needs to begin a code chunk to load two packages.
 
What should the analyst add to begin the code chunk? 

An “r” in braces

63

The analyst wants to use a tool with files that execute everything as code sequentially using R or Python programming languages.
 
Which tool should the analyst use? 

Jupyter Notebook

64

An analyst wants to convert an R Markdown file to an output that publishes a group of related data visualizations using sidebars, tab sets, value boxes, and gauges.
 
Which output format should the analyst create? 

Dashboard

65

An analyst needs to edit the structured reference data that sorts and identifies aspects of the information it describes.
 
What helps the analyst to do this? 

YAML

66

An analyst needs to show stakeholders the code embedded in a file, its output, evidence for the findings, and the sources.
 
What does the analyst use to do this? 

Code chunk

67

An analyst adds a code chunk into an R Markdown file.

What is the value of adding a code chunk into the file?

It allows the analyst to check the code for errors.

68

Which input is needed to generate a new R Markdown file? 

Metadata

69

Which types of content make up an R Markdown file? 

Code chunks and text

70

Which outcome does the bias function compare to the predicted outcome to determine whether the model is biased or unbiased? 

Actual outcome
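
One way to picture this comparison is as the average difference between actual and predicted values, where a result near zero suggests an unbiased model. The sketch below assumes that definition and that sign convention (actual minus predicted); it is not the R function itself, and the temperature figures are invented.

```python
def bias(actual, predicted):
    """Average of (actual - predicted); values near 0 suggest little bias."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must be the same length")
    return sum(a - p for a, p in zip(actual, predicted)) / len(actual)

actual_temp = [68, 70, 72]
predicted_temp = [67, 70, 71]

model_bias = bias(actual_temp, predicted_temp)
print(model_bias)
```

A positive result here means the predictions ran low on average, while a negative result means they ran high.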