1/14
Name | Mastery | Learn | Test | Matching | Spaced |
|---|
No study sessions yet.
A photographer stores digital photographs on her computer. In this case the photographs are considered the data. Each photograph also includes multiple pieces of metadata including:
Date: The date the photograph was taken
Time: The time the photograph was taken
Location: The location where the photograph was taken
Device: Which camera the photo was taken with
Which of the following could the photographer NOT do based on this metadata?
Filter photos to those taken of buildings.
This question refers to the same collection of photos and their metadata described in the previous question.
Due to a computer error, the Time metadata for all of the photographs is accidentally set to the same time of day. The other pieces of metadata and the data itself are not affected by the error. Which of the following is most likely to be the result of this problem?
The photographer will not be able to filter to photographs taken in the morning.
The next three questions all refer to the same dataset described below.
A school is conducting a survey of students to learn more about how they get to school. Students were asked how they travel to school, how long it takes them to get to school, what time they arrive at school, and for a description of their most significant challenges when traveling to school. Several rows of the data collected are shown in the table below.
Which column is data will likely be most difficult to visualize or analyze?
Biggest Challenges
This question refers to the same table of data described above.
Which column of data might be able to be visualized or analyzed but first would need to be cleaned?
How Long
This question refers to the same table of data described above.
Which information could likely not be found with the data collected?
Whether students who are new to the school take longer to get to school than other students.
The chart above shows the frequency of searches for the words "pencil" and "pen" over a 5 year period. Which of the following is the best interpretation of the chart above?
At any point in time there are typically more people searching for the word "pen" than "pencil."
The mayor of a city is interested in learning what goals are most important for residents of the city. Members of her staff visit one of the many neighborhoods in the city and ask 20 residents to fill out a survey. The mayor is concerned that the survey may be biased and not accurately reflect the overall interests in her town. Which of the following strategies is most likely to address concerns about the data being biased or inaccurate
Visiting multiple new neighborhoods to collect more survey responses.
In which of the following situations would parallel systems most likely be used to help analyze data?
Data analysis involving large datasets.
The next two questions refer to the following chart.
The chart below has a single point for every breed of dog. Each point indicates the maximum weight of that breed and the maximum number of years that breed of dog lives to.
Which of the following conclusions is best supported by the scatter plot?
Dog breeds with a higher maximum weight tend to have shorter maximum lifespans.
This question refers to the chart from the previous question.
Which of the following conclusions is best supported by the scatter plot?
All dog breeds that weigh less than 50 pounds have a maximum lifespan of more than 10 years.
The histogram chart below has a single bar for every state in the United States. Each state is placed in a bucket, or range, based on its area in square miles.
Which of the following conclusions is best supported by the histogram chart?
The most common size of states is between 50,000 and 100,000 square miles.
A bank is developing an algorithm that will help them decide who to make loans to. As input the algorithm will consider information about the person applying for the loan and the amount of money requested, and as output the algorithm will decide whether the bank should give that loan.
The bank intends to develop the algorithm using machine learning techniques. The algorithm will be trained using data from past loan decisions made by human bankers.
Which of the following best describes whether this algorithm will include bias?
The algorithm will likely reflect the human biases in the data used to train it.
A restaurant is interested in learning about the food preferences of people living nearby to the restaurant and intends to use survey data to help decide which new items to add to the menu. Which of the following is least likely to be part of the process used to analyze the data?
Cleaning a data visualization to remove unwanted patterns.
A researcher is interested in learning more about the different kinds of plants growing in different areas of the state she lives in. The researcher creates an app that allows residents of the town to photograph plants in their area using a smartphone and record date, time, and location of the photograph. Afterwards the researcher will analyze the data to try to determine where different kinds of plants grow in the state.
Which of the following does this situation best demonstrate?
Citizen science.
A town decides to publicize data it has collected about electricity usage around the city. The data is freely available for all to use and analyze in the hopes that it is possible to identify more efficient energy usage strategies.
Which of the following does this situation best demonstrate?
Open data.