CompTIA Data+ Certification Exam Questions and Answers
A military commander would like to see the health scorecards of the troops daily and filter them based on gender and rank. Considering this data is PHI, which of the following would be the best way for the commander to view the information?
The director of operations at a power company needs data to help identify where company resources should be allocated in order to monitor activity for outages and restoration of power in the entire state. Specifically, the director wants to see the following:
* County outages
* Status
* Overall trend of outages
INSTRUCTIONS:
Please, select each visualization to fit the appropriate space on the dashboard and choose an appropriate color scheme. Once you have selected all visualizations, please, select the appropriate titles and labels, if applicable. Titles and labels may be used more than once.
If at any time you would like to bring back the initial state of the simulation, please click the Reset All button.
A company's human resources department has asked a data analyst to categorize the income of all employees into five salary bands:
Which of the following types of functions would be the most appropriate to use?
Which one of the following is a measure of dispersion?
The current date is July 14, 2020. A data analyst has been asked to create a report that shows the company’s year-over-year Q2 2020 sales. Which of the following reports should the analyst compare?
An analyst is updating a customer contacts database with information obtained from a survey of new customers. Which of the following data manipulation techniques should the analyst use?
A data analyst needs to collect a similar proportion of data from every state. Which of the following sampling methods would be the most appropriate?
Jhon is working on an ELT process that sources data from six different source systems.
Looking at the source data, he finds that data about the sample people exists in two of six systems.
What does he have to make sure he checks for in his ELT process?
Choose the best answer.
Given the image below:
Which of the following file formats is depicted?
Which of the following is the best description of the term "data governance"?
Which of the following best describes how discrete data differs from continuous data?
What R package makes it easy to work with dates?
A data analyst received the information in the table below from a recently completed marketing campaign:
Which of the following is the total order conversion rate?
Which of the following data cleansing issues will be fixed when a DISTINCT function is applied?
A data analyst has removed the outliers from a data set due to large variances. Which of the following central tendencies would be the best measure to use?
Standardized tests are given to students in the middle of each month, and the results are ready by the end of the month. The superintendent needs a quick view of test performance. Which of the following would be the best recommendation to meet the superintendent's requirements?
Alex wants to use data from his corporate sale, CRM, and shipping systems to try and predict future sales.
Which of the following systems is the most appropriate?
Choose the best answer.
Kelly wants to get feedback on the final draft of a strategic report that has taken her six months to develop.
What can she do to get prevent confusion as see seeks feedback before publishing the report?
Choose the best answer.
A marketing analytics team received customer transaction data from two different sources. The data is complete and accurate; however, the field names appear to be inconsistent. Given the following tables:
Which of the following is considered best practice if the team wants to consolidate the files and conduct further analysis?
An analyst is currently working on a ticket for revamping a company-wide dashboard that has been in use for five years. Which of the following should be the first step in the development process?
Which of the following descriptive statistical methods are measures of central tendency? (Choose two.)
What SQL command is used to delete an entire table from a database?
A data analyst needs to create a dashboard using the company's yearly revenue data sets. Which of the following would be the best way to plot the information to show the top-performing region?
You are working with a dataset and want to change the names of categories that you used fordifferent types of books.
What term best describes this action?
Which of the following tools would be best to use to calculate the interquartile range, median, mean, and standard deviation of a column in a table that has 5.000.000 rows?
A data analyst has been asked to organize the table below in the following ways:
By sales from high to low -
By state in alphabetic order -
Which of the following functions will allow the data analyst to organize the table in this manner?
Which of the following BEST describes the issue in which character values are mixed with integer values in a data set column?
Which of the following is a relational database?
A data analyst is helping a retail store categorize its customers into five different groups based on the following information:
• How recently the customers made purchases
• How frequently the customers made purchases
• How much the customers spent
Given the following information:
Which of the following would be most important for the analysis?
A data engineer is creating a database field to capture whether a customer likes vanilla ice cream. Which of the following data types is the best to capture this information?
A data analyst reviews the following data set:
Which of the following is the range value?
What subset of Structured Query Language (SQL) is used to add, remove, modify, or retrieve the information stored within a relational database?
Which of the following data protection methods provides confidentiality for data in transit?
A data analyst is developing a data dictionary that aligns with a company's data management processes and policies. Which of the following best describes what should be included in the data dictionary?
Which of the following actions should be taken when transmitting data to mitigate the chance of a data leak occurring? (Choose two.)
An analyst reviews the following table:
Which of the following data types is represented in the values in the RefNo column?
Which of the following data analysis tools increases the efficiency of data visualizations?
Joe. an analyst. tests the loading time on a dashboard he is preparing to go live and finds it is slower than he would like. Which of the following must occur to decrease the loading time?
An analyst is building a new dashboard for a user. After an initial conversation with the user. the analyst created a mock-up of the dashboard. Which of the following best explains why the analyst created the mock-up?
A healthcare data analyst notices that one data set in the column for BloodPressure contains several outliers that need to be replaced with meaningful values. Which of the following data manipulation techniques should the analyst use?
While reviewing survey data, an analyst notices respondents entered “Jan,” “January,” and “01” as responses for the month of January. Which of the following steps should be taken to ensure data consistency?
Which of the following is the best description of discrete data types?
Which of the following is the best reason to use database views instead of tables?
An analyst needs to join two tables of data together for analysis. All the names and cities in the first table should be joined with the corresponding ages in the second table, if applicable.
Which of the following is the correct join the analyst should complete. and how many total rows will be in one table?
Which of the following techniques is used to quantify data?
What category of data stewardship work is focused on ensuring that the organization respects the wishes of data subjects?
Which of the following data types would a telephone number formatted as XXX-XXX-XXXX be considered?
Given the following data:
Which of the following BEST describes the data set?
You have two databases tables that you would like to join together using a foreign key relationship.
What term best describes this action?
A data scientist wants to see which products make the most money and which products attract the most customer purchasing interest in their company.
Which of the following data manipulation techniques would he use to obtain this information?
Which of the following contains alphanumeric values?
An analyst is explaining the company’s financial systems and reporting tools to a new coworker. Which of the following data quality dimensions are the most important? (Select three).
Given the table below:
Which of the following variables can be considered inconsistent, and how many distinct values should the variable have?
Which of the following query statements would be used when filtering data in a relational database management system? (Select two).
An analyst conducted a preliminary analysis for a data set and identified several patterns and anomalies. Which of the following analysis techniques did the analyst use?
A data analyst needs to create a master file that includes customer information from the tables below:
Given the three tables above, the analyst wants to filter down the information prior to joining it together. In which of the following orders should this data manipulation bo approached for the most efficient result?
A database administrator is required to mask certain table columns containing PII in order to comply with the company privacy policy. Which of the following are the most likely types of information the administrator should mask? (Select two).
A database consists of one fact table that is composed of multiple dimensions. Each dimension is represented by a denormalized table. This structure is an example of a:
An analyst is reviewing the following data:
Car IDSpeed
123155
566436
564418
650567
546436
645638
Which of the following should the analyst include in the measures of central tendency for speed?
Which of the following data manipulation techniques is an example of a logical function?
A Chief Executive Officer (CEO) is requesting more up-to-date sales data for improved visibility prior to month-end. An analyst must determine the frequency of a sales report that was previously distributed on an as-needed basis. Which of the following would be the most appropriate frequency for this report?
An analyst needs to determine the appropriate data type for the following sample data:
sample data collected:
Which of the following data types should be used for this data?
Which of the following value is the measure of dispersion "range" between the scores of ten students in a test.
The scores of ten students in a test are 17, 23, 30, 36, 45, 51, 58, 66, 72, 77.
A stakeholder wants to see daily sales targets organized in a dashboard by country, state, city, and ZIP Code. Which of the following delivery considerations must a data analyst take into account when creating the dashboard?
While reviewing survey data, a research analyst notices data is missing from all the responses to a single question. Which of the following methods would BEST address this issue?
An analyst has received the requirements for an internal user dashboard. The analyst confirms the data sources and then creates a wireframe. Which of the following is the NEXT step the analyst should take in the dashboard creation process?
Which of the following is a KPI metric for tracking sales performance?
Mario works with a group of R programmers tasked with copying data from an accounting system into a data warehouse.
In what phase are the group's R skills most relevant?
A data analyst has a set of data that shows the number of gallons of oil produced each day. The company would like to know the standard deviation for the data set. The variance for the data is 36 gallons. Which of the following is the standard deviation for gallons produced?
A customer survey reveals 90% positive feedback. Which of the following statistical methods would be best to utilize to determine the reliability of a data set and predict how a larger sample of customers over the same time period might respond?
Which of the following summary statements upholds integrity in data reporting?
A junior web developer is developing a new application where users can upload short videos. The first task is to create a homepage that shows the headline "Upload Your Short Videos" and a clickable button that says "upload now".
Which of the following HTML commands would help the developer to complete the task successfully?
Which of the following best describes a 95% confidence interval?
A data analyst has been asked to merge the tables below, first performing an INNER JOIN and then a LEFT JOIN:
Customer Table -
In-store Transactions –
Which of the following describes the number of rows of data that can be expected after performing both joins in the order stated, considering the customer table as the main table?
Which of the following BEST describes standard deviation?
Consider this dataset showing the retirement age of 11 people, in whole years:
54, 54, 54, 55, 56, 57, 57, 58, 58, 60, 60
This tables show a simple frequency distribution of the retirement age data.
Which of the following data types must be used when working with variables that require classification into two or more groups before analysis?
A reporting analyst needs to create a report that refreshes automatically and is accessible to the entire sales organization. Which of the following tools is the most appropriate to use for this task?
An analyst is designing a dashboard to determine which site has the highest percentage of new customers. The analyst must choose an appropriate chart to include in the dashboard. The following data is available:
Which of the following types of charts should be considered to BEST display the data?
Which of the following is a domain-specific language used in programming that is designed for managing data that is held in a relational data stream management system?
A sales director has requested a report for individual team members within the division be developed. The director would like the report to be shared with all team members, but individual team members should not be identifiable within the report Which of the following access requirements would support the director's needs?
A data analyst is creating a report that will provide information about various regions, products, and time periods. Which of the following formats would be themost efficient way to deliver this report?
An analyst needs to conduct a quick analysis. Which of the following is the FIRST step the analyst should perform with the data?
A data analyst needs to create a data visualization that aids in un the cumulative impact of sequentially introduced values that are positive or negative. Which of the following
data visualization methods should the analyst use?
A data analyst has been asked to create a sales report that calculates the rolling 12-month average for sales. If the report will be published on November 1, 2020, which of the following months shouts the report cover?
A data set has the following values:
Which of the following is the best reason for cleansing the data?
Which of the following is the most appropriate to consider when creating a schema of a central group broken into detailed subcategories?
Which of the following is the most likely reason for a data analyst to optimize a query using parameterization?
Which of the following will MOST likely be streamed live?
An analyst runs a report on a daily basis, and the number of datapoints must be validated before the data can be analyzed. The number of datapoints increases each day by approximately 20% of the total number from the day before. On a given day, the number of datapoints was 8,798. Which of the following should be the total number of datapoints on the next day?
A data set was recorded using multimedia technology. Which of the following is a necessary step on the way to interpretation?
A sales manager wants quarterly sales reports broken down by unit and week. Which of the following data output lists includes the most necessary information?
Given the following report:
Which of the following components need to be added to ensure the report is point-in-time and static? (Select two).
Which of the following best describes the process of examining data for statistics and information about the data?
Cleansing
Given the following report:
Which of the following components need to be added to ensure the report is point-in-time and static? (Choose two.)
An analyst needs to summarize the number of people in Chicago in 2022 using the following set of data:
Which of the following steps should the analyst use to provide results? (Select two).
Which of the following file formats is best suited to start exploratory analysis within statistical software?
Which one the following is not considered an aggregate function?
Which of the following are the first steps a company should take after discovering a data breach? (Select two).
Which of the following is the best approach to use to gain a general understanding of a data set?
An analyst is designing a dashboard to determine which site has the highest percentage of new customers. The analyst must choose an appropriate chart to include in the dashboard. The following data is available:
Which of the following types of charts should be considered to best display the data?