CompTIA Data+ Certification Exam Questions and Answers
A site reliability team wants to monitor the stability of their website. so they can proactively diagnose issues when they occur Which of the following deliverables would best suit their needs?
A data analyst is performing a data merge within a spreadsheet using the tables below:
The analyst is attempting to pull the addresses from Table 2 into Table 1 using the last names and is receiving an error message. Which of the following steps can the analyst perform to fix the error?
Different people manually type a series of handwritten surveys into an online database. Which of the following issues will MOST likely arise with this data? (Choose two.)
Which of the following contains alphanumeric values?
Which of the following best describes an exploratory analysis?
A data analyst is setting up a data dashboard to monitor several ETL data streams to ensure that data is complete for later analysis. Which of the following audiences should the analyst target for this dashboard?
A data analyst is creating a report that will provide information about various regions, products, and time periods. Which of the following formats would be themost efficient way to deliver this report?
Which of the following data cleansing issues will be fixed when a DISTINCT function is applied?
Which of the following types of dashboards should a business intelligence engineer develop in order to provide information about failed data pipelines?
An analyst is currently working on a ticket to revamp a company-wide dashboard that has been in use for five years. Which of the following should be the first step in the development process?
For which of the following test statistics would a low value imply a potentially meaningful result?
Each month an analyst needs to execute a data pull for the two prior months. Which of the following is the most efficient function for the analyst to use?
Which of the following is the best technique for transferring data from one database to another with some data manipulation?
Which one of the following values will appear first if they are sorted in descending order?
Exhibit.
Which of the following logical statements results in Table B?
A)
B)
C)
D)
A company’s marketing department wants to do a promotional campaign next month. A data analyst on the team has been asked to perform customer segmentation, looking at how recently a customer bought the product, at what frequency, and at what value. Which of the following types of analysis would this practice be considered?
A data analyst is developing a data dictionary that aligns with a company's data management processes and policies. Which of the following best describes what should be included in the data dictionary?
Given the below:
Which of the following numbers represents a Type I error?
A data analyst needs to present the results of an online marketing campaign to the marketing manager. The manager wants to see the most important KPIs and measure the return on marketing investment. Which of the following should the data analyst use to BEST communicate this information to the manager?
A database administrator is required to mask certain table columns containing Pll in order to comply with the company privacy policy. Which of the following are the most likely types of information the administrator should mask? (Select two).
An analyst is working on a project for a director. During this process. the analyst pulled the data. created summarized tables and graphs with descriptions, created a report summary, and inserted all items into a report. After writing the report, which of the following would be the most appropriate next step?
Which of the following is a relational database?
A data analyst has been asked to merge the tables below, first performing an INNER JOIN and then a LEFT JOIN:
Customer Table -
In-store Transactions –
Which of the following describes the number of rows of data that can be expected after performing both joins in the order stated, considering the customer table as the main table?
An analyst is updating a customer contacts database with information obtained from a survey of new customers. Which of the following data manipulation techniques should the analyst use?
Which of the following value is the measure of dispersion "range" between the scores of ten students in a test.
The scores of ten students in a test are 17, 23, 30, 36, 45, 51, 58, 66, 72, 77.
A stakeholder wants to see daily sales targets organized in a dashboard by country, state, city, and ZIP Code. Which of the following delivery considerations must a data analyst take into account when creating the dashboard?
A junior web developer is developing a new application where users can upload short videos. The first task is to create a homepage that shows the headline "Upload Your Short Videos" and a clickable button that says "upload now".
Which of the following HTML commands would help the developer to complete the task successfully?
A healthcare data analyst notices that one data set in the column for BloodPressure contains several outliers that need to be replaced with meaningful values. Which of the following data manipulation techniques should the analyst use?
After completing web scraping, which of the following file formats needs to be parsed?
A sales manager wants quarterly sales reports broken down by unit and week. Which of the following data output lists includes the most necessary information?
Which of the following is the best description of the term "data governance"?
A data analyst is asked to create a sales report for the second-quarter 2020 board meeting, which will include a review of the business’s performance through the second quarter. The board meeting will be held on July 15, 2020, after the numbers are finalized. Which of the following report types should the data analyst create?
Which of the following database types is the best to use for transactional SQL?
A data analyst needs to observe the relationship between two numeric variables and identify the clustering pattern as well as the outliers. Which of the following visualizations should the analyst use?
An analyst needs to join two data sets that compare vehicle weights. One data set is in pounds, and the other has various units of measure. Which of the following should the analyst do first to the data prior to any type of join?
An analyst for a small business with multiple locations is using each location’s quarterly sales reports from last year to create a single revenue report for the year. Which of the following data mining techniques should the analyst use to complete this task?
Which of the following techniques should an analyst use to analyze a data set to get a snapshot of basic measures of central tendency?
An analyst is reviewing the following data:
Car IDSpeed
123155
566436
564418
650567
546436
645638
Which of the following should the analyst include in the measures of central tendency for speed?
An analyst wants to check the progress and performance regarding the number of customers an organization served in the last six years. Which of the following represents the type of analysis theanalyst should perform?
Which of the following can be used to translate data into another form so it can only be read by a user who has a key or a password?
A collections manager has a team calling customers who are past due on their accounts in an attempt to collect payments. The manager receives the call list in the form of a printed report that is generated by the accounting department at the beginning of each week. Consequently, the collections team calls some customers who have made payments in the time since the report was last printed. Which of the following reporting enhancements could the accounting department implement to best reduce the number of calls on current accounts?
A data analyst must fulfill a request for information that is needed weekly and should be automatically emailed to a specific set of users. Which of the following types of reports should theanalyst recommend?
An analyst needs to determine the appropriate data type for the following sample data:
sample data collected:
Which of the following data types should be used for this data?
Which of the following data types must be used when working with variables that require classification into two or more groups before analysis?
Which of the following types of data manipulation functions should a data analyst use to implement a YES/NO condition in a spreadsheet?
A data analyst needs to write a SOL query measuring last month's website visits and distribute a summary report to the marketing team. Which of the following is the analyst creating?
A data analyst received a large amount of third-party data that needs to be joined with in-house data files. After the data is joined, the analyst notices three columns all contain dates. Which of the following should the analyst do to maintain data consistency?
A data set was recorded using multimedia technology. Which of the following is a necessary step on the way to interpretation?
An analyst computed a new variable of income per day in the household by multiplying the number of days worked by the number of people working in the household and the income earned per day. Which of the following is the correct name for this new variable?
A data analyst who works for a government agency is required to obtain the average income of citizens. The list of citizens is given in the following table:
A value for one citizen's income is missing. Which of the following approaches should the data analyst take to solve this issue?
Samantha needs to share a list of her organization's top 50 customers with the VP of sales.
She would like to include the name of the customer, the business they represent, their contact information, and their total sales over the past year.
The VP does not have any specialized analytics skills or software but would like to make some personal notes on the dataset.
What would be the best tool for Samantha to use to share this information?
A data architect is designing a data solution for a retail clothing store chain. Each store has a database that tracks sales transactions. The data architect needs to create a summary table that will be used for a senior executive dashboard. The summary table should not contain duplicate store information. Which of the following should the data architect create?
Which of the following is the best reason for removing data outliers?
Which of the ing is the correct ion for a tab-delimited spre file?
A gambler thinks that a coin is fair and is equally likely to turn up heads or tails when the coin is flipped. Which of the following tests should the gambler use to fest this hypothesis?
An analyst needs to conduct a quick analysis. Which of the following is the FIRST step the analyst should perform with the data?
A data engineer needs to store data that can be natively used by an API. Which of the following should the engineer use to best accomplish this task?
A data analyst is working for a shipping company and calculating the volume of boxes according to the following formula:
volume = height × width × depth.
Which of the following variable types describes volume?
A data analyst is building a closed won quarter-over-quarter report for the sales team. Which of the following will be needed to complete this request?
An analyst wants to combine two data sets into a single spreadsheet. Column names from the first spreadsheet are listed in rows in the second spreadsheet. Which of the following is the first step the analyst should take to combine the data sets?
A data analyst wants to create "Income Categories" that would be calculated based on the existing variable "Income". The "Income Categories" would be as follows:
Income category 1: less than $1.
Income category 2: more than $1 and less than $20,000.
Income category 3: more than $20,001 and less than $40,000.
Income category 4: more than $40,001.
Which of the following data manipulation techniques should the data analyst use to create "Income Categories"?
A financial institution is reporting on sales performance to a company at the account level. Due to the sensitive nature of the government the does il with, some account information is not shown. Which of the following fields should be masked?
What R package makes it easy to work with dates?
A sales analyst needs to report how the sales team is performing to target. Which of the following files will be important in determining 2019 performance attainment?
The total values in this month's revenue report are twice as much as last month's. Which of the following most likely occurred during the ETL process?
A data analyst needs to create a data visualization that aids in un the cumulative impact of sequentially introduced values that are positive or negative. Which of the following
data visualization methods should the analyst use?
Which of the following data types would a telephone number formatted as XXX-XXX-XXXX be considered?
Which of the following terms best describes a situation in which a rating scale does not conform to previously agreed-upon requirements?
Which of the following roles is responsible for ensuring an organization's data quality, security, privacy, and regulatory compliance?
A company's human resources department has asked a data analyst to categorize the income of all employees into five salary bands:
Which of the following types of functions would be the most appropriate to use?
An analyst reviews the following table:
Which of the following data types is represented in the values in the RefNo column?
You have two databases tables that you would like to join together using a foreign key relationship.
What term best describes this action?
The number of phone calls that the call center receives in a day is an example of:
Which of the following best describes a business analytics tool with interactive visualization and business capabilities and an interface that is simple enough for end users to create their own reports and dashboards?
Python
Which of the following is the most likely reason for a data analyst to optimize a query using parameterization?
Given the following:
Which of the following is the most important thing for an analyst to do when transforming the table for a trend analysis?
Which one of the following in NOT a common data integration tool?
Which of the following types of analyses is best to use when tracking sales revenue against quarterly targets?
Which of the following is an example of PII?
Which of the following activities occurs during the ETL process?
Which of the following is the correct data type for text?
What category of data stewardship work is focused on ensuring that the organization respects the wishes of data subjects?
Which of the following is a characteristic of a relational database?
Which of the following statistical methods requires two or more categorical variables?
Which of the following is concatenate typically used to combine?
Which of the following differentiates a flat text file from other data types?
Which of the following is a domain-specific language used in programming that is designed for managing data that is held in a relational data stream management system?
An analyst is designing a dashboard to determine which site has the highest percentage of new customers. The analyst must choose an appropriate chart to include in the dashboard. The following data is available:
Which of the following types of charts should be considered to best display the data?
An analyst is training a new coworker on the importance of data governance and is focusing on security requirements. Which of the following should the analyst include in the training?
(Select two).
Given the following customer and order tables:
Which of the following describes the number of rows and columns of data that would be present after performing an INNER JOIN of the tables?
A development company is constructing a new unit in its apartment complex. The complex has the following floor plans:
Using the average cost per square foot of the original floor plans, which of the following should be the price of the Rose unit?
Given the table below:
Which of the following variable types BEST describes the “Year” column?
An analyst wants to include a graph in a quarterly sales report that shows the comparison between two quantitative variables. Which of the following visual diagrams can the analyst use to most effectively represent this relationship?
Which of the following defines the policies and procedures for managing the master data?
Given the following table:
Date of visit
Age
Gender
6/1/22
30
Male
6/15/22
65F
Fem.
6/19/2022
24
M
Which of the following describes the data quality issues with the age data?
What role in a data governance is typically responsible for day-to-day oversight of data use?
The duration of a phone call in milliseconds is an example of:
Which of the following is the best approach to use to gain a general understanding of a data set?
Which of the following types of analysis is used when comparing last week's sales to the previous week's sales?
A data analyst is creating a dashboard and trying to identify the type of information that should be included. Which of the following should the analyst consider first?
A data analyst for a media company needs to determine the most popular movie genre. Given the table below:
Which of the following must be done to the Genre column before this task can be completed?
A data analyst needs to perform a full outer join of a customer's orders using the tables below:
Which of the following is the mean of the order quantity?
Which of the following query statements would be used when filtering data in a relational database management system? (Select two).
Which of the following would be the best way to identify multicollinear attributes in a data set?
Which of the following is an example of structured data?
Given the following athlete workout data (with inconsistent units or formats for time/distance), which of the following best describes the data quality issue?
Given the following tables:
Which of the following will be the dimensions from a FULL JOIN of the tables above?
Given the customer table below:
Which of the following chart types is the most appropriate to represent the average spending of active customers vs. inactive customers?
Which of the following are reasons to conduct data cleansing? (Select two).
Which of the following technologies would be best suited for creating a multiple linear regression model?
Encryption is a mechanism for protecting data.
When should encryption be applied to data?
Choose the best answer.
Which of the following data protection methods provides confidentiality for data in transit?
Given the diagram below:
Which of the following data schemas shown?
Five dogs have the following heights in millimeters:
300, 430, 170, 470, 600
Which of the following is the mean height for the five dogs?
Which of the following best describes a difference between JSON and XML?
A sales team wants visibility of current sales numbers, pipeline, and team performance. The team would also like to see calculations of individuals’ earned commissions and projected commissions based on sales, but they want that information to be kept confidential. Which of the following would be the BEST way to provide this visibility?
A customer's telephone number is in the format 123-456-7890. Which of the following data types is used for the phone number?
A data analyst is working with a data set and would like to combine two fields into a single field. Which of the following data manipulation techniques should the analyst use?