Special Summer Sale Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 70percent

CompTIA DA0-001 CompTIA Data+ Certification Exam Exam Practice Test

Demo: 108 questions
Total 363 questions

CompTIA Data+ Certification Exam Questions and Answers

Question 1

Which of the following should an analyst do to best summarize the data on a data set?

Options:

A.

Filtering

B.

Aggregation

C.

Sorting

D.

Concatenation

Question 2

An analyst wants to test the association between the number of doors in a car and the number of gears in the car. Which of the following is the best test to use?

Options:

A.

F-test

B.

Acceptance test

C.

Chi-squared test

D.

Z-test

Question 3

A column is being used to store strings of variable lengths. Performance is a concern, so the column needs to use as little space as possible. Which of the following data types best meets these requirements?

Options:

A.

char

B.

nchar

C.

varchar

D.

nvarchar

Question 4

A data analyst is helping a retail store categorize its customers into five different groups based on the following information:

• How recently the customers made purchases

• How frequently the customers made purchases

• How much the customers spent

Given the following information:

Which of the following would be most important for the analysis?

Options:

A.

CustomerJD. Channel, Order_Date

B.

CustomerJD, Territory. Amount

C.

CustomerJD, Order_Date. Amount

D.

CustomerJD. Quantity, Amount

Question 5

Mario works with a group of R programmers tasked with copying data from an accounting system into a data warehouse.

In what phase are the group's R skills most relevant?

Options:

A.

Extract.

B.

Load.

C.

Transform.

D.

Purge.

Question 6

A data analyst needs to present the results of an online marketing campaign to the marketing manager. The manager wants to see the most important KPIs and measure the return on marketing investment. Which of the following should the data analyst use to BEST communicate this information to the manager?

Options:

A.

A real-time monitor that allows the manager to view performance the day the campaign was launched

B.

A sell-service dashboard that allows the manager to look at the company's annual budget performance

C.

A spreadsheet of the raw data from all marketing campaigns and channels

D.

A summary with statistics, conclusions, and recommendations from the data analyst

Question 7

What SQL command is used to delete an entire table from a database?

Options:

A.

DROP.

B.

MODIFY.

C.

DELETE.

D.

ALTER.

Question 8

Which of the following will MOST likely be streamed live?

Options:

A.

Machine data

B.

Key-value pairs

C.

Delimited rows

D.

Flat files

Question 9

An analyst is reporting on the average income for a county and is reviewing the following data:

Which of the following is the reason the analyst would need to cleanse the data in this data set?

Options:

A.

Data completeness

B.

Data outliers

C.

Duplicate data

D.

Missing values

Question 10

Which of the following would be used to store unstructured data from different sources?

Options:

A.

A data lake

B.

A database management system

C.

A database

D.

A data warehouse

Question 11

An analyst is currently working on a ticket for revamping a company-wide dashboard that has been in use for five years. Which of the following should be the first step in the development process?

Options:

A.

Talk to the group that made the request to determine the desired goal.

B.

Make changes to a frequently used report that is already in production.

C.

Build an additional dashboard with fewer views that are tailored toward each specific team.

D.

Develop a more streanMined dashboard to roll out by the next delivery date.

Question 12

Given the following data tables:

Which of the following MDM processes needs to take place FIRST?

Options:

A.

Creation of a data dictionary

B.

Compliance with regulations

C.

Standardization of data field names

D.

Consolidation of multiple data fields

Question 13

Which of the following is an example of PII?

Options:

A.

Age

B.

Name

C.

Ethnicity

D.

Gender

Question 14

Which of the following report types is most appropriate for a high-level, year-end report requested by a Chief Executive Officer?

Options:

A.

Dynamic

B.

Recurring

C.

Ad hoc

D.

Self-service

Question 15

A data analyst is asked to create a sales report for the second-quarter 2020 board meeting, which will include a review of the business’s performance through the second quarter. The board meeting will be held on July 15, 2020, after the numbers are finalized. Which of the following report types should the data analyst create?

Options:

A.

Static

B.

Real-time

C.

Self-service

D.

Dynamic

Question 16

Which of the following is used for calculations and pivot tables?

Options:

A.

IBM SPSS

B.

SAS

C.

Microsoft Excel

D.

Domo

Question 17

Which of the following data types best describe 4Ac1? (Select two).

Options:

A.

Alphanumeric

B.

Symbolic

C.

Numeric

D.

Float

E.

Boolean

F.

String

Question 18

Taylor wants to investigate how manufacturing, marketing, and sales expenditures impact overall profitability for her company.

Which of the following systems is the most appropriate?

Options:

A.

OLTP.

B.

OLAP.

C.

Data warehouse.

D.

Data mart.

Question 19

Which of the following best describes an exploratory analysis?

Options:

A.

Involves the use of descriptive statistics to understand observations

B.

Involves analysis of exploring data sets for performance tracking

C.

Involves the testing of specific hypotheses

D.

Involves the use of arithmetic algebra to determine the distribution

Question 20

Which of the following analysis techniques is an unsupervised data mining process?

Options:

A.

Clustering

B.

Descriptive

C.

Regression

D.

Predictive

Question 21

Given the table below:

Which of the following variables can be considered inconsistent, and how many distinct values should the variable have?

Options:

A.

Name, one

B.

Gender, two

C.

Level, three

D.

Code, four

E.

Region, five

Question 22

Which of the following is the best description of discrete data types?

Options:

A.

Non-numeric data used to describe attributes of a population sample

B.

The frequency of the number of times each value occurs by using whole numbers

C.

Numeric values that can be measured on a continuous scale

D.

Non-numeric data used to describe attributes of a population sample ranked in a specific order

Question 23

Which of the following data cleansing issues will be fixed when a DISTINCT function is applied?

Options:

A.

Missing data

B.

Duplicate data

C.

Redundant data

D.

Invalid data

Question 24

A report is scheduled to run and be distributed at the end of business each day. On Mondays, one of the recipients opens the previous week's reports and combines them to calculate the weekly totals and projections for the coming week. This is a tedious process, and the recipient asks an analyst for help. Which of the following should the analyst recommend?

Options:

A.

Add calculation fields to the daily report so the totals are built in.

B.

Create a new report with weekly totals set to run at the end of business on Friday.

C.

Provide a daily summary to the report with totals to save the user the effort of manual calculations.

D.

Reduce the frequency of the report to once a week and change the date range.

Question 25

An analyst is working with the income data of suburban families in the United States. The data set has a lot of outliers, and the analyst needs to provide a measure that represents the typical income. Which of the following would BEST fulfill the analyst’s goal?

Options:

A.

Median

B.

Mean

C.

Mode

D.

Standard deviation

Question 26

A data analyst is designing a dashboard that will provide a story of sales and determine which site is providing the highest sales volume per customer. The analyst must choose an appropriate chart to include in the dashboard. The following data is available:

Which of the following types of charts should be considered?

Options:

A.

Include a line chart using the site and average sales per customer.

B.

Include a pie chart using the site and sales to average sales per customer.

C.

Include a scatter chart using sales volume and average sales per customer.

D.

Include a column chart using the site and sales to average sales per customer.

Question 27

A user receives a large custom report to track company sales across various date ranges. The user then completes a series of manual calculations for each date range. Which of the following should an analyst suggest so the user has a dynamic, seamless experience?

Options:

A.

Create multiple reports, one for each needed date range.

B.

Build calculations into the report so they are done automatically.

C.

Add macros to the report to speed up the filtering and calculations process.

D.

Create a dashboard with a date range picker and calculations built in.

Question 28

Which of the following data governance concepts fits into the security requirements category?

Options:

A.

Data transmission

B.

Data deletion

C.

Data use agreements

D.

Personally identifiable information

Question 29

Which of the following data types should an analyst use to provide the most flexibility when recording emails on a form?

Options:

A.

Alphanumeric

B.

Text

C.

Discrete

D.

Continuous

Question 30

Which one of the following programming languages is specifically designed for use in analytics applications?

Options:

A.

Python.

B.

R

C.

C++

D.

Java.

Question 31

Which of the following variable name formats would be problematic if used in the majority of data software programs?

Options:

A.

First_Name_

B.

FirstName

C.

First_Name

D.

First Name

Question 32

A junior web developer is developing a new application where users can upload short videos. The first task is to create a homepage that shows the headline "Upload Your Short Videos" and a clickable button that says "upload now".

Which of the following HTML commands would help the developer to complete the task successfully?

Options:

A.

< span >Upload Your Short Videos< /span >< button >upload now< /button >

B.

< p >Upload Your Short Videos< /p >< p >upload now< /p >

C.

< hl >Upload Your Short Videos< /h1 >< button >upload now< /button >

D.

< hl >Upload Your Short Videos< /h1 >< hl >upload now< /h1 >

Question 33

Which of the following is a difference between a primary key and a unique key?

Options:

A.

A unique key cannot take null values, whereas a primary key can take null values.

B.

There can be only one primary key in a data set, whereas there can be multiple unique keys.

C.

A primary key can take a value more than once, whereas a unique key cannot take a value more than once.

D.

A primary key cannot be a date variable, whereas a unique key can be.

Question 34

After completing web scraping, which of the following file formats needs to be parsed?

Options:

A.

.html

B.

.txt

C.

.csv

D.

.tsv

Question 35

Which of the following is a control measure for preventing a data breach?

Options:

A.

Data transmission

B.

Data attribution

C.

Data retention

D.

Data encryption

Question 36

Which of the following is an example of a flat file?

Options:

A.

CSV file

B.

PDF file

C.

JSON file

D.

JPEG file

Question 37

An organization wants to evaluate whether project activities are within the set projections and in line to meet the desired project targets. Which of the following types of analysis is best suited for this situation?

Options:

A.

Trend analysis

B.

Performance analysis

C.

Descriptive analysis

D.

Exploratory analysis

Question 38

Which of the following should be accomplished NEXT after understanding a business requirement for a data analysis report?

Options:

A.

Rephrase the business requirement.

B.

Determine the data necessary for the analysis

C.

Build a mock dashboard/presentation layout.

D.

Perform exploratory data analysis.

Question 39

Which of the following is the best variable formal to store a customer's age using the least possible amount of storage data?

Options:

A.

Int

B.

Float

C.

Char

D.

Double

Question 40

A data analyst for a media company needs to determine the most popular movie genre. Given the table below:

Which of the following must be done to the Genre column before this task can be completed?

Options:

A.

Append

B.

Merge

C.

Concatenate

D.

Delimit

Question 41

An analyst has generated a report that includes the number of months in the first two quarters of 2019 when sales exceeded $50,000:

Which of the following functions did the analyst use to generate the data in the Sales_indicator column?

Options:

A.

Aggregate

B.

Logical

C.

Date

D.

Sort

Question 42

Which of the following is an example of a strategy to reduce statistical errors?

Options:

A.

Removing outliers

B.

Adding more data

C.

Transformation

D.

Recoding data

Question 43

A marketing analytics team received customer transaction data from two different sources. The data is complete and accurate; however, the field names appear to be inconsistent. Given the following tables:

Which of the following is considered best practice if the team wants to consolidate the files and conduct further analysis?

Options:

A.

Standardize the field names.

B.

Recode the data values.

C.

Overwrite the field names in one of the tables.

D.

Edit the field names in the data dictionary.

Question 44

Which one of the following is a common data warehouse schema?

Options:

A.

Snowflake.

B.

Square.

C.

Spiral.

D.

Sphere.

Question 45

A data analyst needs to create a master file that includes customer information from the tables below:

Given the three tables above, the analyst wants to filter down the information prior to joining it together. In which of the following orders should this data manipulation bo approached for the most efficient result?

Options:

A.

Merge, append, deduplicate

B.

Merge, deduplicate, append

C.

Deduplicate, append, merge

D.

Append, deduplicate, merge

Question 46

Given the following data set:

Which of the following is the best reason for cleansing the data?

Options:

A.

Duplicate data

B.

Imputed data

C.

Redundant data

D.

Corrupt data

Question 47

An analyst is creating a resource to improve users' experience when they select specific records based on particular dates. Which of the following should the analyst use to create a resource that best meets user needs?

Options:

A.

Drop-down menu

B.

Date range

C.

Text field

D.

Frequency

Question 48

During data cleansing, an analyst conducts measures of central tendency on a data set. Which of the following data is the analyst attempting to identify?

Options:

A.

Duplicate

B.

Missing

C.

Outlying

D.

Invalid

Question 49

An analyst is building a new dashboard for a user. After an initial conversation with the user. the analyst created a mock-up of the dashboard. Which of the following best explains why the analyst created the mock-up?

Options:

A.

To identify the dimensions and measures

B.

To send to the client after deploying the dashboard to production

C.

To confirm important details before dashboard development begins

D.

To receive client approval for the final dashboard design

Question 50

Which of the following is the median of the number set:3, 7, 5, 6, 9?

Options:

A.

5

B.

6

C.

7

D.

9

Question 51

A collections manager has a team calling customers who are past due on their accounts in an attempt to collect payments. The manager receives the call list in the form of a printed report that is generated by the accounting department at the beginning of each week. Consequently, the collections team calls some customers who have made payments in the time since the report was last printed. Which of the following reporting enhancements could the accounting department implement to best reduce the number of calls on current accounts?

Options:

A.

Modify the date range on the report

B.

Include a time stamp on the report.

C.

Increase the frequency of report generation.

D.

Add a report run date to the report.

Question 52

A Chief Executive Officer (CEO) is requesting more up-to-date sales data for improved visibility prior to month-end. An analyst must determine the frequency of a sales report that was previously distributed on an as-needed basis. Which of the following would be the most appropriate frequency for this report?

Options:

A.

Monthly

B.

Quarterly

C.

Weekly

D.

Every other month

Question 53

Which of the following defines the policies and procedures for managing the master data?

Options:

A.

Data administration

B.

Data stewardship

C.

Data ownership

D.

Data governance

Question 54

Given the following data sample:

Which of the following best describes the data quality issue?

Options:

A.

Data outlier

B.

Consistent data

C.

Duplicate data

D.

Invalid data

Question 55

An analyst collected data that includes primary account numbers, expiration dates, and service codes. Which of the following data governance classifications is used to describe this data?

Options:

A.

PI I

B.

PCI

C.

PBI

D.

PHI

Question 56

A data analyst has been asked to organize the table below in the following ways:

By sales from high to low -

By state in alphabetic order -

Which of the following functions will allow the data analyst to organize the table in this manner?

Options:

A.

Conditional formatting

B.

Grouping

C.

Filtering

D.

Sorting

Question 57

An analyst needs to know what data an organization possesses. Which of the following is the best document for the analyst to consult?

Options:

A.

Data destruction policy

B.

Data use document

C.

Data dictionary

D.

Data retention policy

Question 58

Given the following customer and order tables:

Which of the following describes the number of rows and columns of data that would be present after performing an INNER JOIN of the tables?

Options:

A.

Five rows, eight columns

B.

Seven rows, eight columns

C.

Eight rows, seven columns

D.

Nine rows, five columns

Question 59

Standardized tests are given to students in the middle of each month, and the results are ready by the end of the month. The superintendent needs a quick view of test performance. Which of the following would be the best recommendation to meet the superintendent's requirements?

Options:

A.

A dashboard with a continuous data stream and saved searches

B.

A report of test scores by classroom, emailed to the superintendent at the end of the month

C.

A report of test scores with pie charts showing student performance

D.

A dashboard with a scheduled delivery, the ability to filter scores by school, and bar charts for comparison

Question 60

A database administrator needs to ensure only approved users can access specific database tables to perform financial functions. Which of the following is the best access control method for the administrator to use?

Options:

A.

Role-based

B.

Rule-based

C.

Discretionary

D.

Group-based

Question 61

A database administrator is required to mask certain table columns containing Pll in order to comply with the company privacy policy. Which of the following are the most likely types of information the administrator should mask? (Select two).

Options:

A.

Government-issued ID

B.

Address

C.

Order ID

D.

Order date

E.

Customer ID

F.

Referral number

Question 62

An analyst computed a new variable of income per day in the household by multiplying the number of days worked by the number of people working in the household and the income earned per day. Which of the following is the correct name for this new variable?

Options:

A.

Derived

B.

Categorical

C.

Continuous

D.

Control

Question 63

Angela is aggregating data from CRM system with data from an employee system.

While performing an initial quality check, she realizes that her employee ID is not associated with her identifier in the CRM system.

What kind of issues is Angela facing?

Choose the best answer.

Options:

A.

ETL process.

B.

Record linkage.

C.

ELT process.

D.

System integration.

Question 64

A database consists of one fact table that is composed of multiple dimensions. Depending on the dimension, each one can be represented by a denormalized table or multiple normalized tables. This structure is an example of a:

Options:

A.

transactional schema.

B.

star schema.

C.

non-relational schema.

D.

snowflake schema.

Question 65

An analyst is required to run a text analysis of data that is found in articles from a digital news outlet. Which of the following would be the BEST technique for the analyst to apply to acquire the data?

Options:

A.

Web scraping

B.

Sampling

C.

Data wrangling

D.

ETL

Question 66

A publishing group has requested a dashboard to track submissions before publication. A key requirement is that all changes are tracked, as multiple users will be checking out documents and editing them before submissions are considered final. Which of the following is the BEST way to meet this stakeholder requirement?

Options:

A.

Display the version number next to each submission on the dashboard.

B.

Present a data refresh date at the top of the dashboard.

C.

Confirm the dashboard is adhering to the corporate style guide.

D.

Use permissions to ensure users only see certain versions of the submissions.

Question 67

Which one of the following in NOT a common data integration tool?

Options:

A.

XSS

B.

ELT

C.

ETL

D.

APIs

Question 68

Which of the following programming languages are best suited for analysis and machine-learning applications? (Select two).

Options:

A.

Ruby

B.

Rust

C.

PHP

D.

Python

E.

Kotlin

F.

R

Question 69

An analyst needs to provide a chart to identify the composition between the categories of the survey response data set:

Which of the following charts would be BEST to use?

Options:

A.

Histogram

B.

Pie

C.

Line

D.

Scatter pot

E.

Waterfall

Question 70

A data analyst needs to apply quality control concepts to a data set for accuracy. Which of the following is the best way to do this?

Options:

A.

Standardization

B.

Parameterization

C.

Encryption

D.

Cross-validation

Question 71

Given the following data:

Which of the following BEST describes the data set?

Options:

A.

There is data bias.

B.

The data is incomplete.

C.

The data is inconsistent.

D.

The data is outliers.

Question 72

A data analyst has a set of data that shows the number of gallons of oil produced each day. The company would like to know the standard deviation for the data set. The variance for the data is 36 gallons. Which of the following is the standard deviation for gallons produced?

Options:

A.

1.16

B.

6

C.

36

D.

72

Question 73

A data analyst needs to present the results of an online marketing campaign to the marketing manager. The manager wants to see the most important KPIs and measure the return on marketing investment. Which of the following should the data analyst use to BEST communicate this information to the manager?

Options:

A.

A real-time monitor that allows the manager to view performance the day the campaign was launched

B.

A sell-service dashboard that allows the manager to look at the company’s annual budget performance

C.

A spreadsheet of the raw data from all marketing campaigns and channels

D.

A summary with statistics, conclusions, and recommendations from the data analyst

Question 74

Which of the following is the first step an analyst should perform upon receiving a business request for analysis?

Options:

A.

Determine the data needs and sources for analysis.

B.

Initiate the analysis for exploratory data analysis.

C.

Review the business questions to understand the scope.

D.

Finalize the methodology to solve the problem.

Question 75

A reporting analyst is creating a dashboard that shows the year-over-year performance for a sales organization. Which of the following is the best visual for the analyst use to illustrate the organization's performance?

Options:

A.

Pie chart

B.

Scatter plot

C.

Heat map

D.

Line chart

Question 76

Which of the following is an example of a at flat file?

Options:

A.

CSV file

B.

PDF file

C.

JSON file

D.

JPEG file

Question 77

Given the image below:

Which of the following file formats is depicted?

Options:

A.

JSON

B.

CSV

C.

XML

D.

HTML

Question 78

You are working with a professional statistician to perform an analysis and would like to use a statistics package.

Which one of the following would be the most appropriate?

Options:

A.

Rapid Miner.

B.

QLIK.

C.

Power BI.

D.

Minitab.

Question 79

A development company is constructing a new Init in its apartment complex. The complex has the following floor plans:

Using the average cost per square foot of the original floor plans. which of the following should be the price of the Rose Init?

Options:

A.

$640,900

B.

$690,000

C.

$705,200

D.

$702,500

Question 80

Consider two different datasets, one with gas prices and the other with food prices. Which of the following measures is most affected by outliers?

Options:

A.

Absolute value

B.

Mode

C.

Median

D.

Mean

Question 81

A data analyst is creating a report that will provide information about various regions, products, and time periods. Which of the following formats would be themost efficient way to deliver this report?

Options:

A.

A workbook with multiple tabs for each region

B.

A daily email with snapshots of regional summaries

C.

A static report with a different page for every filtered view

D.

A dashboard with filters at the top that the user can toggle

Question 82

Which of the following is the best reason for removing data outliers?

Options:

A.

Data varies significantly from others.

B.

Data is redundant in the table.

C.

Data is duplicated in the whole range.

D.

Data is missing from the table.

Question 83

Which of the following types of dashboards should a business intelligence engineer develop in order to provide information about failed data pipelines?

Options:

A.

Referencing

B.

Strategic

C.

Operational

D.

Technical

Question 84

An analyst has written the following code:

SELECT *

FROM Cust_table

WHERE age > 60 AND City = "New York"

Which of the following criteria is the analyst retrieving?

Options:

A.

All customers older than age 60 in New York state

B.

All customers aged 60 and older in New York state

C.

All customers older than age 60 in New York City

D.

All customers younger than age 60 in New York City

Question 85

Which of the following describes the method of sampling in which elements of data are selected randomly from each of the small subgroups within a population?

Options:

A.

Simple random

B.

Cluster

C.

Systematic

D.

Stratified

Question 86

Which of the following is an example of a data-mining ETL tool?

Options:

A.

SSIS

B.

Stata

C.

SPSS

D.

Cognos

Question 87

Given the following grocery store orders:

If a query is made to the table with the following logic:

Order_Total > 132 OR (Order Total >= 25 AND Order_Total < 74)

Which of the following is the number of orders that will be returned by the query?

Options:

A.

Four

B.

Five

C.

Six

D.

Seven

Question 88

An analyst wants to check the progress and performance regarding the number of customers an organization served in the last six years. Which of the following represents the type of analysis theanalyst should perform?

Options:

A.

Correlation analysis

B.

Trend analysis

C.

Regression analysis

D.

Descriptive analysis

Question 89

A data analyst is developing a dashboard to track and monitor metrics. Which of the following best practices should be taken into during the FIRST pment process?

Options:

A.

Create a A Aupirarrame:

B.

Deploy to production.

C.

Copy a dashboard design from the Internet.

D.

Develop a dashboard.

Question 90

An analyst needs to join two tables of data together for analysis. All the names and cities in the first table should be joined with the corresponding ages in the second table, if applicable.

Which of the following is the correct join the analyst should complete. and how many total rows will be in one table?

Options:

A.

INNER JOIN, two rows

B.

LEFT JOIN. four rows

C.

RIGHT JOIN. five rows

D.

OUTER JOIN, seven rows

Question 91

Which of the following is most likely to be used as a data-mining ETL tool?

Options:

A.

SSIS

B.

Stata

C.

SPSS

D.

Cognos

Question 92

What category of data stewardship work is focused on ensuring that the organization respects the wishes of data subjects?

Options:

A.

Data quality.

B.

Data privacy.

C.

Data security.

D.

Regulatory compliance.

Question 93

Given the following table:

Which of the following describes the data quality issues with theagedata?

Options:

A.

Completeness

B.

Consistency

C.

Accuracy

D.

Manipulation

Question 94

A research analyst collects ten data points from 1.000 specimens. The analyst will not need any additional data to complete the analysis and will not need to retrieve information by specifier. Which of the following is the best data structure for the analyst to use?

Options:

A.

NoSQL

B.

Flat file

C.

JSON

D.

Relational database

Question 95

Which of the following occurs if a 90% confidence interval increases to 95%?

Options:

A.

The margin of error does not change.

B.

The interval remains the same.

C.

The interval becomes narrower.

D.

The margin of error doubles.

Question 96

Which of the following data sampling methods involves dividing a population into subgroups by similar characteristics?

Options:

A.

Systematic

B.

Simple random

C.

Convenience

D.

Stratified

Question 97

Which of the following is a domain-specific language used in programming that is designed for managing data that is held in a relational data stream management system?

Options:

A.

SAS

B.

SQL

C.

Python

D.

R

Question 98

Which one the following is not considered an aggregate function?

Options:

A.

SUM

B.

MIN

C.

SELECT

D.

MAX

Question 99

The duration of a phone call in milliseconds is an example of:

Options:

A.

ordinal data.

B.

nominal data.

C.

boolean data.

D.

continuous data.

Question 100

After the daily ETL jobs are completed, the data in the reports does not appear complete, and a lot of data seems to be missing. Which of the following concepts should be used to assess and investigate further?

Options:

A.

Cross-validation

B.

Data profiling

C.

Data integrity

D.

Data consistency

Question 101

Given the following data:

CustomerID

ItemBought

Date

Tre_234

Sofa

2022-09-08

216_Tre

Shoes

08/02/2021

215/Tre

Blanket

2021/06/20

045/Tre

Mug

12-26-2021

Tre-345

Lamp

31/08/2022

TREJD19

Bucket

2022'08/01

Which of the following best describes the main issue in the data set?

Options:

A.

Inconsistent data

B.

Data mismatch

C.

Invalid data

D.

Redundant data

Question 102

A data analyst needs to create a weekly recurring report on sales performance and distribute it to all sales managers. Which of the following would be the BEST method to automate and ensure successful delivery for this task?

Options:

A.

Use scheduled report delivery.

B.

Implement subscription access delivery.

C.

Print out a copy.

D.

Upload the report to the server.

Question 103

An analyst has been asked to validate data quality. Which of the following are the BEST reasons to validate data for quality control purposes? (Choose two.)

Options:

A.

Retention

B.

Integrity

C.

Transmission

D.

Consistency

E.

Encryption

F.

Deletion

Question 104

Randy scored 76 on a math test, Katie scored 86 on a science test, Ralph scored 80 on a history test, and Jean scored 80 on an English test. The table below contains the mean and standard deviation of the scores for each of the courses:

Using this information, which of the following students had the BEST score?

Options:

A.

Randy

B.

Katie

C.

Ralph

D.

Jean

Question 105

Which of the following best describes a business analytics tool with interactive visualization and business capabilities and an interface that is simple enough for end users to create their own reports and dashboards?

    Python

Options:

A.

R

B.

Microsoft Power Bl

C.

SAS

Question 106

Which of the following concepts should be applied if a data set with 40 fields needs to be pared down to 20 fields and contains similar data across multiple fields?

Options:

A.

Duplication

B.

Consolidation

C.

Compliance

D.

Standardization

Question 107

When analyzing the values of two variables, you decide to convert both variables so they are on a scale of 0 to 1.

What term describes this action?

Options:

A.

Filtering.

B.

Normalization.

C.

Transposition.

D.

Aggregation.

Question 108

A sales team wants visibility of current sales numbers, pipeline, and team performance. The team would also like to see calculations of individuals’ earned commissions and projected commissions based on sales, but they want that information to be kept confidential. Which of the following would be the BEST way to provide this visibility?

Options:

A.

Create a dashboard displaying a data refresh date so users know the current sales numbers and configure permissions to control access.

B.

Create a dashboard for sales numbers, pipeline, and team and individual performance for the management team.

C.

Create a dashboard with filters for the overall team, individuals, and management. Users can filter to see the data they want.

D.

Create a dashboard with views for team, individuals, and management. Configure permissions to control access.

Demo: 108 questions
Total 363 questions