CompTIA DA0-001 CompTIA Data+ Certification Exam Practice Test

Demo: 94 questions
Total 314 questions

CompTIA Data+ Certification Exam Questions and Answers

Question 1

A county in Illinois is conducting a survey to determine the mean annual income per household. The county is 427 sq mi (1,106 sq km). Which of the following sampling methods would MOST likely result in a representative sample?

Options:

A.

A stratified phone survey of 100 people that is conducted between 2:00 p.m. and 3:00 p.m.

B.

A systematic survey that is sent to 100 single-family homes in the county

C.

Surveys sent to ten randomly selected homes within 5 mi (8 km) of the county’s office

D.

Surveys sent to 100 randomly selected homes that are reflective of the population

Question 2

Which of the following values is the range, a measure of dispersion, of the scores of ten students in a test?

The scores of ten students in a test are 17, 23, 30, 36, 45, 51, 58, 66, 72, 77.

Options:

A.

90

B.

60

C.

70

D.

80
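
The range is the difference between the largest and smallest values in a data set. A minimal Python sketch of that arithmetic, using the ten scores listed in the question (the variable names are illustrative only):

```python
# Scores of the ten students, as listed in the question.
scores = [17, 23, 30, 36, 45, 51, 58, 66, 72, 77]

# Range = maximum value minus minimum value.
score_range = max(scores) - min(scores)
print(score_range)
```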

Question 3

A data analyst must fulfill a request for information that is needed weekly and should be automatically emailed to a specific set of users. Which of the following types of reports should the analyst recommend?

Options:

A.

A self-service report

B.

A research report

C.

An ad hoc report

D.

An operational report

Question 4

An analyst has been asked to validate data quality. Which of the following are the BEST reasons to validate data for quality control purposes? (Choose two.)

Options:

A.

Retention

B.

Integrity

C.

Transmission

D.

Consistency

E.

Encryption

F.

Deletion

Question 5

A report is scheduled to run and be distributed at the end of business each day. On Mondays, one of the recipients opens the previous week's reports and combines them to calculate the weekly totals and projections for the coming week. This is a tedious process, and the recipient asks an analyst for help. Which of the following should the analyst recommend?

Options:

A.

Add calculation fields to the daily report so the totals are built in.

B.

Create a new report with weekly totals set to run at the end of business on Friday.

C.

Provide a daily summary to the report with totals to save the user the effort of manual calculations.

D.

Reduce the frequency of the report to once a week and change the date range.

Question 6

A junior web developer is developing a new application where users can upload short videos. The first task is to create a homepage that shows the headline "Upload Your Short Videos" and a clickable button that says "upload now".

Which of the following HTML commands would help the developer to complete the task successfully?

Options:

A.

<span>Upload Your Short Videos</span><button>upload now</button>

B.

<p>Upload Your Short Videos</p><p>upload now</p>

C.

<h1>Upload Your Short Videos</h1><button>upload now</button>

D.

<h1>Upload Your Short Videos</h1><h1>upload now</h1>

Question 7

A research analyst wants to determine whether the data being analyzed is connected to other datapoints. Which of the following is the BEST type of analysis to conduct?

Options:

A.

Trend analysis

B.

Performance analysis

C.

Link analysis

D.

Exploratory analysis

Question 8

A database administrator needs to ensure only approved users can access specific database tables to perform financial functions. Which of the following is the best access control method for the administrator to use?

Options:

A.

Role-based

B.

Rule-based

C.

Discretionary

D.

Group-based

Question 9

A recurring event is being stored in two databases that are housed in different geographical locations. A data analyst notices the event is being logged three hours earlier in one database than in the other database. Which of the following is the MOST likely cause of the issue?

Options:

A.

The data analyst is not querying the databases correctly.

B.

The databases are recording different events.

C.

The databases are recording the event in different time zones.

D.

The second database is logging incorrectly.

Question 10

Which of the following best describes how discrete data differs from continuous data?

Options:

A.

Discrete data cannot create a sloped line.

B.

Discrete data can only be a finite number of values.

C.

Discrete data can have decimal points.

D.

Discrete data applies only to numbers.

Question 11

Emma is working in a data warehouse and finds that a finance fact table links to an organization dimension, which in turn links to a currency dimension that is not linked to the fact table.

What type of design pattern is the data warehouse using?

Options:

A.

Star.

B.

Sun.

C.

Snowflake.

D.

Comet.

Question 12

Which of the following data governance concepts fits into the security requirements category?

Options:

A.

Data transmission

B.

Data deletion

C.

Data use agreements

D.

Personally identifiable information

Question 13

Which of the following is a non-parametric test?

Options:

A.

One-sample t-test

B.

Two-way ANOVA

C.

Correlation coefficient

D.

Spearman's rank correlation

Question 14

You have two database tables that you would like to join together using a foreign key relationship.

What term best describes this action?

Options:

A.

Blending.

B.

Appending.

C.

Mixing.

D.

Merging.

Question 15

A cereal manufacturer wants to determine whether the sugar content of its cereal has increased over the years. Which of the following is the appropriate descriptive statistic to use?

Options:

A.

Frequency

B.

Percent change

C.

Variance

D.

Mean

Question 16

Given the data below:

In which of the following file formats is the data presented?

Options:

A.

XLS

B.

CSV

C.

RTF

D.

XML

Question 17

A data analyst is designing a dashboard that will provide a story of sales and determine which site is providing the highest sales volume per customer. The analyst must choose an appropriate chart to include in the dashboard. The following data is available:

Which of the following types of charts should be considered?

Options:

A.

Include a line chart using the site and average sales per customer.

B.

Include a pie chart using the site and sales to average sales per customer.

C.

Include a scatter chart using sales volume and average sales per customer.

D.

Include a column chart using the site and sales to average sales per customer.

Question 18

An analyst in a consumer bank department wants to showcase the concentration of accounts opened in the United States by ZIP Code to describe the effectiveness of the bank's marketing campaigns. Which of the following would be the best way to visualize the data?

Options:

A.

A stacked chart

B.

A tree map

C.

A waterfall chart

D.

A geographic map

Question 19

Which of the following data types must be used when working with variables that require classification into two or more groups before analysis?

Options:

A.

Discrete

B.

Numerical

C.

Alphanumeric

D.

Categorical

Question 20

An analyst develops an IT document and needs to describe the technical terms used in the document. Which of the following is where the analyst should include descriptions of the technical terms?

Options:

A.

Glossary

B.

System diagram

C.

User requirements

D.

Index

Question 21

Which of the following is a domain-specific language used in programming that is designed for managing data that is held in a relational data stream management system?

Options:

A.

SAS

B.

SQL

C.

Python

D.

R

Question 22

A database consists of one fact table that is composed of multiple dimensions. Each dimension is represented by a denormalized table. This structure is an example of a:

Options:

A.

non-relational schema.

B.

galaxy schema.

C.

snowflake schema.

D.

star schema.

Question 23

A data analyst needs to present the results of an online marketing campaign to the marketing manager. The manager wants to see the most important KPIs and measure the return on marketing investment. Which of the following should the data analyst use to BEST communicate this information to the manager?

Options:

A.

A real-time monitor that allows the manager to view performance the day the campaign was launched

B.

A self-service dashboard that allows the manager to look at the company’s annual budget performance

C.

A spreadsheet of the raw data from all marketing campaigns and channels

D.

A summary with statistics, conclusions, and recommendations from the data analyst

Question 24

Which of the following is the first step an analyst should perform upon receiving a business request for analysis?

Options:

A.

Determine the data needs and sources for analysis.

B.

Initiate the analysis for exploratory data analysis.

C.

Review the business questions to understand the scope.

D.

Finalize the methodology to solve the problem.

Question 25

An organization would like to add a secondary email field to its customer database in order to enrich the customer profiles. Which of the following data manipulation techniques should the analyst use to add this information?

Options:

A.

Blend

B.

Merge

C.

Append

D.

Aggregate

Question 26

A data analyst is creating a report that will provide information about various regions, products, and time periods. Which of the following formats would be the MOST efficient way to deliver this report?

Options:

A.

A workbook with multiple tabs for each region

B.

A daily email with snapshots of regional summaries

C.

A static report with a different page for every filtered view

D.

A dashboard with filters at the top that the user can toggle

Question 27

An analyst needs to summarize the number of people in Chicago in 2022 using the following set of data:

Which of the following steps should the analyst use to provide results? (Select two).

Options:

A.

Aggregation

B.

Sorting

C.

Filtering

D.

Indexing

E.

Cleaning

F.

Replacing

Question 28

The ACME Corporation hired an analyst to detect data quality issues in their Excel documents. Which of the following are the most common issues? (Select TWO)

Options:

A.

Apostrophe.

B.

Commas.

C.

Symbols.

D.

Duplicates.

E.

Misspellings.

Question 29

Which of the following concepts should be applied if a data set with 40 fields needs to be pared down to 20 fields and contains similar data across multiple fields?

Options:

A.

Duplication

B.

Consolidation

C.

Compliance

D.

Standardization

Question 30

A user receives a large custom report to track company sales across various date ranges. The user then completes a series of manual calculations for each date range. Which of the following should an analyst suggest so the user has a dynamic, seamless experience?

Options:

A.

Create multiple reports, one for each needed date range.

B.

Build calculations into the report so they are done automatically.

C.

Add macros to the report to speed up the filtering and calculations process.

D.

Create a dashboard with a date range picker and calculations built in.

Question 31

Which of the following is the BEST reason to use database views instead of tables?

Options:

A.

Views reduce the need for repetitive, complex data joins.

B.

Views allow for the storage of temporary data, whereas tables do not.

C.

Views allow for the joining of multiple data sources, whereas tables do not.

D.

Views can be used to restrict sensitive information.

Question 32

A JSON file is an example of:

Options:

A.

structured data.

B.

web data.

C.

machine data.

D.

processed data.

Question 33

Which of the following types of analysis is used when comparing last week's sales to the previous week's sales?

Options:

A.

Trend analysis

B.

Exploratory analysis

C.

Prescriptive analysis

D.

Link analysis

Question 34

Given the following grocery store orders:

If a query is made to the table with the following logic:

Order_Total > 132 OR (Order_Total >= 25 AND Order_Total < 74)

Which of the following is the number of orders that will be returned by the query?

Options:

A.

Four

B.

Five

C.

Six

D.

Seven

Question 35

The senior management team at a company receives a detailed sales report at the end of each quarter. The report is several pages long and includes data from dozens of offices across the country. The team wants a better way to get a quick snapshot of what is included in the report. Which of the following modifications would best meet this requirement?

Options:

A.

Modifying documentation elements to include reference data sources

B.

Modifying the font size and style so important data points are more visible

C.

Modifying the report to include a summary section with observations and insights

D.

Modifying the report layout so it is easier to follow and understand

Question 36

Which of the following data sampling methods involves dividing a population into subgroups by similar characteristics?

Options:

A.

Systematic

B.

Simple random

C.

Convenience

D.

Stratified

Question 37

Which of the following is an example of a flat file?

Options:

A.

CSV file

B.

PDF file

C.

JSON file

D.

JPEG file

Question 38

A table in a hospital database has a column for patient height in inches and a column for patient height in centimeters. This is an example of:

Options:

A.

dependent data.

B.

duplicate data.

C.

invalid data.

D.

redundant data.

Question 39

A data scientist wants to see which products make the most money and which products attract the most customer purchasing interest in their company.

Which of the following data manipulation techniques would the data scientist use to obtain this information?

Options:

A.

Data append

B.

Data blending

C.

Normalize data

D.

Data merge

Question 40

Given the following:

Which of the following is the most important thing for an analyst to do when transforming the table for a trend analysis?

Options:

A.

Fill in the missing cost where it is null.

B.

Separate the table into two tables and create a primary key

C.

Replace the extended cost field with a calculated field.

D.

Correct the dates so they have the same format.

Question 41

A reporting analyst is creating a dashboard that shows the year-over-year performance for a sales organization. Which of the following is the best visual for the analyst to use to illustrate the organization's performance?

Options:

A.

Pie chart

B.

Scatter plot

C.

Heat map

D.

Line chart

Question 42

Five dogs have the following heights in millimeters:

300, 430, 170, 470, 600

Which of the following is the standard deviation for the five dogs?

Options:

A.

147 mm

B.

154 mm

C.

394 mm

D.

21,704 mm
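
The question does not state whether the five dogs are treated as a whole population or as a sample, so the sketch below computes both forms of the standard deviation with Python's standard `statistics` module. The variable names are illustrative; only the five heights come from the question.

```python
import statistics

# Heights of the five dogs in millimeters, as listed in the question.
heights_mm = [300, 430, 170, 470, 600]

mean_mm = statistics.mean(heights_mm)          # arithmetic mean
population_sd = statistics.pstdev(heights_mm)  # divides the squared deviations by n
sample_sd = statistics.stdev(heights_mm)       # divides the squared deviations by n - 1

print(mean_mm, round(population_sd), round(sample_sd))
```

Comparing the two results against the answer options shows which convention the question appears to assume.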

Question 43

While reviewing survey data, a research analyst notices data is missing from all the responses to a single question. Which of the following methods would BEST address this issue?

Options:

A.

Replace missing data.

B.

Remove duplicate data.

C.

Replace redundant data.

D.

Remove invalid data.

Question 44

When analyzing the values of two variables, you decide to convert both variables so they are on a scale of 0 to 1.

What term describes this action?

Options:

A.

Filtering.

B.

Normalization.

C.

Transposition.

D.

Aggregation.
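
Rescaling a variable to the interval from 0 to 1 is commonly done with min-max scaling. A minimal sketch follows, assuming that technique is what the question has in mind; the function name and the sample values are hypothetical and do not come from the question.

```python
def min_max_normalize(values):
    """Rescale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values identical: there is no spread to rescale, so map everything to 0.0.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

# Hypothetical values for two variables measured on very different scales.
ages = [20, 30, 40, 60]
incomes = [30_000, 45_000, 60_000, 90_000]

print(min_max_normalize(ages))
print(min_max_normalize(incomes))
```

After rescaling, both variables share the same 0 to 1 scale and can be compared directly.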

Question 45

An analyst wants to combine two data sets into a single spreadsheet. Column names from the first spreadsheet are listed in rows in the second spreadsheet. Which of the following is the first step the analyst should take to combine the data sets?

Options:

A.

Blend

B.

Merge

C.

Concatenate

D.

Transpose

Question 46

Given the following data:

Which of the following BEST describes the data set?

Options:

A.

There is data bias.

B.

The data is incomplete.

C.

The data is inconsistent.

D.

The data has outliers.

Question 47

A data analyst is compiling a report that a Chief Executive Officer needs for an impromptu meeting. The report should include information on the previous day's performance. Which of the following reports should the analyst provide?

Options:

A.

Tactical

B.

Ad hoc

C.

Dynamic

D.

Recurring

Question 48

A data analyst has been asked to merge the tables below, first performing an INNER JOIN and then a LEFT JOIN:

Customer Table -

In-store Transactions –

Which of the following describes the number of rows of data that can be expected after performing both joins in the order stated, considering the customer table as the main table?

Options:

A.

INNER: 6 rows; LEFT: 9 rows

B.

INNER: 9 rows; LEFT: 6 rows

C.

INNER: 9 rows; LEFT: 15 rows

D.

INNER: 15 rows; LEFT: 9 rows

Question 49

Andy is a pricing analyst for a retailer. Using a hypothesis test, he wants to assess whether people who receive electronic coupons spend more on average.

What should Andy's null hypothesis be?

Options:

A.

People who receive electronic coupons spend more on average.

B.

People who receive electronic coupons spend less on average.

C.

People who receive electronic coupons do not spend more on average.

D.

People who do not receive electronic coupons spend more on average.

Question 50

An e-commerce company recently tested a new website layout. The website was tested by a test group of customers, and an old website was presented to a control group. The table below shows the percentage of users in each group who made purchases on the websites:

Which of the following conclusions is accurate at a 95% confidence interval?

Options:

A.

In Germany, the increase in conversion from the new layout was not significant.

B.

In France, the increase in conversion from the new layout was not significant.

C.

In general, users who visit the new website are more likely to make a purchase.

D.

The new layout has the lowest conversion rates in the United Kingdom.

Question 51

Which of the following query statements would be used when filtering data in a relational database management system? (Select two).

Options:

A.

ORDER BY

B.

HAVING

C.

WHERE

D.

SELECT

E.

INSERT

F.

GROUP BY
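
To see how SQL clauses behave when filtering, the sketch below runs two queries against an in-memory SQLite database through Python's standard `sqlite3` module. The `orders` table, its columns, and its rows are hypothetical and were made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("East", 50), ("East", 200), ("West", 20), ("West", 30), ("North", 500)],
)

# WHERE filters individual rows before any grouping happens.
row_filter = conn.execute(
    "SELECT region, amount FROM orders WHERE amount > 40 ORDER BY amount"
).fetchall()

# HAVING filters the groups produced by GROUP BY, after aggregation.
group_filter = conn.execute(
    "SELECT region, SUM(amount) AS total FROM orders "
    "GROUP BY region HAVING SUM(amount) > 100 ORDER BY region"
).fetchall()

print(row_filter)
print(group_filter)
```

In this sketch ORDER BY only changes the order of the rows and GROUP BY only forms the groups; neither removes any data on its own.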

Question 52

Which of the following summary statements upholds integrity in data reporting?

Options:

A.

Sales are approximately equal for Product A and Product B across all strategies.

B.

Strategy 4 provides the best sales in comparison to other strategies.

C.

While Strategy 2 does not result in the highest sales of Product D, over all products it appears to be the most effective.

D.

Product D should be promoted more than the other products in all strategies.

Question 53

An analyst is creating a resource to improve users' experience when they select specific records based on particular dates. Which of the following should the analyst use to create a resource that best meets user needs?

Options:

A.

Drop-down menu

B.

Date range

C.

Text field

D.

Frequency

Question 54

A business intelligence team wants to create a new dashboard in order to solve a problem statement. Which of the following is the correct order of steps the team should take?

Options:

A.

Determine business needs, find data sources, validate the data, create a mock-up, and analyze the information.

B.

Find data sources, determine business needs, validate the data, create a mock-up, and analyze the information.

C.

Create a mock-up, validate the data, analyze the information, determine business needs, and find data sources.

D.

Validate the data, find data sources, analyze the information, and determine business needs.

Question 55

Mario works with a group of R programmers tasked with copying data from an accounting system into a data warehouse.

In what phase are the group's R skills most relevant?

Options:

A.

Extract.

B.

Load.

C.

Transform.

D.

Purge.

Question 56

Given the following data table:

Which of the following are appropriate reasons to undertake data cleansing? (Select two).

Options:

A.

Non-parametric data

B.

Missing data

C.

Duplicate data

D.

Invalid data

E.

Redundant data

F.

Normalized data

Question 57

Which of the following is the best approach to use to gain a general understanding of a data set?

Options:

A.

Descriptive statistics

B.

Basic projections

C.

Gap analysis

D.

Trend analysis

Question 58

Which of the following describes the use of a representative amount of data from a main repository?

Options:

A.

Observation

B.

Delta load

C.

Web scraping

D.

Sampling

Question 59

Consider the following dataset which contains information about houses that are for sale:

Which of the following string manipulation commands will combine the address and region name columns to create a full address?

full_address
-------------------------
85 Turner St, Northern Metropolitan
25 Bloomburg St, Northern Metropolitan
5 Charles St, Northern Metropolitan
40 Federation La, Northern Metropolitan
55a Park St, Northern Metropolitan

Options:

A.

SELECT CONCAT(address, ' , ' , regionname) AS full_address FROM melb LIMIT 5;

B.

SELECT CONCAT(address, '-' , regionname) AS full_address FROM melb LIMIT 5;

C.

SELECT CONCAT(regionname, ' , ' , address) AS full_address FROM melb LIMIT 5;

D.

SELECT CONCAT(regionname, '-' , address) AS full_address FROM melb LIMIT 5;
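
The effect of argument order and separator on the concatenated result can be tried out locally. The sketch below is only an approximation of the question's setup: it uses an in-memory SQLite database from Python, and it uses SQLite's `||` operator in place of `CONCAT`, which is not available in every SQLite version. The table and column names follow the options above; the two rows are inferred from the expected output, since the original dataset is not reproduced here.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE melb (address TEXT, regionname TEXT)")
conn.executemany(
    "INSERT INTO melb VALUES (?, ?)",
    [
        ("85 Turner St", "Northern Metropolitan"),
        ("25 Bloomburg St", "Northern Metropolitan"),
    ],
)

# The || operator joins strings left to right, so the address comes first,
# then the separator, then the region name.
rows = conn.execute(
    "SELECT address || ', ' || regionname AS full_address "
    "FROM melb ORDER BY rowid LIMIT 5"
).fetchall()

for (full_address,) in rows:
    print(full_address)
```

Swapping the operands or changing the separator string changes the output accordingly, which is the distinction the four options test.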

Question 60

Which of the following is a process that is used during data integration to collect, blend, and load data?

Options:

A.

MDM

B.

ETL

C.

OLTP

D.

BI

Question 61

A company wants to know how its customers interact with an e-commerce website based on clicks over items. Which of the following is the primary requirement for this report?

Options:

A.

Data content

B.

Frequency

C.

Filtering

D.

Views

Question 62

Which of the following is the best description of discrete data types?

Options:

A.

Non-numeric data used to describe attributes of a population sample

B.

The frequency of the number of times each value occurs by using whole numbers

C.

Numeric values that can be measured on a continuous scale

D.

Non-numeric data used to describe attributes of a population sample ranked in a specific order

Question 63

Alex wants to use data from his corporate sales, CRM, and shipping systems to try to predict future sales.

Which of the following systems is the most appropriate?

Choose the best answer.

Options:

A.

Data mart.

B.

OLAP.

C.

Data Warehouse.

D.

OLTP.

Question 64

An analyst is reporting on the average income for a county and is reviewing the following data:

Which of the following is the reason the analyst would need to cleanse the data in this data set?

Options:

A.

Data completeness

B.

Data outliers

C.

Duplicate data

D.

Missing values

Question 65

Jenny wants to study the academic performance of undergraduate sophomores and wants to determine the average grade point average at different points during an academic year.

What best describes the data set she needs?

Options:

A.

Sample.

B.

Observation.

C.

Variable.

D.

Population.

Question 66

An analyst wants to check the progress and performance regarding the number of customers an organization served in the last six years. Which of the following represents the type of analysis the analyst should perform?

Options:

A.

Correlation analysis

B.

Trend analysis

C.

Regression analysis

D.

Descriptive analysis

Question 67

Given the image below:

Which of the following file formats is depicted?

Options:

A.

JSON

B.

CSV

C.

XML

D.

HTML

Question 68

An analyst is building a monthly report for production and wants to ensure the audience is aware of its once-a-month cadence. Which of the following is the MOST important to convey that information?

Options:

A.

The date of the dashboard build

B.

The data refresh date

C.

A report summary

D.

Frequently asked questions

Question 69

A data engineer is creating a database field to capture whether a customer likes vanilla ice cream. Which of the following data types is the best to capture this information?

Options:

A.

Integer

B.

Boolean

C.

Categorical

D.

Numeric

Question 70

An analyst runs a report on a daily basis, and the number of datapoints must be validated before the data can be analyzed. The number of datapoints increases each day by approximately 20% of the total number from the day before. On a given day, the number of datapoints was 8,798. Which of the following should be the total number of datapoints on the next day?

Options:

A.

7,038

B.

9,600

C.

10,600

D.

10,800
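
The expected count is the previous day's total plus roughly 20% of it. A short sketch of that arithmetic follows, with the four answer options included so the closest one can be picked; the variable names are illustrative.

```python
current_count = 8_798   # datapoints on the given day
growth_rate = 0.20      # approximate daily increase stated in the question

expected_next_day = current_count * (1 + growth_rate)

# Answer options from the question; pick the one closest to the estimate.
options = [7_038, 9_600, 10_600, 10_800]
closest = min(options, key=lambda option: abs(option - expected_next_day))

print(round(expected_next_day, 1), closest)
```

Because the growth is only approximate, the estimate will not match an option exactly; the nearest option is the reasonable validation target.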

Question 71

Which of the following descriptive statistical methods are measures of central tendency? (Choose two.)

Options:

A.

Mean

B.

Minimum

C.

Mode

D.

Variance

E.

Correlation

F.

Maximum
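
Measures of central tendency and measures of dispersion answer different questions about a data set. The sketch below computes a few of each with Python's standard `statistics` module; the sample values are hypothetical and chosen only for illustration.

```python
import statistics

# Hypothetical sample values, chosen only for illustration.
values = [2, 3, 3, 5, 7]

# Central tendency: where the typical or central value lies.
mean_value = statistics.mean(values)
median_value = statistics.median(values)
mode_value = statistics.mode(values)

# Dispersion: how spread out the values are.
value_range = max(values) - min(values)
variance = statistics.pvariance(values)  # population variance

print(mean_value, median_value, mode_value)
print(value_range, variance)
```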

Question 72

Which of the following actions should be taken when transmitting data to mitigate the chance of a data leak occurring? (Choose two.)

Options:

A.

Data identification

B.

Data processing

C.

Data reporting

D.

Data encryption

E.

Data masking

F.

Data removal

Question 73

A data analyst needs to collect a similar proportion of data from every state. Which of the following sampling methods would be the most appropriate?

Options:

A.

Systematic sampling

B.

Convenience sampling

C.

Stratified sampling

D.

Random sampling

Question 74

Which of the following best describes a difference between JSON and XML?

Options:

A.

JSON is quicker to read and write.

B.

JSON has to use an end tag.

C.

JSON strings are longer.

D.

JSON is much more difficult to parse.

Question 75

An analyst is required to run a text analysis of data that is found in articles from a digital news outlet. Which of the following would be the BEST technique for the analyst to apply to acquire the data?

Options:

A.

Web scraping

B.

Sampling

C.

Data wrangling

D.

ETL

Question 76

An analyst modified a data set that had a number of issues. Given the original and modified versions:

Which of the following data manipulation techniques did the analyst use?

Options:

A.

Imputation

B.

Recoding

C.

Parsing

D.

Deriving

Question 77

An analyst is updating a customer contacts database with information obtained from a survey of new customers. Which of the following data manipulation techniques should the analyst use?

Options:

A.

Join

B.

Append

C.

Transform

D.

Blend

Question 78

A data analyst has received a data set that contains actual and projected sales for the fourth quarter of 2019. Which of the following statistical methods should the analyst use to find the measure of dispersion?

Options:

A.

Mean

B.

Variance

C.

Correlation

D.

Confidence interval

Question 79

A user imports a data file into the accounts payable system each day. On a regular basis, the field input is not what the system is expecting, so it results in an error for the row and a broken import process. To resolve the issue, the user opens the file, finds the error in the row, and manually corrects it before attempting the import again. The import sometimes breaks on subsequent attempts, though. Which of the following changes should be made to this process to reduce the number of errors?

Options:

A.

Delete all incorrect inputs and upload the corrected file.

B.

Have the user manually review the file for data completeness before loading it.

C.

Create a data field to data type validator to run the file through prior to import.

D.

Spot-check the file prior to import to catch and correct field errors.

Question 80

A data analyst has been asked to create a daily manufacturing report for the floor manager. Which of the following metrics should be included in the report?

Options:

A.

Tons of steel produced per hour

B.

Annual sales budget

C.

End-of-day stock price

D.

Daily corporate employee count

Question 81

You are working with a dataset and want to change the names of categories that you used for different types of books.

What term best describes this action?

Options:

A.

Recoding.

B.

Summarizing.

C.

Aggregating.

D.

Filtering.

Question 82

A sales analyst needs to report how the sales team is performing to target. Which of the following files will be important in determining 2019 performance attainment?

Options:

A.

2018 goal data

B.

2018 actual revenue

C.

2019 goal data

D.

2019 commission plan

Question 83

Given the diagram below:

Which of the following types of sampling is depicted in the image?

Options:

A.

Stratified

B.

Random

C.

Cluster

D.

Systematic

Question 84

Which of the following should be accomplished NEXT after understanding a business requirement for a data analysis report?

Options:

A.

Rephrase the business requirement.

B.

Determine the data necessary for the analysis

C.

Build a mock dashboard/presentation layout.

D.

Perform exploratory data analysis.

Question 85

Given the table below:

Which of the following variables can be considered inconsistent, and how many distinct values should the variable have?

Options:

A.

Name, one

B.

Gender, two

C.

Level, three

D.

Code, four

E.

Region, five

Question 86

Under which of the following circumstances should the null hypothesis be accepted when α = 0.05?

Options:

A.

When p is 0.00003

B.

When p is 0.001

C.

When p is 0.04

D.

When p is 0.06
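
The usual decision rule compares the p-value with the significance level: a p-value at or below the threshold leads to rejecting the null hypothesis, and a larger one does not. The sketch below applies that rule to the four p-values in the options. The function name is hypothetical, and the wording "fail to reject" is the conventional statistical phrasing for what the question calls accepting the null hypothesis.

```python
ALPHA = 0.05  # significance level from the question


def decision(p_value, alpha=ALPHA):
    """Return the conventional decision about the null hypothesis."""
    if p_value <= alpha:
        return "reject the null hypothesis"
    return "fail to reject the null hypothesis"


# The four p-values listed in the answer options.
for p in [0.00003, 0.001, 0.04, 0.06]:
    print(p, "->", decision(p))
```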

Question 87

Which of the following is an example of a discrete variable?

Options:

A.

The temperature of a hot tub

B.

The height of a horse

C.

The time to complete a task

D.

The number of people in an office

Question 88

A customer survey reveals 90% positive feedback. Which of the following statistical methods would be best to utilize to determine the reliability of a data set and predict how a larger sample of customers over the same time period might respond?

Options:

A.

Calculate a high variance on survey responses.

B.

Calculate the maximum range of the survey responses.

C.

Calculate a low standard deviation on survey responses.

D.

Remove any data more than 4 standard deviations from the mean.

Question 89

The duration of a phone call in milliseconds is an example of:

Options:

A.

ordinal data.

B.

nominal data.

C.

boolean data.

D.

continuous data.

Question 90

What role in data governance is typically responsible for day-to-day oversight of data use?

Options:

A.

Data processors.

B.

Data custodians.

C.

Data owners.

D.

Data stewards.

Question 91

Which of the following differentiates a flat text file from other data types?

Options:

A.

Data is separated by a delimiter.

B.

Data is stored in defined rows.

C.

Data is defined with key-value pairs.

D.

Data is housed in a markup language.

Question 92

A data analyst for a media company needs to determine the most popular movie genre. Given the table below:

Which of the following must be done to the Genre column before this task can be completed?

Options:

A.

Append

B.

Merge

C.

Concatenate

D.

Delimit

Question 93

Which of the following best describes the process of examining data for statistics and information about the data?

Options:

A.

Cleansing

B.

Search

C.

Profiling

D.

Governance

Question 94

Which one of the following is a common data warehouse schema?

Options:

A.

Snowflake.

B.

Square.

C.

Spiral.

D.

Sphere.
