A county in Illinois is conducting a survey to determine the mean annual income per household. The county is 427sq mi (2.65q km). Which of the following sampling methods would MOST likely result in a representative sample?
Which of the following value is the measure of dispersion "range" between the scores of ten students in a test.
The scores of ten students in a test are 17, 23, 30, 36, 45, 51, 58, 66, 72, 77.
A data analyst must fulfill a request for information that is needed weekly and should be automatically emailed to a specific set of users. Which of the following types of reports should the analyst recommend?
An analyst has been asked to validate data quality. Which of the following are the BEST reasons to validate data for quality control purposes? (Choose two.)
A report is scheduled to run and be distributed at the end of business each day. On Mondays, one of the recipients opens the previous week's reports and combines them to calculate the weekly totals and projections for the coming week. This is a tedious process, and the recipient asks an analyst for help. Which of the following should the analyst recommend?
A junior web developer is developing a new application where users can upload short videos. The first task is to create a homepage that shows the headline "Upload Your Short Videos" and a clickable button that says "upload now".
Which of the following HTML commands would help the developer to complete the task successfully?
A research analyst wants to determine whether the data being analyzed is connected to other datapoints. Which of the following is the BEST type of analysis to conduct?
A database administrator needs to ensure only approved users can access specific database tables to perform financial functions. Which of the following is the best access control method for the administrator to use?
A recurring event is being stored in two databases that are housed in different geographical locations. A data analyst notices the event is being logged three hours earlier in one database than in the other database. Which of the following is the MOST likely cause of the issue?
Which of the following best describes how discrete data differs from continuous data?
Emma is working in a data warehouse and finds a finance fact table links to an organization dimension, which in turn links to a currency dimension that not linked to the fact table.
What type of design pattern is the data warehouse using?
Which of the following data governance concepts fits into the security requirements category?
Which of the following is a non-parametric test?
You have two databases tables that you would like to join together using a foreign key relationship.
What term best describes this action?
A cereal manufacturer wants to determine whether the sugar content of its cereal has increased over the years. Which of the following is the appropriate descriptive statistic to use?
Given the data below:
In which of the following file formats is the data presented?
A data analyst is designing a dashboard that will provide a story of sales and determine which site is providing the highest sales volume per customer The analyst must choose an appropriate chart to include in the dashboard. The following data is available:
Which of the following types of charts should be considered?
An analyst in a consumer bank department wants to showcase the concentration of accounts opened in the United States by ZIP Code to describe the effectiveness of the bank's marketing campaigns. Which of the following would be the best way to visualize the data?
Which of the following data types must be used when working with variables that require classification into two or more groups before analysis?
An analyst develops an IT document and needs to describe the technical terms used in the document. Which of the following is where the analyst should include descriptions of the technical terms?
Which of the following is a domain-specific language used in programming that is designed for managing data that is held in a relational data stream management system?
A database consists of one fact table that is composed of multiple dimensions. Each dimension is represented by a denormalized table. This structure is an example of a:
A data analyst needs to present the results of an online marketing campaign to the marketing manager. The manager wants to see the most important KPIs and measure the return on marketing investment. Which of the following should the data analyst use to BEST communicate this information to the manager?
Which of the following is the first step an analyst should perform upon receiving a business request for analysis?
An organization would like to add a secondary email field to its customer database in order to enrich the customer profiles. Which of the following data manipulation techniques should the analyst use to add this information?
A data analyst is creating a report that will provide information about various regions, products, and time periods. Which of the following formats would be the MOST efficient way to deliver this report?
An analyst needs to summarize the number of people in Chicago in 2022 using the following set of data:
Which of the following steps should the analyst use to provide results? (Select two).
The ACME Corporation hired an analyst to detect data quality issues in their Excel documents. Which of the following are the most common issues? (Select TWO)
Which of the following concepts should be applied if a data set with 40 fields needs to be pared down to 20 fields and contains similar data across multiple fields?
A user receives a large custom report to track company sales across various date ranges. The user then completes a series of manual calculations for each date range. Which of the following should an analyst suggest so the user has a dynamic, seamless experience?
‘Which of the following is the BEST reason to use database views instead of tables?
A JSON file is an example of:
Which of the following types of analysis is used when comparing last week's sales to the previous week's sales?
Given the following grocery store orders:
If a query is made to the table with the following logic:
Order_Total > 132 OR (Order Total >= 25 AND Order_Total < 74)
Which of the following is the number of orders that will be returned by the query?
The senior management team at a company receives a detailed sales report at the end of each quarter. The report is several pages long and includes data from dozens of offices across the country. The team wants a better way to get a quick snapshot of what is included in the report. Which of the following modifications would best meet this requirement?
Which of the following data sampling methods involves dividing a population into subgroups by similar characteristics?
Which of the following is an example of a at flat file?
A table in a hospital database has a column for patient height in inches and a column for patient height in centimeters. This is an example of:
A data scientist wants to see which products make the most money and which products attract the most customer purchasing interest in their company.
Which of the following data manipulation techniques would he use to obtain this information?
Given the following:
Which of the following is the most important thing for an analyst to do when transforming the table for a trend analysis?
A reporting analyst is creating a dashboard that shows the year-over-year performance for a sales organization. Which of the following is the best visual for the analyst use to illustrate the organization's performance?
Five dogs have the following heights in millimeters:
300,430, 170, 470, 600
Which of the following is the standard deviation for the five dogs?
While reviewing survey data, a research analyst notices data is missing from all the responses to a single question. Which of the following methods would BEST address this issue?
When analyzing the values of two variables, you decide to convert both variables so they are on a scale of 0 to 1.
What term describes this action?
An analyst wants to combine two data sets into a single spreadsheet. Column names from the first spreadsheet are listed in rows in the second spreadsheet. Which of the following is the first step the analyst should take to combine the data sets?
Given the following data:
Which of the following BEST describes the data set?
A data analyst is compiling a report that a Chief Executive Officer needs for an impromptu meeting. The report should include information on the previous day's performance. Which of the following reports should the analyst provide?
A data analyst has been asked to merge the tables below, first performing an INNER JOIN and then a LEFT JOIN:
Customer Table -
In-store Transactions –
Which of the following describes the number of rows of data that can be expected after performing both joins in the order stated, considering the customer table as the main table?
Andy is a pricing analyst for a retailer. Using a hypothesis test, he wants to assess whether people who receive electronic coupons spend more on average.
What should Andy's null hypothesis be?
An e-commerce company recently tested a new website layout. The website was tested by a test group of customers, and an old website was presented to a control group. The table below shows the percentage of users in each group who made purchases on the websites:
Which of the following conclusions is accurate at a 95% confidence interval?
Which of the following query statements would be used when filtering data in a relational database management system? (Select two).
Which of the following summary statements upholds integrity in data reporting?
An analyst is creating a resource to improve users' experience when they select specific records based on particular dates. Which of the following should the analyst use to create a resource that best meets user needs?
A business intelligence team wants to create a new dashboard in order to solve a problem statement. Which of the following is the correct order of steps the team should take?
Mario works with a group of R programmers tasked with copying data from an accounting system into a data warehouse.
In what phase are the group's R skills most relevant?
Given the following data table:
Which of the following are appropriate reasons to undertake data cleansing? (Select two).
Which of the following is the best approach to use to gain a general understanding of a data set?
Which of the following describes the use of a representative amount of data from a main repository?
Consider the following dataset which contains information about houses that are for sale:
Which of the following string manipulation commands will combine the address and region name columns to create a full address?
full_address------------------------- 85 Turner St, Northern Metropolitan 25 Bloomburg St, Northern Metropolitan 5 Charles St, Northern Metropolitan 40 Federation La, Northern Metropolitan 55a Park St, Northern Metropolitan
Which of the following is a process that is used during data integration to collect, blend, and load data?
A company wants to know how its customers interact with an e-commerce website based on clicks over items. Which of the following is the primary requirement for this report?
Which of the following is the best description of discrete data types?
Alex wants to use data from his corporate sale, CRM, and shipping systems to try and predict future sales.
Which of the following systems is the most appropriate?
Choose the best answer.
An analyst is reporting on the average income for a county and is reviewing the following data:
Which of the following is the reason the analyst would need to cleanse the data in this data set?
Jenny wants to study the academic performance of undergraduate sophomores and wants to determine the average grade point average at different points during an academic year.
What best describes the data set she needs?
An analyst wants to check the progress and performance regarding the number of customers an organization served in the last six years. Which of the following represents the type of analysis the analyst should perform?
Given the image below:
Which of the following file formats is depicted?
An analysts building a monthly report for production and wants to ensure the audience is aware of its once-a-month cadence. Which of the following is the MOST important to convey that information?
A data engineer is creating a database field to capture whether a customer likes vanilla ice cream. Which of the following data types is the best to capture this information?
An analyst runs a report on a daily basis, and the number of datapoints must be validated before the data can be analyzed. The number of datapoints increases each day by approximately 20% of the total number from the day before. On a given day, the number of datapoints was 8,798. Which of the following should be the total number of datapoints on the next day?
Which of the following descriptive statistical methods are measures of central tendency? (Choose two.)
Which of the following actions should be taken when transmitting data to mitigate the chance of a data leak occurring? (Choose two.)
A data analyst needs to collect a similar proportion of data from every state. Which of the following sampling methods would be the most appropriate?
Which of the following best describes a difference between JSON and XML?
An analyst is required to run a text analysis of data that is found in articles from a digital news outlet. Which of the following would be the BEST technique for the analyst to apply to acquire the data?
An analyst modified a data set that had a number of issues. Given the original and modified versions:
Which of the following data manipulation techniques did the analyst use?
An analyst is updating a customer contacts database with information obtained from a survey of new customers. Which of the following data manipulation techniques should the analyst use?
A data analyst has received a data set that contains actual and projected sales for the fourth quarter of 2019. Which of the following statistical methods should the analyst use to find the measure of dispersion?
A user imports a data file into the accounts payable system each day. On a regular basis. the field input is not what the system is expecting. so it results in an error for the row and a broken import process. To resolve the issue, the user opens the file, finds the error in the row, and manually corrects it before attempting the import again. The import sometimes breaks on subsequent attempts. though. Which of the following changes should be made to this process to reduce the number of errors?
A data analyst has been asked to create a daily manufacturing report for the floor manager Which of the following metrics should be included in the report?
You are working with a dataset and want to change the names of categories that you used for different types of books.
What term best describes this action?
A sales analyst needs to report how the sales team is performing to target. Which of the following files will be important in determining 2019 performance attainment?
Given the diagram below:
Which of the following types of sampling is depicted in the image?
Which of the following should be accomplished NEXT after understanding a business requirement for a data analysis report?
Given the table below:
Which of the following variables can be considered inconsistent, and how many distinct values should the variable have?
Under which of the following circumstances should the null hypothesis be accepted when a = 0.05?
Which of the following is an example of a discrete variable?
A customer survey reveals 90% positive feedback. Which of the following statistical methods would be best to utilize to determine the reliability of a data set and predict how a larger sample of customers over the same time period might respond?
The duration of a phone call in milliseconds is an example of:
What role in a data governance is typically responsible for day-to-day oversight of data use?
Which of the following differentiates a flat text file from other data types?
A data analyst for a media company needs to determine the most popular movie genre. Given the table below:
Which of the following must be done to the Genre column before this task can be completed?
Which of the following best describes the process of examining data for statistics and information about the data?
Cleansing
Which one of the following is a common data warehouse schema?