Huawei H13-711_V3.0 HCIA-Big Data V3.0 Exam Practice Test

Demo: 97 questions
Total 649 questions

HCIA-Big Data V3.0 Questions and Answers

Question 1

Which Redis command is used to view the remaining lifetime of a key?

Options:

A.

persist

B.

expire

C.

ttl

D.

expireat
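
The four commands above differ in whether they set, remove, or read a key's expiry. The sketch below is a toy in-memory model in Python, not the Redis implementation: the class name `TinyExpiringStore` and the injected `now` argument are illustrative assumptions. The return conventions for `ttl` (-2 for a missing key, -1 for a key without an expiry) follow documented Redis behaviour.

```python
class TinyExpiringStore:
    """Toy model of Redis key expiry (illustrative only, not the real server)."""

    def __init__(self):
        self._data = {}      # key -> value
        self._deadline = {}  # key -> absolute expiry time, in seconds

    def _evict_if_expired(self, key, now):
        deadline = self._deadline.get(key)
        if deadline is not None and deadline <= now:
            del self._data[key]
            del self._deadline[key]

    def set(self, key, value):
        self._data[key] = value
        self._deadline.pop(key, None)  # a plain SET discards any old expiry

    def expire(self, key, seconds, now):
        """Like EXPIRE: set a relative lifetime. Returns 1, or 0 if the key is missing."""
        if key not in self._data:
            return 0
        self._deadline[key] = now + seconds
        return 1

    def expireat(self, key, timestamp):
        """Like EXPIREAT: set an absolute expiry time. Returns 1, or 0 if the key is missing."""
        if key not in self._data:
            return 0
        self._deadline[key] = timestamp
        return 1

    def persist(self, key):
        """Like PERSIST: remove the expiry. Returns 1 if an expiry was removed, else 0."""
        return 1 if self._deadline.pop(key, None) is not None else 0

    def ttl(self, key, now):
        """Like TTL: remaining seconds, -1 if the key has no expiry, -2 if it does not exist."""
        self._evict_if_expired(key, now)
        if key not in self._data:
            return -2
        if key not in self._deadline:
            return -1
        return self._deadline[key] - now
```

Passing `now` explicitly keeps the model deterministic; a real client would simply call the server, which uses its own clock.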

Question 2

When a Zookeeper cluster has 5 nodes, its disaster recovery capability is equivalent to that of a cluster with how many nodes?

Options:

A.

3

B.

4

C.

6

D.

none of the above
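
A ZooKeeper ensemble keeps serving requests only while a majority of its servers is alive, so the number of failures it tolerates follows from simple integer arithmetic. A minimal Python sketch of that arithmetic (the function names are illustrative, not part of any ZooKeeper API):

```python
def quorum_size(n: int) -> int:
    """Smallest number of servers that forms a strict majority of n."""
    return n // 2 + 1


def tolerated_failures(n: int) -> int:
    """How many servers may fail while a majority is still alive."""
    return n - quorum_size(n)


if __name__ == "__main__":
    for n in range(3, 8):
        print(n, quorum_size(n), tolerated_failures(n))
```

Running the loop shows that adding one server to an odd-sized ensemble raises the quorum size without raising the number of tolerated failures.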

Question 3

Which of the following descriptions about Zookeeper features is wrong?

Options:

A.

The number of Zookeeper nodes must be odd.

B.

Updates sent by the client are applied in the order in which they were sent.

C.

Message updates can only succeed or fail, with no intermediate states.

D.

Once a message has been received by more than half of the servers, it can be successfully written to disk.

Question 4

Which of the following functions can the Kafka Cluster Mirroring tool achieve?

Options:

A.

Kafka cross-cluster data synchronization method

B.

Kafka data backup within a single cluster

C.

Kafka data recovery within a single cluster

D.

None of the above

Question 5

Which configuration is not supported by FusionInsight Manager user rights management?

Options:

A.

Assign roles to users

B.

Configure permissions for roles

C.

Assign roles to user groups

D.

Configure permissions for user groups

Question 6

Which of the following operations is not recorded in the FusionInsight HD system audit log?

Options:

A.

delete service instance

B.

Start and stop the service instance

C.

Manually clear alarms

D.

Query history monitoring

Question 7

In the replica placement strategy of HDFS, what is the distance between different servers in the same rack?

Options:

A.

1

B.

2

C.

3

D.

4
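
Hadoop models the cluster as a tree (data center, rack, node) and measures the distance between two nodes as the number of steps each must climb to reach their closest common ancestor, added together. The Python sketch below reproduces that rule; the function name and the `/datacenter/rack/node` path strings are hypothetical examples, not values from the exam.

```python
def topology_distance(a: str, b: str) -> int:
    """Distance between two nodes given as tree paths such as '/d1/r1/n1'."""
    path_a = a.strip("/").split("/")
    path_b = b.strip("/").split("/")
    # Count the leading path components the two nodes share.
    common = 0
    for x, y in zip(path_a, path_b):
        if x != y:
            break
        common += 1
    # Steps from each node up to the closest common ancestor.
    return (len(path_a) - common) + (len(path_b) - common)


if __name__ == "__main__":
    print(topology_distance("/d1/r1/n1", "/d1/r1/n1"))  # same node
    print(topology_distance("/d1/r1/n1", "/d1/r1/n2"))  # same rack
    print(topology_distance("/d1/r1/n1", "/d1/r2/n3"))  # different racks
```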

Question 8

Which of the following factors contributed to the vigorous development of the era of big data?

Options:

A.

Reduced hardware costs and increased network bandwidth

B.

The rise of cloud computing

C.

The popularization of smart terminals and the improvement of social demands

D.

All of the above

Question 9

Which of the following options are advantages of FusionInsight SparkSQL?

Options:

A.

Support data insert, update and delete

B.

long-term stability test

C.

Stable and high performance

D.

SQL Compatibility

E.

All of the above

Question 10

Which of the following descriptions about the features of Zookeeper is wrong?

Options:

A.

Updates sent by the client are applied in the order in which they were sent

B.

Once a message has been received by more than half of the servers, it can be successfully written to disk

C.

A message update can only succeed or fail. There is no intermediate state

D.

The number of Zookeeper nodes must be odd

Question 11

In Hive, which of the following statements about partitions is incorrect?

Options:

A.

There can be further partitions or buckets under the partition

B.

The data table can be partitioned by the value of a field

C.

Each partition is a directory

D.

The number of partitions is fixed

Question 12

Which component controls active/standby arbitration of the NameNode in HDFS?

Options:

A.

HDFS Client

B.

NodeManager

C.

ResourceManager

D.

ZooKeeper Failover Controller

Question 13

What is the default block size of HDFS in the FusionInsight HD system?

Options:

A.

32M

B.

64M

C.

128M

D.

256M
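
Whatever the configured block size, HDFS splits a file into full-size blocks plus one final block that holds only the remaining bytes. The sketch below shows that arithmetic in Python. The function name is illustrative, and the block size is passed in explicitly: the 128 MB used in the example is the Apache Hadoop 2.x default for `dfs.blocksize` and should be treated as an assumption for any particular distribution.

```python
MB = 1024 * 1024


def split_into_blocks(file_size: int, block_size: int) -> list:
    """Return the sizes (in bytes) of the blocks a file of file_size bytes occupies."""
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    full_blocks, remainder = divmod(file_size, block_size)
    # The last block only holds the leftover bytes, not a whole block.
    return [block_size] * full_blocks + ([remainder] if remainder else [])


if __name__ == "__main__":
    blocks = split_into_blocks(300 * MB, 128 * MB)
    print(len(blocks), [size // MB for size in blocks])
```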

Question 14

In order to improve the fault tolerance of Kafka, Kafka supports the replication strategy of partition. Which of the following descriptions about Leader partition and Follower partition is wrong?

Options:

A.

It is impossible for the nodes of a Kafka cluster to be leaders and followers of one another

B.

If the leader fails, other followers will take over (become the new leader)

C.

Because the leader server carries all the request pressure, Kafka distributes the leaders evenly across instances, from the perspective of the whole cluster, to keep overall performance stable

D.

Kafka needs to select a leader for partition replication; the leader is responsible for partition read and write operations, and the other replica nodes are only responsible for data synchronization

Question 15

Regarding Hive log collection on the FusionInsight HD Manager interface, which option is incorrect?

Options:

A.

You can specify an instance for log collection, for example, collecting only the MetaStore logs

B.

You can specify a time period for log collection, for example, only collect logs from 2016-1-1 to 2016-140

C.

The node IP can be specified for log collection, for example, only the logs of a certain IP can be downloaded

D.

A specific user can be specified for log collection, for example, only the logs generated by userA can be downloaded

Question 16

Which of the following descriptions of the Loader job in FusionInsight HD is correct?

Options:

A.

After the Loader submits the job to YARN for execution, if the Loader is abnormal at this time, the job fails to execute.

B.

After the Loader submits the job to YARN for execution, if a Mapper task fails to execute, it can be retried automatically

C.

After the Loader job fails to execute, garbage data will be generated, which needs to be cleared manually by the user

D.

After the Loader submits a job to YARN for execution, other jobs cannot be submitted until the job is executed.

Question 17

Which of the following statements about Flink barriers is wrong?

Options:

A.

Barriers are periodically inserted into the data flow and flow with it as part of the data flow

B.

Barriers are at the heart of Flink snapshots

C.

A barrier separates the snapshot data of the current cycle from the snapshot data of the next cycle

D.

When the barrier is inserted, it will temporarily block the data flow

Question 18

The figure below shows the configuration of HDFS tiered storage. If the number of copies of a block is 4, which of the following statements is wrong?

Options:

A.

All 4 blocks will be written to RAM_DISK

B.

If it is a copy other than the first copy that fails to write, try to write to the storage type specified by "Alternative storage policy for copy"

C.

The first block copy will be written to RAM_DISK, the rest of the replicas are written to DISK

D.

If writing the first block copy to RAM_DISK fails, try to write to the storage type specified by "Alternative Storage Policy"

Question 19

When HBase is deployed in a FusionInsight HD cluster with N nodes, it is recommended to deploy ( ) HMaster processes and ( ) RegionServer processes.

Options:

A.

3, N

B.

N, N

C.

2, N

D.

2, 2

Question 20

In the MRS platform, which component does the Flume data flow not need to pass through within a node?

Options:

A.

Sink

B.

Channel

C.

Topic

D.

Source

Question 21

Which of the following is not a mandatory option when creating a Loader job?

Options:

A.

name

B.

connect

C.

type

D.

priority

Question 22

How many shards does an ElasticSearch index have by default?

Options:

A.

5

B.

6

C.

3

D.

4

Question 23

Which service process manages HBase Regions in the Hadoop platform?

Options:

A.

HMaster

B.

DataNode

C.

RegionServer

D.

Zookeeper

Question 24

As shown in the figure, which of the following descriptions of how the Kafka Consumer reads messages is wrong?

Options:

A.

The blue part of the picture is a Kafka topic, which can be understood as a queue, and each grid cell represents a message.

B.

The messages generated by the producer are placed at the end of the topic one by one.

C.

Consumers read messages sequentially from right to left.

D.

The Consumer uses an offset to record the read position
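
The relationship between appended messages and a consumer's offset can be sketched with a toy model. The Python below is not the Kafka client API: `TinyPartitionLog` and `TinyConsumer` are hypothetical names, and the model covers a single partition with a single consumer.

```python
class TinyPartitionLog:
    """Append-only list of messages; a message's offset is its index."""

    def __init__(self):
        self._messages = []

    def append(self, message):
        """Producer side: add to the end of the log and return the new offset."""
        self._messages.append(message)
        return len(self._messages) - 1

    def read(self, offset, max_messages=1):
        """Return up to max_messages messages starting at the given offset."""
        return self._messages[offset:offset + max_messages]


class TinyConsumer:
    """Reads sequentially and remembers its position as an offset."""

    def __init__(self, log):
        self.log = log
        self.offset = 0  # next offset to read

    def poll(self, max_messages=1):
        batch = self.log.read(self.offset, max_messages)
        self.offset += len(batch)  # advance only past what was actually read
        return batch
```

Because the consumer stores only an integer position, it can stop and later resume from the same place without the log tracking anything on its behalf.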

Question 25

Regarding the comparison between Hive and traditional data warehouse, which of the following descriptions is wrong?

Options:

A.

Hive metadata storage is independent of data storage, thereby decoupling metadata and data. High flexibility, while traditional data warehouse data application is single, low flexibility

B.

Hive is based on HDFS storage. Theoretically, the storage capacity can be expanded infinitely. The storage capacity of traditional data warehouses will have an upper limit

C.

Since Hive data is stored in HDFS, it can ensure high fault tolerance and high reliability of data

D.

Since Hive is based on a big data platform, query efficiency is faster than traditional data warehouses

Question 26

Which of the following descriptions about Hive features is incorrect?

Options:

A.

Flexible and convenient ETL

B.

Only supports MapReduce computing engine

C.

Direct access to HDFS files and HBase

D.

Easy to use and easy to program

Question 27

Which of the following statements about the read/write process of the leader node of Zookeeper after receiving the data change request is correct?

Options:

A.

Simultaneous writes to disk and memory

B.

Write to disk first, then write to memory

C.

write to memory only

D.

Write to memory first, then write to disk

Question 28

SolrCloud mode is cluster mode. In this mode, which of the following services does the Solr server strongly depend on?

Options:

A.

HBase

B.

HDFS

C.

ZooKeeper

D.

YARN

Question 29

When Zookeeper and YARN work together and the Active ResourceManager fails, from which directory does the Standby ResourceManager obtain Application-related information?

Options:

A.

warehouse

B.

Metastore

C.

State store

D.

Storage

Question 30

What is the file format of data storage in HBase?

Options:

A.

HFile

B.

HLog

C.

TextFile

D.

SequenceFile

Question 31

Which of the following commands can be used to clear the data of all databases under the Redis instance?

Options:

A.

dropall

B.

flushall

C.

flushdb

D.

dropdb

Question 32

In the FusionInsight HD system, which of the following methods cannot be used to view the execution result of a Loader job?

Options:

A.

View through YARN task management

B.

Check through the Loader UI interface

C.

View through the alarms of Manager

D.

View through NodeManager

Question 33

What is the index name of ElasticSearch?

Options:

A.

ddcvc

B.

doc

C.

logstash-2020.01.17

D.

3sDqsm8Bu-kTplzOjqhL

Question 34

In Hive, which of the following statements about partitions is incorrect?

Options:

A.

Each partition is a directory

B.

The number of partitions is fixed

C.

The data table can be partitioned by the value of a field

D.

There can be further partitions or buckets under the partition

Question 35

The HUAWEI CLOUD MapReduce service provides a one-stop enterprise-level big data cluster cloud service that is fully controllable by tenants. It is fully compatible with open source interfaces, and combines HUAWEI CLOUD computing and storage advantages with big data industry experience to provide customers with high performance at low cost.

This flexible and easy-to-use full-stack big data platform can easily run Hadoop, Spark, HBase, Kafka, Storm and other big data components, realize real-time and offline analysis and mining, and discover new business opportunities for enterprises.

Options:

A.

True

B.

False

Question 36

Comparing Spark and MapReduce, which of the following statements are correct?

Options:

A.

Shuffle is not a necessary process in Spark tasks

B.

Spark can cache intermediate results in memory, reducing disk I/O

C.

Shuffle is a necessary process in Spark tasks

D.

Intermediate results of Spark tasks can only be stored in memory

Question 37

In HBase table design, column families and the RowKey are important parts of the design. Which of the following options is not suitable as a RowKey?

Options:

A.

identity number

B.

username

C.

mobile phone number

D.

userID

Question 38

In YARN task scheduling, once the ApplicationMaster has obtained the resources it applied for, it communicates with the corresponding ResourceManager and asks it to start the task.

Options:

A.

True

B.

False

Question 39

Which of the following schedulers does YARN support?

Options:

A.

Fair

B.

FIFO

C.

Mesos

D.

Capacity

Question 40

When HBase is used for data reading in FusionInsight HD, a connection to the HMaster is required.

Options:

A.

True

B.

False

Question 41

A telecommunications company plans to launch a big data business. The target services include customer grouping, historical bill analysis, real-time call charge analysis and other services. Which of the following options is the most appropriate in terms of functionality and cost to meet business needs?

Options:

A.

Deploy Spark separately

B.

Deploy MapReduce

C.

Deploy Storm

D.

Deploy MapReduce with Stare

Question 42

One of the goals of the data middle-office technical architecture is to solve the drawbacks that traditional data warehouses are easy to become data islands and big data platforms are easy to become data swamps.

Options:

A.

True

B.

False

Question 43

Which of the following descriptions of the Flink fault-tolerance mechanism is wrong?

Options:

A.

Che? ?Mechanism guarantees an internship? ?Runtime,? ?

B.

Save?mechanism guarantees real-time?run?,? ?

C.

Save?mechanism to ensure that the program is running?After the situation, from the saved state?

D.

Check?mechanism ensures that the program has?After the situation,?from a saved state?

Question 44

Redis adopts a non-central self-organizing structure. Nodes use the Gossip protocol to exchange node status information.

Options:

A.

TRUE

B.

FALSE

Question 45

In the MRS service, the unavailability of the Zookeeper service will result in the unavailability of the kafka service.

Options:

A.

True

B.

False

Question 46

In Flink, the checkpoint mechanism can ensure that when the application fails during operation, it can recover from a certain checkpoint. During this process, stream snapshots are created in sequence according to the order of data inflow.

Options:

A.

True

B.

False

Question 47

As an authentication server center, Kerberos can provide unified authentication services to all services in the cluster and to customers' secondary development applications.

Options:

A.

True

B.

False

Question 48

The combine of MapReduce in the Map phase is a pre-grouping process and is optional.

Options:

A.

True

B.

False
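
To make the role of a combiner concrete, the sketch below counts words over two input splits with and without a local pre-aggregation step, and reports how many key-value pairs would reach the shuffle. This is plain Python rather than the Hadoop API; the function names and the sample input are illustrative assumptions.

```python
from collections import Counter


def map_phase(line):
    """Emit a (word, 1) pair for every word in the line."""
    return [(word, 1) for word in line.split()]


def sum_by_key(pairs):
    """Sum the values per key; serves as both the combiner and the reducer here."""
    totals = Counter()
    for key, value in pairs:
        totals[key] += value
    return sorted(totals.items())


def word_count(splits, use_combiner):
    """Return (final counts, number of pairs sent through the shuffle)."""
    shuffled = []
    for split in splits:
        pairs = [pair for line in split for pair in map_phase(line)]
        if use_combiner:
            pairs = sum_by_key(pairs)  # local aggregation on the map side
        shuffled.extend(pairs)
    return dict(sum_by_key(shuffled)), len(shuffled)


if __name__ == "__main__":
    splits = [["a a b"], ["a b b"]]
    print(word_count(splits, use_combiner=False))
    print(word_count(splits, use_combiner=True))
```

Both runs produce the same counts; only the volume of intermediate data differs, which is why the step can be skipped without changing the result for an associative operation such as a sum.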

Question 49

Hadoop's NameNode is used to store the metadata of the file system.

Options:

A.

True

B.

False

Question 50

The default snapshot method of Redis is AOF.

Options:

A.

True

B.

False

Question 51

In the FusionInsight HD platform, HBase does not currently support secondary indexes.

Options:

A.

True

B.

False

Question 52

Which of the following options are included in the key features of ZooKeeper?

Options:

A.

eventual consistency

B.

reliability

C.

wait-free

D.

atomicity

Question 53

If the physical memory utilization of some Containers exceeds the configured memory threshold, but the total memory utilization of all Containers does not exceed the configured NodeManager memory threshold, then the Containers with excessive memory usage can still continue to run.

Options:

A.

True

B.

False

Question 54

The Hive component supports a variety of API interfaces. Which of the following interfaces is not supported?

Options:

A.

Beeline

B.

JDBC

C.

ODBC

D.

Restful

Question 55

The Capacity scheduler allocates resources to each queue. Which of the following options is the resource scheduling strategy within a queue?

Options:

A.

Fair

B.

Capacity

C.

FIFO

D.

Mesos

Question 56

Flume is used for data collection. What is the basic unit of data it transmits?

Options:

A.

block

B.

event

C.

split

D.

packet

Question 57

Which of the following is not a mandatory option when creating a Loader job?

Options:

Question 58

On the FusionInsight Manager interface, when an alarm about insufficient Kafka disk capacity is received and a hard disk hardware failure has been ruled out as the cause, the system administrator needs to consider expanding the capacity to solve the problem.

Options:

A.

True

B.

False

Question 59

Tez is a distributed computing framework that supports directed acyclic graphs. When Hive uses the Tez engine for data analysis, it parses the HQL statements submitted by users into corresponding Tez tasks and submits them to Tez for execution.

Options:

A.

True

B.

False

Question 60

Which of the following descriptions of Kafka features is incorrect?

Options:

A.

Supports horizontal expansion

B.

Support multi-level cascade

C.

Support message partitioning

D.

High throughput

Question 61

The FusionInsight tool is a set of health check tools provided for technical support engineers and maintenance engineers. It can check the health status of cluster-related nodes and services, discover potential problems in the cluster in advance, and generate health check reports, so that technical support engineers and maintenance engineers can quickly understand the health status of the system.

Options:

A.

True

B.

False

Question 62

HFS was introduced to handle mixed scenarios in which a large number of small files (below 10 MB) must be stored in HDFS together with some large files (above 10 MB).

Options:

A.

True

B.

False

Question 63

The data stream of Flume can be sent to different Channels according to the header information.

Options:

A.

True

B.

False
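
Flume's multiplexing channel selector routes each event by looking up one header value in a configured mapping and falling back to a default. A minimal Python sketch of that lookup follows; it is not Flume code, and the function name, header name and channel names are hypothetical.

```python
def select_channels(headers, header_name, mapping, default):
    """Pick target channels for an event from one of its header values.

    headers:     dict of header name -> value carried by the event
    header_name: the header the selector inspects
    mapping:     dict of header value -> list of channel names
    default:     channels used when the header is absent or unmapped
    """
    value = headers.get(header_name)
    return mapping.get(value, default)


if __name__ == "__main__":
    # Hypothetical routing table: events tagged by a "state" header.
    mapping = {"CA": ["ch_west"], "NY": ["ch_east", "ch_audit"]}
    print(select_channels({"state": "NY"}, "state", mapping, ["ch_default"]))
    print(select_channels({}, "state", mapping, ["ch_default"]))
```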

Question 64

In the FusionInsight HD system, HBase supports dynamic expansion of columns.

Options:

A.

True

B.

False

Question 65

ApplicationMasters apply for and receive resources from the ResourceManager through the RPC protocol in a polling manner.

Options:

A.

True

B.

False

Question 66

An LdapServer Group provides unified group management for users. If a user is added to the group, the member's dn record is added to the member attribute of the group.

Options:

A.

True

B.

False

Question 67

Which of the following commands operate on the set type?

Options:

A.

scard

B.

sunion

C.

zcount

D.

hexists

Question 68

Which of the following are enterprise-level big data application scenarios?

Options:

A.

Internal Operations Management

B.

Supply Chain Management

C.

Customer Analysis

D.

Marketing Analysis

Question 69

Which interfaces does FusionInsight Manager support when interfacing with external platforms?

Options:

A.

SNMP

B.

VPN

C.

BGP

D.

syslog

Question 70

Which configuration files can the FusionInsight HD LLD configuration planning tool generate?

Options:

A.

Monitoring Alarm Threshold Profiles

B.

Cluster installation template file

C.

Configuration files for HDFS and YARN

D.

The configuration file CheckNodes.config required to execute Precheck

Question 71

Which operations in Hive can be merged?

Options:

A.

UNION ALL

B.

GROUP BY

C.

SELECT

D.

JOIN

Question 72

In Flink, the ( ) interface is used for streaming data processing and the ( ) interface is used for batch processing.

Options:

A.

Stream API, Batch API

B.

DataStream API, DataSet API

C.

DataBatch API, DataStream API

D.

Batch API, Stream API

Question 73

There are 230 nodes in the FusionInsight HD cluster of a university. Which of the following processes should be deployed on the data nodes during cluster planning?

Options:

A.

DataNode

B.

NameNode

C.

NodeManager

D.

RegionServer

E.

DBServer

F.

SolrServer

Question 74

What types of data sources does Flink stream processing include?

Options:

A.

Collections

B.

JDBC

C.

Socket streams

D.

Files

Question 75

In a FusionInsight Hadoop cluster with 70 nodes, if the recommended deployment scheme is adopted, which partitions may exist on the management node?

Options:

A.

/srv/BigData/zookeeper

B.

/srv/BigData/dbdata-om

C.

/srv/BigData

D.

/srv/BigData/journalnode

E.

/srv/BigData/hadoop/data5

Question 76

Which parts of the data need to be read to execute the HBase data reading business?

Options:

A.

HLog

B.

MemStore

C.

HFile

D.

HMaster

Question 77

In big data computing tasks, which of the following descriptions of I/O-intensive tasks is incorrect?

Options:

A.

Most of the time during the execution of an I/O-intensive task is spent on I/O

B.

By improving network transmission efficiency and read and write efficiency, performance can be greatly improved

C.

High CPU consumption

D.

The more tasks, the more efficient the CPU

Question 78

Which of the following sub-products does the FusionInsight family include?

Options:

A.

HD

B.

Libra

C.

Manager

D.

Miner

E.

Farmer

F.

DWS

Question 79

Which of the following statements about segment file in Kafka Logs is correct?

Options:

A.

Mapping all index metadata into memory avoids disk I/O operations on the index data of the segment file

B.

The index file is sparsely stored, which can greatly reduce the space occupied by the metadata of the index file

C.

Sparse storage, that is, to store the original complete data only at intervals

D.

Message can be quickly located through index information
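
A sparse index keeps an entry for only some of the messages, so a lookup first finds the nearest indexed entry at or before the target and then scans forward. The Python sketch below shows the idea under simplifying assumptions: the segment is a plain list of message offsets, an entry is recorded every `every` messages (Kafka itself spaces entries by bytes written), and all function names are illustrative.

```python
import bisect


def build_sparse_index(offsets, every):
    """Index every `every`-th message as an (offset, position) pair."""
    return [(off, pos) for pos, off in enumerate(offsets) if pos % every == 0]


def find_position(offsets, index, target):
    """Locate target in the segment via the sparse index; None if absent."""
    keys = [off for off, _ in index]
    # Last index entry whose offset is <= target.
    i = bisect.bisect_right(keys, target) - 1
    if i < 0:
        return None
    _, pos = index[i]
    # Short sequential scan from the indexed position.
    while pos < len(offsets) and offsets[pos] < target:
        pos += 1
    if pos < len(offsets) and offsets[pos] == target:
        return pos
    return None


if __name__ == "__main__":
    segment = list(range(100, 120))          # 20 messages, offsets 100..119
    index = build_sparse_index(segment, 5)   # only 4 index entries
    print(index)
    print(find_position(segment, index, 113))
```

The index here holds 4 entries for 20 messages, and a lookup scans at most `every - 1` messages past the indexed position.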

Question 80

Which of the following descriptions about the deployment of big data components on Kunpeng and X86 servers is correct?

Options:

A.

No shortcomings in performance

B.

Single component (for example: HDFS) supports mixed deployment of Kunpeng server and X86 server

C.

Supports mixed deployment of Kunpeng servers and ordinary X86 servers in a single cluster

D.

Realize the autonomous control of some equipment

Question 81

In Kafka HA, when the leader corresponding to the partition is down, a new leader needs to be elected from the followers. Which of the following roles should be executed?

Options:

A.

Follower

B.

Controller

C.

Broker

D.

Leader

Question 82

After a topology is submitted with the Streaming client shell command in the FusionInsight HD system, the Storm UI shows that the topology has not processed data for a long time. What are the possible reasons?

Options:

A.

Supervisor is the component that receives data in topology and then performs processing

B.

There is a logic error in the topology business, and it cannot run normally after submission

C.

The topology is too complex or the concurrency is too high, so the worker takes too long to start and exceeds the Supervisor's waiting time

D.

The supervisor's slots resources are exhausted, and after the topology is submitted, the slots cannot be allocated to start the worker process.

Question 83

In Huawei's big data solution, which of the following components are included in the Hadoop layer?

Options:

A.

Miner

B.

Spark

C.

Hive

D.

Flink

Question 84

Which of the following data sources can exchange data with FusionInsight HD through Loader?

Options:

A.

MySQL

B.

NoSQL

C.

FTP Server

D.

SFTP Server

Question 85

In the FusionInsight HD platform, which components support column encryption?

Options:

A.

HDFS

B.

Flink

C.

HBase

D.

Hive

Question 86

Which of the following functions can the Kafka Cluster Mirroring tool achieve?

Options:

A.

Kafka cross-cluster data synchronization scheme

B.

Kafka data backup within a single cluster

C.

Kafka data recovery within a single cluster

D.

None of the above

Question 87

What are the main characteristics of big data analysis related technologies?

Options:

A.

machine learning, full features

B.

Event correlation analysis behind the data

C.

Based on massive data

D.

based on exact samples

Question 88

Which of the following statements about the interaction between Flink and other components is correct?

Options:

A.

The implementation of Flink's checkpoint relies on Zookeeper

B.

Flink can send received components to Kafka

C.

The running of Flink tasks relies on YARN for resource scheduling and management

D.

Flink reads and writes data in HDFS file system

Question 89

What types of operations does the following statement contain? SELECT aa.salary, bb.address FROM employee aa JOIN (SELECT address FROM employee_info WHERE province='zhejiang') bb ON aa.name=bb.name

Options:

A.

create table

B.

Import Data

C.

subquery

D.

JOIN query
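
The shape of the statement in this question can be tried out locally. The sketch below runs a comparable query in SQLite through Python's standard `sqlite3` module rather than in Hive. The table columns and sample rows are invented for illustration, and the inner SELECT also returns `name`, an added assumption so that the join condition has a column to match on.

```python
import sqlite3


def salaries_with_addresses(province):
    """Join employee salaries with addresses filtered by province in a subquery."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE employee (name TEXT, salary INTEGER);
        CREATE TABLE employee_info (name TEXT, address TEXT, province TEXT);
        INSERT INTO employee VALUES ('ann', 100), ('bob', 200);
        INSERT INTO employee_info VALUES
            ('ann', 'hangzhou', 'zhejiang'),
            ('bob', 'nanjing', 'jiangsu');
        """
    )
    rows = conn.execute(
        """
        SELECT aa.salary, bb.address
        FROM employee aa
        JOIN (SELECT name, address
              FROM employee_info
              WHERE province = ?) bb
          ON aa.name = bb.name
        """,
        (province,),
    ).fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    print(salaries_with_addresses("zhejiang"))
```

The parenthesised inner SELECT is evaluated as a derived table named `bb`, and the outer query joins `employee` against it.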

Question 90

In Hive, which of the following descriptions about bucketing is correct?

Options:

A.

Bucketing places different data into different buckets according to the bucketing rule

B.

Data within a bucket cannot be sorted

C.

The advantage of bucketing is that it can achieve higher query processing efficiency and make sampling more efficient

D.

You can specify the number of buckets when creating a table
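
Bucketing assigns each row to a bucket by hashing the bucketing column and taking the result modulo the number of buckets. The Python sketch below illustrates the assignment for an integer key, assuming the hash of an integer is the integer itself; the function names and sample rows are hypothetical, and this is not Hive code.

```python
def bucket_for(key: int, num_buckets: int) -> int:
    """Bucket index for an integer key (hash assumed to be the value itself)."""
    return key % num_buckets


def bucketize(rows, key_fn, num_buckets):
    """Distribute rows into num_buckets lists according to their key."""
    buckets = [[] for _ in range(num_buckets)]
    for row in rows:
        buckets[bucket_for(key_fn(row), num_buckets)].append(row)
    return buckets


if __name__ == "__main__":
    rows = [(1, "a"), (2, "b"), (5, "c"), (8, "d")]
    for i, bucket in enumerate(bucketize(rows, lambda r: r[0], 4)):
        print(i, bucket)
```

Because rows with the same key always land in the same bucket, reading a single bucket gives a cheap sample, and joins on the bucketing column can be matched bucket by bucket.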

Question 91

Which of the following scenarios in HBase will trigger the Flush operation?

Options:

A.

HBase periodically flushes the MemStore; the default period is 1 hour

B.

When the number of files in WALs reaches a threshold

C.

The total size of the MemStore in the Region has reached the preset Flush Size threshold

D.

The ratio of the total memory occupied by the MemStore to the total memory of the RegionServer exceeds the preset threshold size

Question 92

Which of the following nodes are included in the basic system architecture of HDFS?

Options:

A.

ResourceManager

B.

NameNode

C.

NodeManager

D.

DataNode

Question 93

What are the correct understandings and descriptions of the main features of big data?

Options:

A.

Many data sources and formats

B.

Fast data growth and fast processing

C.

Large amount of data, large amount of calculation

D.

Low data value density, high value

Question 94

If part of the information is allowed to be lost during message processing, which of the following methods are used to close the message reliability processing mechanism?

Options:

A.

Set the parameter Config.TOPOLOGY_ACKERS to 0

B.

Set Config.TOPOLOGY_ACKERS to False

C.

When the spout sends a message, use the interface that does not specify a message ID

D.

When the bolt sends a message, use the unanchored method

Question 95

Which of the following are actual application scenarios of Kafka?

Options:

A.

log collection

B.

Site activity tracking

C.

Resource management

D.

Aggregate statistical system operation data

Question 96

Which of the following statements about Transformation in Flink is correct?

Options:

A.

The time window can be set by window

B.

The Filter operation is to perform a Boolean function on each element

C.

FlatMap can segment text

D.

KeyBy groups the source data by key to ensure that data with the same key is placed in the same group
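
The transformations named in these options can be pictured with plain Python analogues. The sketch below is not the Flink API: the function names are illustrative, lists stand in for streams, and windowing is left out.

```python
from collections import defaultdict


def flat_map_split(lines):
    """FlatMap analogue: each input line yields zero or more words."""
    return [word for line in lines for word in line.split()]


def filter_elements(items, predicate):
    """Filter analogue: keep the elements for which the Boolean function is true."""
    return [item for item in items if predicate(item)]


def key_by(items, key_fn):
    """KeyBy analogue: elements with the same key end up in the same group."""
    groups = defaultdict(list)
    for item in items:
        groups[key_fn(item)].append(item)
    return dict(groups)


if __name__ == "__main__":
    words = flat_map_split(["to be", "or not to be"])
    print(words)
    print(filter_elements(words, lambda w: len(w) > 2))
    print(key_by(words, lambda w: w))
```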

Question 97

Which of the following descriptions about the HBase storage model is correct?

Options:

A.

Each KeyValue has a Qualifier identifier

B.

The same key value key is associated with multiple values

C.

A KeyValue carries key information such as the timestamp and type

D.

There may be more than one KeyValue with the same key and the same Qualifier; in that case, the timestamp is used to distinguish them
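
An HBase KeyValue is identified by its row, column family, qualifier, timestamp and type, so several versions of the same cell can coexist and are told apart by timestamp. The Python sketch below models that with a named tuple; the `Cell` class, its field names and the sample data are illustrative assumptions, not the HBase client API.

```python
from typing import NamedTuple, Optional


class Cell(NamedTuple):
    """Toy stand-in for an HBase KeyValue."""
    row: str
    family: str
    qualifier: str
    timestamp: int
    cell_type: str  # e.g. "Put" or "Delete"
    value: bytes


def versions(cells, row, family, qualifier):
    """All versions of one cell, newest first."""
    matches = [c for c in cells
               if (c.row, c.family, c.qualifier) == (row, family, qualifier)]
    return sorted(matches, key=lambda c: c.timestamp, reverse=True)


def latest(cells, row, family, qualifier) -> Optional[Cell]:
    """Newest version of one cell, or None if the cell does not exist."""
    found = versions(cells, row, family, qualifier)
    return found[0] if found else None


if __name__ == "__main__":
    cells = [
        Cell("r1", "cf", "name", 10, "Put", b"old"),
        Cell("r1", "cf", "name", 20, "Put", b"new"),
        Cell("r1", "cf", "age", 15, "Put", b"30"),
    ]
    print(latest(cells, "r1", "cf", "name"))
```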
