Black Friday Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 70percent

Huawei H13-723_V2.0 HCIP-Big Data Developer V2.0 Exam Practice Test

Demo: 78 questions
Total 526 questions

HCIP-Big Data Developer V2.0 Questions and Answers

Question 1

Fusionlnsigt HD the user wants to passHBase shelloperation to query aHBaseThe contents of the table, this scenario is pushed down

It is recommended that the administrator assign a machine account to this user.

Options:

A.

True

B.

False

Question 2

Fusionlnsigt HD one deployed inTomcatapplication on theHBaseservice, it is recommended to use in this scenario

machine account.

Options:

A.

True

B.

False

Question 3

Suppose there is an application that needs to be accessed frequentlyOracleThe user table in the database, in order to improve performance, introduceRedisto cache

account information.

For this scene,RedisWhich of the following is the best data structure choice for ?

Options:

A.

list

B.

set

C.

hash

D.

sorted set

Question 4

FusionInsight HDin the system,hiveWhat are the supported interface types?

Options:

A.

JDB, Cinterface

B.

ODB, Cinterface

C.

Pythoninterface

D.

Rubyinterface

Question 5

existSpark, assuminglinesIs anDStreamobject,filterStatements can be filtered out80%data for the following two

The correct statement is:

X: lines.filter(…).groupByKey(…)

Y: lines.groupByKey(…).filter(…)

Options:

A.

XCompareYhigher performance

B.

XCompareYlower performance

C.

XandYsame as performance

D.

UncertaintyXandYdifference in performance

Question 6

aboutFlumeWhich of the following descriptions is correct?

Options:

A.

(1) data fromsourcearrivechannel;(2)sinkfromchannelfetch data

B.

(1) data fromchannelarrivesource;(2)sinkfromsourcefetch data

C.

(1) data fromsinkarrivechannel;(2)sourcefromchannelfetch data

D.

(1) data fromchannelarrivesource;(2)sinkfromsourcefetch data

Question 7

When the cluster is normal,RedisClient initiates oncegetCall, the client has () times of message interaction with the server?

Options:

A.

1

B.

2

C.

3

D.

4

Question 8

Which of the following is notMapReducespecialty?

Options:

A.

easy to program

B.

good scalability

C.

real-time computing

D.

High fault tolerance

Question 9

FusionInsight HDWhat are the supported ways to submitOozieOperation? (multiple choice)

Options:

A.

ShellOrder

B.

Java API

C.

Hue UI

D.

Rest API

Question 10

RDDasSparkThe core object, which has the following characteristics? (multiple choice)

Options:

A.

read only

B.

partition

C.

fault tolerance

D.

efficient

Question 11

HDFSIn application development of , which of the following fields areHDFSInterfaces supported by the service? (multiple choice)

Options:

A.

BufferedOutputStream.write

B.

BufferedOutputStream.flush

C.

FileSystem.create

D.

FileSystem.append

Question 12

Fusionlnsigt HDWhat distributed computing frameworks do big data platforms provide? (multiple choice)

Options:

A.

HBase

B.

Spark

C.

MapReduce

D.

Streaming

Question 13

Suppose there is an application with10Tables, each table has tens of millions of records, and the number of fields is about20indivual.

Currently usingRedisto cache this10The data of a table, the design of its data structure, which of the following is the best design?

Options:

A.

usehashstructure, and a table uses ahash key, a row in the table records forhash keyone offield.

B.

usehashstructure, each row record of each table uses onehash key,hash keyoffieldThe fields of the corresponding table records,

and designKEYEach table adds a different prefix to distinguish it.

C.

usestringstructure, each field of each row record of each table uses oneKEY.

D.

usestringstructure, each row record of each table uses oneKEY,valueAll fields recorded for a row in a table? ? back

value of .

Question 14

FusionInsight HDWhich of the following belong toOozieofMapReduce Actionconfiguration item? (multiple choice)

Options:

A.

name-node

B.

source

C.

mapred.mapper.class

D.

job-tracker

Question 15

Suppose there is an application that needs to be accessed frequentlyOracleThe user table in the database, in order to improve performance, introduce

Redisto cache user information. For this scene,RedisWhich of the following is the best data structure choice for ?

Options:

A.

List

B.

Set

C.

Hash

D.

Sorted set

Question 16

pass throughHBaseofcreateTableThe method creates a table, what parameters must be passed in?

Options:

A.

Table Name

B.

table names and columns

C.

Table names and column families

D.

can be empty

Question 17

existFusionInsight HDproductSolrDuring application development, you canSolr Admin UIrightCollectiondo some tests

certificate.

Below aboutSolr Admin UIIs the statement correct? (multiple choice)

Options:

A.

clickClouddownTree,CheckSolr CloudmiddleCollections, a configuration set, andlive_nodesand other metadata information

B.

clickCloud, see eachCollectionunder eachShardReplicadistribution and status

C.

enter aCollectionofShard ReplicaofCore Overviewinterface, you can view theReplicaIndex the actual document

Number of bars, storage size and location information

D.

Solruser group,Solr adminrole andSuper groupof users have access toSolr Admin UI

Question 18

FusionInsight HDin, aboutHiveandSparksql, which of the following descriptions are correct? (multiple choice)

Options:

A.

Both are available externallySQLor classSQLability

B.

Both can run onMapReduceon the computational framework

C.

Both are available via table fields andHBaseAccess by field mappingHBasedata on

D.

Both can run onSparkon the computational framework

Question 19

FusionInsight HDmiddle,StreamingWhat are the characteristics of? (multiple choice)

Options:

A.

stream processing

B.

high performance

C.

Scalable

D.

High fault tolerance

Question 20

existSpark, assuminglinesIs anDStreamObject, which of the following statements can periodically count the number of words on this stream?

Options:

A.

lines.flatMap(_.split??)).map(word=>(word,1)).reduce(_+_).print ( )

B.

lines.flatMap(_.split??)).map(word=>(word,word.length())).reduceByKey(_+_).print ( )

C.

lines.flatMap(_.split??)).map(word=>(word,1)).reduceByKey(_+_).print ( )

D.

lines.flatMap(_.split??)).flatmap(word=>(word,1)).groupByKey(_+_).print ( )

Question 21

Set the block storage size to128M,HDFSWhen the client is writing a file, when writing a100Msize file, real.

How much storage space does it take up?

Options:

A.

128M

B.

100M

C.

64M

D.

50M

Question 22

existFusionInsight HDcluster, aboutkinitOperation command, which of the following statements is wrong? (multiple choice)

Options:

A.

Only the HMI account can be used.

B.

Only the machine account can be used.

C.

A client does not support the simultaneous use of multiple accounts.

D.

The ticket obtained by executing this command is intwenty fourIt will time out when it is small and needs to be executed againkinitcommand to log in again.

Question 23

A project requires Internet access to a certain area? ?Save it, and search the full text of these Internet records to see if there is any? ?information, with

to prevent crime in the region.

In this scenario, which of the following options is the best?

Options:

A.

existSolrCreate an index and save the data, and return all the data during full-text search.

B.

existHBasestore data on theHBaseThe filtering characteristics of , satisfy fuzzy matching query.

C.

When storing data, inSolrindex on theHBaseStore complete data? ?when, throughSolrFull-text search to obtain and record

record key information, through the key information inHBasefor the full record.

Question 24

FusionInsight HDsystematicV100R002C60version,HiveOnly supports based onMapReduceEngine query service,

not supported based onSparkEngine query service.

Options:

A.

True

B.

False

Question 25

existMapReduceIn the development framework,InputFormatWhat is the function of the class?

Options:

A.

mergemapThe result of the stage has the samekeyvalue key-value pair

B.

put allmapThe results of the stages are merged to get the final result

C.

rightmapThe results of the stages are sorted and merged

D.

Split the input data into individualsplit, and willsplitfurther split intoyes, then asmapletter input of numbers

Question 26

There are the following scenarios: new data is generated by the online system every day500G, you need to make statistics on these data by day, week, month and other dimensions summary.

ask if it is suitable for useHiveWhat kind of table to handle?

Options:

A.

Bucket table, bucket according to day

B.

Partition table, partitioned by day

C.

skew table, skew field select date, skew value random

D.

partition+Bucket table, partitioned by time, each partition2560buckets

Question 27

writingMapReduceWhich two interfaces are usually required to be implemented by developers?

Options:

A.

mapandcombine

B.

reduceandcombine

C.

combineandsort

D.

mapandreduce

Question 28

Which of the following methods can generateDStreamobject?

Options:

A.

KafkaUtils.createStream(…)

B.

KafkaUtils.createDirectStream(…)

C.

StreamingContext.socketStream

D.

StreamingContext.fileStream(…)

Question 29

existSpark, assuminglinesIs anDStreamobject,filterStatements can be filtered out80%data for the following two

The correct statement is:

X: lines.filter(…).groupByKey(…)

Y: lines.groupByKey(…).filter(…)

Options:

A.

XCompareYhigher performance

B.

XCompareYlower performance

C.

XandYsame as performance

D.

UncertaintyXandYdifference in performance

Question 30

FusionInsight HDofHive, user-definedUDFcan andHiveBuilt-inUDFduplicate name, in this case,

will use user-definedUDF.

Options:

A.

True

B.

False

Question 31

aboutRedisCluster topology information, is the following description correct?

Options:

A.

The client caches the topology information of the cluster

B.

The server caches the topology information of the cluster

C.

both are

D.

more than two

Question 32

FusionInsight HDin, yesSolrThe creation of various resources and the use of read and write permissions, which of the following statements is wrong?

Options:

A.

SolrUser group users can create configuration sets and createCollectionwhen specifying the configuration set

B.

SolrUsers of the user group can createCollection

C.

Collectioncreator, theCollectionroles with read and write permissions andSolr adminrole can beCollectionEnter

row update

D.

onlyCollectioncreator andSolr adminRoles can be deletedCollection

Question 33

existKafka, which of the following commands can view aTopicHow many partitions are there?

Options:

A.

Kafka-topics.sh–describe

B.

Kafka-topics.sh–list

C.

kafka-topics.sh–alter

D.

kafka-topics.sh–create

Question 34

Fusionlnsiht HDmiddle,Oozieclient'sJava APIwill be called when the task is runOozieClientWhich method of the class?

Options:

A.

suspend

B.

getJoblnfo

C.

submit

D.

run

Question 35

HDFSRuntime,NameNodewill load all the metadata of the file system from disk into memory, so the file system can

The total number of files stored is limited byNameNodememory capacity.

Options:

A.

True

B.

False

Question 36

forFusionInsight HDplatformHBaseComponent, which properties of the secondary index need to be defined to add a secondary index? (multiple choice)

Options:

A.

index name

B.

index column

C.

index column type

D.

The name of the column family to which the indexed column belongs

Question 37

YarnWhich role is to manage individual node resources (CPU/Memory)of?

Options:

A.

NodeManager

B.

Resource Manager

C.

Data Node

D.

NameNode

Question 38

FusionInsight HDin, belonging toStreamingWhat are the roles of the service? (multiple choice)

Options:

A.

Nimbus

B.

Supervisor

C.

Broker

D.

quorumpeer

Question 39

FusionInsight HDmiddle,HiveWhich of the following parameters of the client is used to controlreduceConcurrency? (multiple choice)

Options:

A.

hive.exec.reducers.bytes.per.reducer

B.

hive.exec-ducers.max

C.

hive.exec.max.dynamic.partitions

D.

hive.exec.max.dynamic.partitions.pernode

Question 40

existSpark, assuminglinesIs anDStreamObject, which of the following statements can periodically count the number of words on this stream?

Options:

A.

lines.flatMap(_.split??)).map(word=>(word,1)).reduce(_+_).print ( )

B.

lines.flatMap(_.split??)).map(word=>(word,word.length())).reduceByKey(_+_).print ( )

C.

lines.flatMap(_.split??)).map(word=>(word,1)).reduceByKey(_+_).print ( )

D.

lines.flatMap(_.split??)).flatmap(word=>(word,1)).groupByKey(_+_).print ( )

Question 41

FusionInsight HDsystem, aboutHiveofJDB, CInterface type, which of the following descriptions is correct of?

Options:

A.

In a network isolation environment,ZooKeeperofIPCan be configured as a businessIP, can also be configured to manageIP.

B.

hiveUnlike traditional databases, soJDB, CThe established connection does not need to be closed.

C.

In a secure cluster environment, establish andHiveBefore connecting, you need to use unified authentication to log in.

D.

useJDB, CsubmitSQL, you canSQLPut a semicolon after the statement";"

Question 42

Which of the following application scenarios mainly generate unstructured data?

(multiple choice)

Options:

A.

CRMcustomer relationship management business

B.

Seismic Mapping Analysis Business

C.

Taobao online payment service

D.

Financial Bill Imaging Service

Question 43

existFusionInsight HDcluster, aboutkinitOperation command, which of the following statements is wrong? (many select)

Options:

A.

Only use HMI account.

B.

Only the machine account can be used.

C.

A client does not support the simultaneous use of multiple accounts.

D.

The ticket obtained by executing this command is intwenty fourIt will time out when it is small and needs to be executed againkinitcommand to log in again.

Question 44

SparkandHadoopNeither is suitable for iterative calculation scenarios.

Options:

A.

True

B.

False

Question 45

existFusionInsight HDproduct,KafkaWhat is the role name of the service?

Options:

A.

Producer

B.

Broker

C.

Consumer

D.

ZooKeeper

Question 46

FusionInsight HDin, belonging toStreamingWhat are the methods of data distribution?

(multiple choice)

Options:

A.

mouthShuffle Groupingrandom grouping

B.

mouthField Groupingfield grouping

C.

mouthLocal Groupinglocal grouping

D.

mouthDirect Groupingdirect grouping

Question 47

for running onMapRestoeThe application on the platform, on which this application is basedjarpackage is placed where?

Options:

A.

HIVE

B.

HBASE

C.

HDFS

D.

DB

Question 48

FusionInsight HDin, aboutOoziejob management for suspended jobs

Which of the following operations? (multiple choice)

Options:

A.

mouthkill

B.

mouthrun

C.

mouthresume(wake)

D.

mouthsubmit

Question 49

pass throughHBaseofcreateTableThe method creates a table, what parameters must be passed in?

Options:

A.

Table Name

B.

table names and columns

C.

Table names and column families

D.

can be empty

Question 50

implementHBaseWhat parts of the data need to be read for the data read service?

(multiple choice)

Options:

A.

HFile

B.

HLog

C.

MemStore

D.

HMaster

Question 51

aboutFusionInsight HDplatformHiveservice, itsWebHCatDevelopment interface, the following description does not the correct one is?

Options:

A.

Support based onRESTquery request

B.

WebHCatThe return data format isXML

C.

WebHCatbased onHTTPandHTTPSAgreement to provide services to the outside world

D.

able to passWebHCatCreate tables, query, etc.

Question 52

HDFSWhich of the following business scenarios is not applicable to?

Options:

A.

Mass data storage

B.

Lots of small file storage

C.

large file storage

D.

Streaming file storage

Question 53

HDFSRuntime,(NameNodeAt startup,)NameNodewill file system metadata from

The disk is all loaded into memory, so the total number of files that the file system can store is limited byNameNodeof memory

capacity.

Options:

A.

True

B.

False

Question 54

pass throughHBasefofcreateTableThe method creates a table, what parameters must be passed in?

Options:

A.

Table Name

B.

table names and columns

C.

Table names and column families(family)

D.

can be empty

Question 55

Below aboutZooKeeperThe statement is wrong ().

Options:

A.

If ZooKeeperIf there is an interruption in the process of synchronizing the message, after the failure is recovered, the transmission status before the failure can be adjusted according to the the state continues to synchronize, that is, support for resuming transmission from a breakpoint.

B.

ZooKeeperUse a custom atomic message protocol to ensure the consistency of node data in the entire system.

C.

ZooKeeperThe cluster is elected at startupLeaderRole.

D.

LeaderAfter a node receives a data change request, it first writes to disk and then writes to memory.

Question 56

existBaseIn application development, when a table'sRowkeyRange and distribution are known, pre-score is recommendedregion,

Please call the following code (fragment) to pre-sort a tableregionAfter this table will create severalregion?

splits【0】=Bytes.toBytes("A");splits【1】=Bytes.toBytes("H");splits【2】

=Bytes.toBytes("O");splits【3】=Bytes.toBytes("U");admin.createTable(htd,splits);

Options:

A.

3

B.

4

C.

5

D.

6

Question 57

Which of the following is notMapReducespecialty?

Options:

A.

Easy to program.

B.

Good scalability

C.

Calculate in real time

D.

High fault tolerance

Question 58

existMapReduceIn the development framework,InputFormatWhat is the function of the class?

Options:

A.

mergemapThe result of the stage has the samekeyvalue key-value pair

B.

put allmapThe results of the stages are merged to get the final result

C.

rightmapThe results of the stages are sorted, merged,

D.

Split the input data into individualsplit, and willsplitfurther split intoright, then asmapinput to the function

Question 59

Which of the following causesHDFSofNameNodeEntersafemode(install"aroot form)? (multiple choice)

Options:

A.

Active and standbyNameNodeThe disk space where the metadata is located is insufficient.

B.

The number of missing blocks exceeds the read value.

C.

Lost copies exceed read value.

D.

The corrupted copy exceeds the threshold.

Question 60

existFusionInsight HDclient, executeskinit{account number}command is to getKDCwhich of the item content?

Options:

A.

Krb5

B.

conf

C.

TGT

D.

jaas.conf

Question 61

Fusionlnsigt HD one deployed inTomcatapplication on theHBaseservice, it is recommended to use in this scenario machine account.

Options:

A.

True

B.

False

Question 62

FusionInsight HDin, aboutLoaderIs the job description correct?

Options:

A.

LoaderSubmit the job toARNAfter execution, if at this timeLoaderIf the service is abnormal, this job executes line failed.

B.

LoaderSubmit the job toYARAfter a small execution, if a certainMapperTask execution fails, can be automatically Try again.

C.

LoaderIf the job fails to execute, garbage data will be generated, which needs to be cleared manually by the user.

D.

LoderSubmit an assignment toYARAfter the job is executed, no other jobs can be submitted until the job is executed Industry.

Question 63

Hbase one of Region conduct Split When operating, place aHFileThe file is really separated into two RegionAt what stage does the process take place?

Options:

A.

Splitin the process

B.

Flushin the process

C.

Compactionin the process

D.

HFilein the process of separation

Question 64

existSpark, which of the following statements about broadcast variables is correct? (multiple choice)

Options:

A.

broadcast variable in eachTasksave a copy

B.

broadcast variable in eachExecutorsave a copy

C.

Broadcast variables are read-only

D.

broadcast variable viaSparkContext.broadcast) method to generate

Question 65

FusionInsight HDin, belonging toStreamingWhat are the methods of data distribution? (multiple choice)

Options:

A.

Shuffle Grouping

B.

Field Grouping

C.

Local Grouping

D.

Direct Grouping

Question 66

existSpark, which of the following statements about broadcast variables is correct?

Options:

A.

broadcast variable in everyTasksave a copy

B.

broadcast variable in eachExecutorsave a copy

C.

Broadcast variables are read-only

D.

broadcast variable viaSparkContext.broadcast() method to generate

Question 67

when aMapReduceWhen the application is executed, which of the following actions ismapoccurs before the stage of?

Options:

A.

Split

B.

combine

C.

partition

D.

sort

Question 68

existFusionInsight HDproduct,KafkaWhat is the role name of the service?

Options:

A.

Producer

B.

Broker

C.

Consumer

D.

ZooKeeper

Question 69

Solris a high-performance, basedLucenefull-text search service.SolrrightLuceneexpanded,

provides a ratioLuceneA richer query language and a powerful full-text search function are implemented, with a high degree of reliability.

Extensibility. At the same time fromSolr 4.0Version starts, supportsSolrCloudmodel.

Options:

A.

True

B.

False

Question 70

FusionInsightHDin, aboutHivepartition (partition) function, which is wrong as described below?

Options:

A.

Partition fields are defined when the table is created

B.

There can only be one partition field, and multi-level partitions cannot be created

C.

Using partitions can reduce the data scan range of some queries, thereby improving query efficiency

D.

Partition fields can be used aswherecondition of the sentence

Question 71

FusionInsight HDmiddle,StreamingWhat are the characteristics of? (multiple choice)

Options:

A.

stream processing

B.

high performance

C.

Scalable

D.

High fault tolerance

Question 72

Flumebranch__________real-time data collectionsourceWhich is the type?

Options:

A.

taildir

B.

Log4j

C.

JMS

D.

Thrift

Question 73

existKafka, as follows aboutProducerWhat is wrong with the statement of sending data?

(multiple choice)

Options:

A.

ProducerResponsible for production data,Consumerresponsible for consuming data,ProducerandConsumerneed between

B.

EstablishSocketconnect

C.

Producercan either send data toBroker, and can send data toConsumersuperiorProducer

D.

As a message producer, you can directly write data toZooKeeper Producerconnect any normalBrokerInstances can produce data

Question 74

aboutRedisFeatures, the following description is correct? (multiple choice)

Options:

A.

low latency

B.

Rich data structures

C.

Support data persistence

D.

supportSQLquery interface

Question 75

Oozieofworkflow.xmlfile is the workflow configuration file

Options:

A.

True

B.

False

Question 76

existMapReduceDuring application development,setMapOutputCompressorClassWhat is the role of classes?

Options:

A.

AssumeSmapinput compression method

B.

set upmapHow the output is delivered

C.

set upmapoutput compression method

D.

set upreduceoutput compression method

Question 77

forHBase rowkeyThe design principles described below are correct? (multiple choice)

Options:

A.

Attribute values with high access weight are placed inRowKeyFront

B.

Attribute values with high access weight are placed inRowkeysecond half

C.

Attribute values with good dispersion are placed inRowkeyfirst half

D.

Attribute values with good dispersion are placed inRowkeysecond half

Question 78

Suppose there is an application with10Tables, each table has tens of millions of records, and the number of fields is about20indivual. now

useRedisto cache this10The data of a table, the design of its data structure, which of the following is the best design?

Options:

A.

usehashstructure, and a table uses ahashKEY, a row in the table records forhash keyone of indivualed.

B.

usehashTable structure, each row record of each table uses onehashKEY, hashKEYofebCorrespondence table

C.

record fields, and designKEYEach table adds a different prefix to distinguish it.

useringstructure, each field of each row record of each table uses oneKEY.

D.

usem3structure, each row record of each table uses oneKEY,,aueAll records for a row in the table. The value after the field is concatenated.

Demo: 78 questions
Total 526 questions