63) ________ is used to measure the impact of a set of variables on another variable during data mining.
A) Cluster analysis
B) Context indexing
C) Cloud computing
D) Regression analysis
64) Which of the following statements is true of BigData?
A) BigData contains only structured data.
B) BigData has low velocity and is generated slowly.
C) BigData cannot store graphics, audio, and video files.
D) BigData refers to data sets that are at least a petabyte in size.
65) In the ________ phase, a BigData collection is broken into pieces and hundreds or thousands of independent processors search these pieces for something of interest.
A) crash
B) break
C) reduce
D) map
66) The results generated in the map phase are combined in the ________ phase.
A) pig
B) control
C) reduce
D) construct
67) ________ is an open-source program supported by the Apache Software Foundation that manages thousands of computers and implements MapReduce.
A) Hadoop
B) BigData
C) Linux
D) Apache Wave
68) Which of the following statements is true of Hadoop?
A) Hadoop is written in C++ and runs on Linux.
B) Hadoop includes a query language called Big.
C) Hadoop is an open-source program that implements MapReduce.
D) Technical skills are not required to run and use Hadoop.
69) Reporting analysis is used primarily for classifying and predicting BI data.
70) Structured data is data in the form of rows and columns.
71) With unsupervised data mining, analysts do not create a model or hypothesis before running the analysis.
72) Regression analysis is used to identify groups of entities that have similar characteristics.
73) Cluster analysis measures the impact of a set of variables on another variable.
74) BigData refers to data that have great variety and may include structured data as well as data in other formats.
75) BigData has low velocity and is generated slowly.
76) MapReduce is a technique for harnessing the power of thousands of computers working in parallel.
77) BigData has volume, velocity, and variety characteristics that far exceed those of traditional reporting and data mining.
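
Illustration for the MapReduce items above: a minimal sketch, in Python, of the two phases the questions describe. It is not Hadoop's API. The function names (split_into_pieces, map_piece, reduce_counts, map_reduce) and the sample log lines are hypothetical, and a small thread pool stands in for the hundreds or thousands of independent processors.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def split_into_pieces(records, n_pieces):
    """Break the collection into roughly equal pieces (hypothetical helper)."""
    return [records[i::n_pieces] for i in range(n_pieces)]


def map_piece(piece, term):
    """Map phase: one worker searches only its own piece for the term."""
    return Counter(
        word
        for line in piece
        for word in line.lower().split()
        if word == term
    )


def reduce_counts(left, right):
    """Reduce phase: combine two partial results into one."""
    return left + right


def map_reduce(records, term, n_workers=4):
    pieces = split_into_pieces(records, n_workers)
    # Assumption: threads stand in for independent processors; a real
    # cluster would ship each piece to a different machine.
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(lambda p: map_piece(p, term), pieces))
    # Fold the per-piece counts into a single overall result.
    return reduce(reduce_counts, partials, Counter())


if __name__ == "__main__":
    logs = ["error disk full", "ok", "error timeout", "ok ok", "Error retry"]
    print(map_reduce(logs, "error"))
```

Each worker sees only its own piece during the map phase, so the pieces can be searched in parallel; the reduce phase is the only step that needs all of the partial results together.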