What is Hadoop?
A. Hadoop is a single-node file system used as a base for storing traditional formatted data.
B. Hadoop is a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model.
C. Hadoop is a universal Big Data programming language used to query large datasets.
D. Hadoop is programming language capable of transforming raw, unstructured data into plain, regular data readable by traditional data warehouses.
Which of the following options is NOT CORRECT?
A. Big data solutions are ideal for analyzing not only raw structured data, but semi structured and unstructured data from a wide variety ofsources.
B. Big data solutions are ideal when all, or most, of the data needs to be analyzed versus a sample of the data; or a sampling of data isn nearlyBig data solutions are ideal when all, or most, of the data needs to be analyzed versus a sample of the data; or a sampling of data isn? nearlyas effective as a larger set of data from which to derive analysis.
C. Big data solutions are ideal for Online Transaction Processing (OLTP) environments.
D. Big data solutions are ideal for iterative and exploratory analysis when business measures on data are not predetermined.
Which of the following methods can be used by Cognos to connect with InfoSphere BigInsights? (Choose two)
A. Connection via the Hive JDBC driver.
B. Connection via the SPL JDBC driver.
C. Connection via the Pig JDBC driver.
D. Connection via the Big SQL JDBC driver.
Which of the following statements should be used to import data from DB2 into InfoSphere BigInsights?
A. You can use the Big SQL LOAD FROM statement to load data into a Big SQL table.
B. You can use the Big SQL IMPORT FROM statement to load data into a HBase table.
C. You can use the Big SQL LOAD FROM statement to load data into a Hive table.
D. You can use the Big SQL IMPORT FROM statement to load data into a Big SQL table.
How is data stored in a Hadoop cluster?
A. The data is divided into blocks, and copies of these blocks are replicated across multiple servers in the Hadoop cluster.
B. The data converted into a single block, and the block is stored in just one of the servers in the Hadoop cluster.
C. The data is divided into blocks, each block is stored in a different server in the Hadoop cluster, but the blocks are not replicated.
D. The data is converted into a single block, and copies of this block are replicated across multiple servers in the Hadoop cluster.
Which of the following options best describes the proper usage of MapReduce jobs in Hadoop environments?
A. MapReduce jobs are used to process vast amounts of data in-parallel on large clusters of commodity hardware in a reliable, fault-tolerant manner.
B. MapReduce jobs are used to process small amounts of data in-parallel on expensive hardware, without fault-tolerance.
C. MapReduce jobs are used to process structured data in sequence, with fault-tolerance.
D. MapReduce jobs are used to execute sequential search outside the Hadoop environment using a built-in UDF to access information stored in non-relational databases.
InfoSphere BigInsights offers the following benefits to your organization, EXCEPT:
A. It cuts costs by providing an efficient compression algorithm based on row level compression using a dictionary to store repetitive patterns.
B. It complements your existing infrastructure by extending your data collection and analysis capabilities.
C. It integrates data from a variety of structured and unstructured sources.
D. It enables analysis at scale.
Which of the following options contain security enhancements available in InfoSphere BigInsights (choose two) ?
A. LDAP authentication
B. Secure file transfers through SFTP protocol.
C. Trusted Context.
D. Kerberos authentication protocol.
Which of the following integrated development environment (IDE) can be used to develop programs to be used with InfoSphere BigInsights?
A. InfoSphere Data Explorer.
B. IBM Data Studio with InfoSphere Data Explorer plug-in.
C. InfoSphere Streams Studio
D. Eclipse with InfoSphere BigInsights plug-in.
Which of the following compression algorithms is used by InfoSphere BigInsights to provide an additional compression option over the ones that come with the base Hadoop distribution?
A. gzip.
B. brzip2.
C. lza.
D. lzo.