Which statement is TRUE concerning optimizing the load performance?
A. You can improve the performance by increasing the number of map tasks assigned to the load
B. When loading large files the number of files that you load does not impact the performance of the LOAD HADOOP statement
C. You can improve the performance by decreasing the number of map tasks that are assigned to the load and adjusting the heap size
D. It is advantageous to run the LOAD HADOOP statement directly pointing to large files located in the host file system as opposed to copying the files to the DFS prior to load
Which of the following is TRUE about storing an Apache Spark object in serialized form?
A. It is advised to use Java serialization over Kryo serialization
B. Storing the object in serialized from will lead to faster access times
C. Storing the object in serialized from will lead to slower access times
D. All of the above
A Resilient Distributed Dataset supports which of the following?
A. Creating a new dataset from an old one
B. Returning a computed value to the driver program
C. Both "Creating a new dataset from an old one" and "Returning a computed value to the driver program"
D. Neither "Creating a new dataset from an old one" nor "Returning a computed value to the driver program"
Which of the following statements regarding Sqoop is TRUE? (Choose two.)
A. All columns in a table must be imported
B. Sqoop bypasses MapReduce for enhanced performance
C. Each row from a source table is represented as a separate record in HDFS
D. When using a password file, the file containing the password must reside in HDFS
E. Multiple options files can be specified when invoking Sqoop from the command line
What does the acronym "PCI" stand for in the phrase "PCI compliant"?
A. Payment Card Industry
B. Personal Credit and Income
C. Premium Credit Inspection
D. Proactive Controls Implementation
Which of the following statement is TRUE with BigSheets?
A. You can create any type of sheet from a parent workbook
B. You must create a child workbook in order to create a chart
C. You can delete a parent workbook without deleting the child workbooks
D. You must run the workbook on the data to get the full results of the analysis
Which annotation is used to exclude operators from a consistent region?
A. @consistent
B. @standalone
C. @indepedent
D. @autonomous
Which Big SQL file format could be expected to result in a longer running query?
A. Text
B. Avro
C. Parquet
D. Sequence_text
Which of the following must happen before the Big SQL EXPLAIN command can execute?
A. Run the ANALYZE command
B. Set the COMPATIBILITY_MODE global variable
C. Execute the SET HADOOP PROPERTY command
D. Call the SYSPROC.SYSINSTALLOBJECTS procedure
Which of the following is TRUE regarding a schema In Jaql?
A. It specifies a particular type for each field
B. It is used only when Jaql includes SQL statements
C. It must be provided when the data format is not otherwise known
D. It can be used both as a data constraint and to optimize processing and storage