Redundant physical infrastructure
Integrated System
Integrated Database
All of the mentioned above
A. Redundant physical infrastructure
Default location of Hadoop configuration is in $HADOOP /conf/ HOME
If $HADOOP HOME is specified, Sqoop will utilise the default installation location
default location of Hadoop configuration is in $HADOOP HOME/conf/
Sqoop command-line tool serves as a wrapper for the bin/hadoop script that is included with Hadoop as a base.
Mapreduce
Spark
Hive
All of the mentioned above
Large-scale
Cloud-based
Data warehousing
All of the mentioned above
A learning model
Uses observations
Develop conclusions
All of the mentioned above
Cloud computing
Power BI
System development
None of the mentioned above
Quality or fidelity of data
Large size of the data that cannot be process
Small size of the data that can easily process
All of the mentioned above
Heterogeneous
Storage
Network
None of the mentioned above
Cluster
Data Node
Master Node
None of the mentioned above
2
3
4
5
Master-slave architecture
Master-worker architecture
Worker-slave architecture
All of the mentioned above
MapReduce, Hive and HBase
Hive, Spark and HBase
Spark, Hive and ZooKeeper
Spark, HBase and Hive
Application software
System software
Operating System
None of the mentioned above
Data can arrive at fast speed
Enormous datasets can accumulate within very short periods of time
Velocity of data translates into the amount of time it takes for the data to be processed
All of the mentioned above
Traditional database
Operational database
Database Management System
None of the mentioned above
Orchestration
HBase
HDFS
None of the mentioned above
Input data
Output data
Process data
All of the mentioned above
Account data
Historical data
Financial data
None of the mentioned above
Query
Statement
Function
None of the mentioned above
Architectural layer
Fundamental layer
Backpropogation layer
None of the mentioned above
Process of attaching meaning to the data
Convert text into insightful information
Effective conclusion
All of the mentioned above
Schema
Table
Both A and B
None of the mentioned above
Maptask
Task execution
Mapper
All of the mentioned above
Translating voice to text for mobile phone messaging
Investment portfolio development
Weather forecasts
All of the mentioned above
t-test
mean
standard deviation
range
Hive is a relational database that supports SQL queries.
Pig is a relational database that supports SQL queries.
Both A and B
None of the mentioned above
C#
C
Java
None of the mentioned above
Operating system
Resource sharing
System integration
None of the mentioned above
Reducer
Mapper
File system
All of these
Intrinsic
Extrinsic
Both A and B
None of the mentioned above
Parallel data processing
Single channel processing
Multi data processing
None of the mentioned above