Querying tool
Mapper
MapReduce
All of the mentioned above
A. Querying tool
SQL
NoSQL
PL / SQL
None of the mentioned above
To visualize the data
To access the data
To process the data
All of the mentioned above
TaskReduce
MapReduce
TaskTracker
JobTracker
Data
Knowledge
Program
Algorithm
Internally managed data
Data feeds from external sources
It provides access to every layer and component of the big data stack
All of the mentioned above
Data and storage
Analysis and reporting
System and development
None of the mentioned above
Application software
System software
Operating System
None of the mentioned above
Algorithmic techniques
Modeling techniques
System development and design techniques
None of the mentioned above
Schema
Table
Both A and B
None of the mentioned above
MapReduce, Hive and HBase
Hive, Spark and HBase
Spark, Hive and ZooKeeper
Spark, HBase and Hive
A function that takes one or more columns from a row as arguments
It returns a single value
Both A and B
None of the mentioned above
Processing of data
User friendly representation
Both A and B
None of the mentioned above
Transportation of data from the ingestion layer to the rest of the data pipeline
Data storage
Data identification
None of the mentioned above
Big Data ingestion pipeline is divided into different layers
Each layer performs a particular function
Both A and B
None of the mentioned above
Translating voice to text for mobile phone messaging
Investment portfolio development
Weather forecasts
All of the mentioned above
Java
C#
C
C++
Parallel data processing
Single channel processing
Multi data processing
None of the mentioned above
Tera
Giga
Peta
Meta
A tool used in predictive analytics
A process that uses data mining and statistics to develop models
Examine current and historical datasets for underlying patterns
All of the mentioned above
Operational data source
Qualitative data source
Both A and B
None of the mentioned above
A learning model
Uses observations
Develop conclusions
All of the mentioned above
Quality or fidelity of data
Large size of the data that cannot be processed
Small size of the data that can easily be processed
All of the mentioned above
Data Node
Block Size
Data block
NameNode
Virtual representation of a physical computer
Virtual representation of a logical computer
Virtual System Integration
All of the mentioned above
Intrinsic
Extrinsic
Both A and B
None of the mentioned above
Redundant physical infrastructure
Integrated System
Integrated Database
All of the mentioned above
Hive is a relational database that supports SQL queries.
Pig is a relational database that supports SQL queries.
Both A and B
None of the mentioned above
DBMS
NoSQL
Data store
None of the mentioned above
Discrete variable
Quantitative variable
Qualitative variable
Superlative variable