Application software
System software
Operating System
None of the mentioned above
C. Operating System
Volume
Veracity
Variety
None of the mentioned above
Analyzes data
Data storage
Data ingestion
None of the mentioned above
Operating system
Resource sharing
System integration
None of the mentioned above
Query
Statement
Function
None of the mentioned above
High availability through built-in replication and failover
Management tooling for automation, monitoring, and backup
Fully elastic database as a service with built-in best practices
All of the mentioned above
Accurate
Inconsistent
Variant
None of the mentioned above
Large-scale
Cloud-based
Data warehousing
All of the mentioned above
Webpages
All of the mentioned above
Master-slave architecture
Master-worker architecture
Worker-slave architecture
All of the mentioned above
Data Node
Block Size
Data block
NameNode
HBase
HDFS
Hive
MapReduce
Internally managed data
Data feeds from external sources
It provides access to every layer and component of the big data stack
All of the mentioned above
Transportation of data from the ingestion layer to the rest of the data pipeline
Data storage
Data identification
None of the mentioned above
SQL
DBMS
NoSQL
RDBMS
HDFS file system is well suited for storing data associated with applications that require low latency data access.
HDFS is well suited for storing data connected to applications that require low-latency data access.
HDFS is not suited for instances in which multiple/simultaneous writes to the same file are required.
None of the mentioned above
Big data
Data science
Data integration
None of the mentioned above
Reducer
Mapper
File system
All of these
Big Data ingestion pipeline is divided into different layers
Each layer performs a particular function
Both A and B
None of the mentioned above
Data can arrive at fast speed
Enormous datasets can accumulate within very short periods of time
Velocity of data translates into the amount of time it takes for the data to be processed
All of the mentioned above
Public Cloud
Private Cloud
Hybrid Cloud
All of the mentioned above
Includes multiple formats and types of data
Includes structured data in the form of financial transactions
Includes semi-structured data in the form of emails and unstructured data in the form of images
All of the mentioned above
Tera
Giga
Peta
Meta
Hive
HBase
Analysis and reporting
All of the mentioned above
To visualize the data
To access the data
To process the data
All of the mentioned above
Business needs
Facilitates cloud computing and
Hypervisor
Virtualization
Indexing and Cataloging, Replication
File Storage and Structure, Query Processing
Transactions Support
All of the mentioned above
Show functions
Describe function
Both A and B
None of the mentioned above
Oozie
ZooKeeper
MapReduce
All of the mentioned above
Python
C++
R
Java
Data
Knowledge
Program
Algorithm