Webpages
All of the mentioned above
D. All of the mentioned above
HBase
HDFS
Hive
Mapreduce
Structured format
Unstructured format
Semi-structured format
None of the mentioned above
Virtual representation of a physical computer
Virtual representation of a logical computer
Virtual System Integration
All of the mentioned above
SQL
NoSQL
PL / SQL
None of the mentioned above
Oozie
Zookepper
MapReduce
All of the mentioned above
Translating voice to text for mobile phone messaging
Investment portfolio development
Weather forecasts
All of the mentioned above
A learning model
Uses observations
Develop conclusions
All of the mentioned above
Cost Reduction
Time Reductions
Smarter Business Decisions
All of the mentioned above
Apache Hadoop
Apache Spark
Apache Kafka
Apache Pytarch
Big Data ingestion pipeline is divided into different layers
Each layer performs a particular function
Both A and B
None of the mentioned above
Volume
Veracity
Variety
None of the mentioned above
Predicted variables
Descriptive variables
Prescriptive variables
All of the mentioned above
No decisions
Better decisions
Unpredictable things
None of the mentioned above
Algorithmic techniques
Modeling techniques
System development and design techniques
None of the mentioned above
Primary Index
Secondary Index
Complex Index
None of the mentioned above
Cluster
Data Node
Master Node
None of the mentioned above
Tera
Giga
Peta
Meta
Python
C++
R
Java
Hive is a relational database that supports SQL queries.
Pig is a relational database that supports SQL queries.
Both A and B
None of the mentioned above
Linear Regression
Time series analysis and forecasting
Data Mining
All of the mentioned above
Internally managed data
Data feeds from external sources.
It provides access to each and every layer & components of big data stack
All of the mentioned above
Reducer
Mapper
File system
All of these
Operational data source
Qualitative data source
Both A and B
None of the mentioned above
Accurate
Inconsistence
Variant
None of the mentioned above
Data Node
Block Size
Data block
NameNode
Redundant physical infrastructure
Integrated System
Integrated Database
All of the mentioned above
Account data
Historical data
Financial data
None of the mentioned above
Cloud computing
Security management
Integrated approach
None of the mentioned above
Distributed computing
Cluster computing
Parallel computing
All of the mentioned above
Data
Knowledge
Program
Algorithm