Spark
HBase
Hive
Pig
D. Pig
Tera
Giga
Peta
Meta
Includes multiple formats and types of data
Includes structured data in the form of financial transactions,
Includes semi-structured data in the form of emails and unstructured data in the form of images
All of the mentioned above
sqoop-list-databases
Hbase-list
hive schema
sqoop-list-columns
Traffic and Engagement Reports
Financial Statement Analysis
Demand Trends and Aggregated Survey Results
All of the mentioned above
Processing of data
User friendly representation
Both A and B
None of the mentioned above
Java
C#
C
C++
Students roll number, age
Videos
Audio files
Both B and C
Predictive models
Descriptive models
Decision models
All of the mentioned above
Big data
Data science
Data integration
None of the mentioned above
Quality or fidelity of data
Large size of the data that cannot be process
Small size of the data that can easily process
All of the mentioned above
Transportation of data from the ingestion layer to the rest of the data pipeline
Data storage
Data identification
None of the mentioned above
Java
PHP
C#
None of the mentioned above
Oozie
Zookepper
MapReduce
All of the mentioned above
High availability through built-in replication and failover
Management tooling for automation, monitoring, and backup
Fully elastic database as a service with built-in best practices
All of the mentioned above
HDFS file system is well suited for storing data associated with applications that require low latency data access.
HDFS is well-suited for storing data connected to applications that require low-latency data access to be performed.
HDFS is not suited for instances in which multiple/simultaneous writes to the same file are required.
None of the mentioned above
Webpages
All of the mentioned above
Predicted variables
Descriptive variables
Prescriptive variables
All of the mentioned above
Virtualization layer
Storage layer
Abstract layer
None of the mentioned above
Linear Regression
Time series analysis and forecasting
Data Mining
All of the mentioned above
HBase
HDFS
Hive
Mapreduce
Reducer
Mapper
File system
All of these
Algorithms
Flowchart
System flow
None of the mentioned above
A statement that the researcher wishes to put to the test using the information gathered during a study.
A research question that will be answered as a result of the findings.
A theory that serves as the foundation for the research.
the application of statistics to determine the extent to which the outcomes could have been caused by chance
Cloud computing
Security management
Integrated approach
None of the mentioned above
Robust and Scalable
Affordable and Cost Effective
Adaptive and Flexible
All of the mentioned above
Query
Statement
Function
None of the mentioned above
Data ingestion and data mining
Data warehouse and data storage
Data aggregation and data mining
All of the mentioned above
Parallel data processing
Single channel processing
Multi data processing
None of the mentioned above
SQL
NoSQL
PL / SQL
None of the mentioned above
Schema
Table
Both A and B
None of the mentioned above