Input data
Output data
Process data
All of the mentioned above
A. Input data
Algorithms
Flowchart
System flow
None of the mentioned above
A learning model
Uses observations
Develop conclusions
All of the mentioned above
Python
C++
R
Java
Show functions
Describe function
Both A and B
None of the mentioned above
Predicted variables
Descriptive variables
Prescriptive variables
All of the mentioned above
Robust and Scalable
Affordable and Cost Effective
Adaptive and Flexible
All of the mentioned above
Accurate
Inconsistence
Variant
None of the mentioned above
Data
Knowledge
Program
Algorithm
Apache Hadoop
Apache Spark
Apache Kafka
Apache Pytarch
Meet compliance requirements
Protect the privacy
Both A and B
None of the mentioned above
HDFS Shell
DFS Shell
K Shell
FS Shell
To visualize the data
To access the data
To process the data
All of the mentioned above
Data ingestion
Data processing
Data analysis
All of the mentioned above
Query
Statement
Function
None of the mentioned above
Business needs
Facilitates cloud computing and
Hypervisor
Virtualization
Includes multiple formats and types of data
Includes structured data in the form of financial transactions,
Includes semi-structured data in the form of emails and unstructured data in the form of images
All of the mentioned above
Exhausting valuable resources on housing data that does not inform business decisions
Spending time sifting through unutilized data sets
Missing out on unique revenue streams and insights
All of the mentioned above
Indexing and Cataloging, Replication
File Storage and Structure, Query Processing
Transactions Support
All of the mentioned above
Discrete variable
Quantitative variable
Qualitative variable
Superlative variable
Oozie
Zookepper
MapReduce
All of the mentioned above
Maptask
Task execution
Mapper
All of the mentioned above
High availability through built-in replication and failover
Management tooling for automation, monitoring, and backup
Fully elastic database as a service with built-in best practices
All of the mentioned above
function that fetches one or more columns from a row as arguments
It returns a single value
Both A and B
None of the mentioned above
Structured format
Unstructured format
Semi-structured format
None of the mentioned above
Webpages
All of the mentioned above
Virtual representation of a physical computer
Virtual representation of a logical computer
Virtual System Integration
All of the mentioned above
Quickly and efficiently
Systematic approach
Both A and B
All of the mentioned above
Account data
Historical data
Financial data
None of the mentioned above
TYPE-1 Hypervisor
TYPE- 2 Hypervisor
Both A and B
None of the mentioned above
t-test
mean
standard deviation
range