Operating system
Resource sharing
System integration
None of the mentioned above
B. Resource sharing
HBase
HDFS
Hive
Mapreduce
Big Data ingestion pipeline is divided into different layers
Each layer performs a particular function
Both A and B
None of the mentioned above
Master-slave architecture
Master-worker architecture
Worker-slave architecture
All of the mentioned above
Heterogeneous
Storage
Network
None of the mentioned above
Cloud computing
Power BI
System development
None of the mentioned above
DBMS
NoSQL
Data store
None of the mentioned above
HDFS file system is well suited for storing data associated with applications that require low latency data access.
HDFS is well-suited for storing data connected to applications that require low-latency data access to be performed.
HDFS is not suited for instances in which multiple/simultaneous writes to the same file are required.
None of the mentioned above
Cloud computing
Security management
Integrated approach
None of the mentioned above
Account data
Historical data
Financial data
None of the mentioned above
function that fetches one or more columns from a row as arguments
It returns a single value
Both A and B
None of the mentioned above
Structured Data
Unstructured Data
Semi-structured Data
None of the mentioned above
Linear Regression
Time series analysis and forecasting
Data Mining
All of the mentioned above
Intrinsic
Extrinsic
Both A and B
None of the mentioned above
Exhausting valuable resources on housing data that does not inform business decisions
Spending time sifting through unutilized data sets
Missing out on unique revenue streams and insights
All of the mentioned above
Architectural layer
Fundamental layer
Backpropogation layer
None of the mentioned above
Transportation of data from the ingestion layer to the rest of the data pipeline
Data storage
Data identification
None of the mentioned above
Operational data source
Qualitative data source
Both A and B
None of the mentioned above
Primary Index
Secondary Index
Complex Index
None of the mentioned above
Distributed computing
Cluster computing
Parallel computing
All of the mentioned above
Redundant physical infrastructure
Integrated System
Integrated Database
All of the mentioned above
Mapreduce
Spark
Hive
All of the mentioned above
Java
C#
C
C++
Business needs
Facilitates cloud computing and
Hypervisor
Virtualization
Algorithmic techniques
Modeling techniques
System development and design techniques
None of the mentioned above
Business organizations
System development
Employees development
All of the mentioned above
t-test
mean
standard deviation
range
Structured data
Unstructured data
Semi-structured data
All of the mentioned above
To visualize the data
To access the data
To process the data
All of the mentioned above
A tool used in predictive analytics
A process that uses data mining and statistics to develop models
Examine current and historical datasets for underlying patterns
All of the mentioned above
Goes beyond the massive volumes
Increasing velocities of data
Both A and B
None of the mentioned above