Warehouse
Map
Reduce
None of the mentioned above
A. Warehouse
HDFS file system is well suited for storing data associated with applications that require low latency data access.
HDFS is well-suited for storing data connected to applications that require low-latency data access to be performed.
HDFS is not suited for instances in which multiple/simultaneous writes to the same file are required.
None of the mentioned above
Virtual representation of a physical computer
Virtual representation of a logical computer
Virtual System Integration
All of the mentioned above
Apache Hadoop
Apache Spark
Apache Kafka
Apache Pytarch
Oozie
Zookepper
MapReduce
All of the mentioned above
C#
C
Java
None of the mentioned above
Accurate
Inconsistence
Variant
None of the mentioned above
Maptask
Task execution
Mapper
All of the mentioned above
Intrinsic
Extrinsic
Both A and B
None of the mentioned above
A tool used in predictive analytics
A process that uses data mining and statistics to develop models
Examine current and historical datasets for underlying patterns
All of the mentioned above
Structured format
Unstructured format
Semi-structured format
None of the mentioned above
Cost Reduction
Time Reductions
Smarter Business Decisions
All of the mentioned above
Process of attaching meaning to the data
Convert text into insightful information
Effective conclusion
All of the mentioned above
Schema
Table
Both A and B
None of the mentioned above
Business organizations
System development
Employees development
All of the mentioned above
Tera
Giga
Peta
Meta
2
3
4
5
Traffic and Engagement Reports
Financial Statement Analysis
Demand Trends and Aggregated Survey Results
All of the mentioned above
Virtualization layer
Storage layer
Abstract layer
None of the mentioned above
Import data → Clean the data → Develop a predictive model → Integrate the model
Clean the data → Develop a predictive model → Import data → Integrate the model
Clean the data → Develop a predictive model → Import data → Integrate the model
None of the mentioned above
Volume
Veracity
Variety
None of the mentioned above
HBase
HDFS
Hive
Mapreduce
Primary data
Secondary data
Quantitative data
Qualitative data
Input data
Output data
Process data
All of the mentioned above
Default location of Hadoop configuration is in $HADOOP /conf/ HOME
If $HADOOP HOME is specified, Sqoop will utilise the default installation location
default location of Hadoop configuration is in $HADOOP HOME/conf/
Sqoop command-line tool serves as a wrapper for the bin/hadoop script that is included with Hadoop as a base.
Algorithms
Flowchart
System flow
None of the mentioned above
TYPE-1 Hypervisor
TYPE- 2 Hypervisor
Both A and B
None of the mentioned above
TaskReduce
Mapreduce
TaskTracker
JobTracker
t-test
mean
standard deviation
range
Algorithmic techniques
Modeling techniques
System development and design techniques
None of the mentioned above
Hive is a relational database that supports SQL queries.
Pig is a relational database that supports SQL queries.
Both A and B
None of the mentioned above