Indexing and Cataloging, Replication
File Storage and Structure, Query Processing
Transactions Support
All of the mentioned above
D. All of the mentioned above
Big data
Data science
Data integration
None of the mentioned above
MapReduce, Hive and HBase
Hive, Spark and HBase
Spark, Hive and ZooKeeper
Spark, HBase and Hive
To visualize the data
To access the data
To process the data
All of the mentioned above
Physical machine
Abstract machine
System integration
All of the mentioned above
HDFS Shell
DFS Shell
K Shell
FS Shell
Python
C++
R
Java
Operational data source
Qualitative data source
Both A and B
None of the mentioned above
t-test
mean
standard deviation
range
Parallel data processing
Single channel processing
Multi data processing
None of the mentioned above
Internally managed data
Data feeds from external sources.
It provides access to each and every layer & components of big data stack
All of the mentioned above
Large-scale
Cloud-based
Data warehousing
All of the mentioned above
Algorithmic techniques
Modeling techniques
System development and design techniques
None of the mentioned above
Quality or fidelity of data
Large size of the data that cannot be process
Small size of the data that can easily process
All of the mentioned above
Processing of data
User friendly representation
Both A and B
None of the mentioned above
Virtual representation of a physical computer
Virtual representation of a logical computer
Virtual System Integration
All of the mentioned above
Robust and Scalable
Affordable and Cost Effective
Adaptive and Flexible
All of the mentioned above
Input data
Output data
Process data
All of the mentioned above
High availability through built-in replication and failover
Management tooling for automation, monitoring, and backup
Fully elastic database as a service with built-in best practices
All of the mentioned above
A statement that the researcher wishes to put to the test using the information gathered during a study.
A research question that will be answered as a result of the findings.
A theory that serves as the foundation for the research.
the application of statistics to determine the extent to which the outcomes could have been caused by chance
Maptask
Task execution
Mapper
All of the mentioned above
Reducer
Mapper
File system
All of these
Includes multiple formats and types of data
Includes structured data in the form of financial transactions,
Includes semi-structured data in the form of emails and unstructured data in the form of images
All of the mentioned above
Primary Index
Secondary Index
Complex Index
None of the mentioned above
Meet compliance requirements
Protect the privacy
Both A and B
None of the mentioned above
sqoop-list-databases
Hbase-list
hive schema
sqoop-list-columns
Structured Data
Unstructured Data
Semi-structured Data
None of the mentioned above
Default location of Hadoop configuration is in $HADOOP /conf/ HOME
If $HADOOP HOME is specified, Sqoop will utilise the default installation location
default location of Hadoop configuration is in $HADOOP HOME/conf/
Sqoop command-line tool serves as a wrapper for the bin/hadoop script that is included with Hadoop as a base.
DBMS
NoSQL
Data store
None of the mentioned above
2
3
4
5
A learning model
Uses observations
Develop conclusions
All of the mentioned above