Hive is a relational database that supports SQL queries.
Pig is a relational database that supports SQL queries.
Both A and B
None of the mentioned above
C. Both A and B
Business organizations
System development
Employees development
All of the mentioned above
Data ingestion and data mining
Data warehouse and data storage
Data aggregation and data mining
All of the mentioned above
Linear Regression
Time series analysis and forecasting
Data Mining
All of the mentioned above
Students roll number, age
Videos
Audio files
Both B and C
Default location of Hadoop configuration is in $HADOOP /conf/ HOME
If $HADOOP HOME is specified, Sqoop will utilise the default installation location
default location of Hadoop configuration is in $HADOOP HOME/conf/
Sqoop command-line tool serves as a wrapper for the bin/hadoop script that is included with Hadoop as a base.
Big Data ingestion pipeline is divided into different layers
Each layer performs a particular function
Both A and B
None of the mentioned above
Cluster
Data Node
Master Node
None of the mentioned above
Big data
Data science
Data integration
None of the mentioned above
Intrinsic
Extrinsic
Both A and B
None of the mentioned above
Warehouse
Map
Reduce
None of the mentioned above
Python
C++
R
Java
Internally managed data
Data feeds from external sources.
It provides access to each and every layer & components of big data stack
All of the mentioned above
High availability through built-in replication and failover
Management tooling for automation, monitoring, and backup
Fully elastic database as a service with built-in best practices
All of the mentioned above
Robust and Scalable
Affordable and Cost Effective
Adaptive and Flexible
All of the mentioned above
Data ingestion
Data processing
Data analysis
All of the mentioned above
HDFS file system is well suited for storing data associated with applications that require low latency data access.
HDFS is well-suited for storing data connected to applications that require low-latency data access to be performed.
HDFS is not suited for instances in which multiple/simultaneous writes to the same file are required.
None of the mentioned above
Primary data
Secondary data
Quantitative data
Qualitative data
Data
Knowledge
Program
Algorithm
Discrete variable
Quantitative variable
Qualitative variable
Superlative variable
C#
C
Java
None of the mentioned above
Parallel data processing
Single channel processing
Multi data processing
None of the mentioned above
Apache Hadoop
Apache Spark
Apache Kafka
Apache Pytarch
Structured Data
Unstructured Data
Semi-structured Data
None of the mentioned above
Indexing and Cataloging, Replication
File Storage and Structure, Query Processing
Transactions Support
All of the mentioned above
function that fetches one or more columns from a row as arguments
It returns a single value
Both A and B
None of the mentioned above
Cloud computing
Power BI
System development
None of the mentioned above
Traditional database
Operational database
Database Management System
None of the mentioned above
t-test
mean
standard deviation
range
2
3
4
5
TYPE-1 Hypervisor
TYPE- 2 Hypervisor
Both A and B
None of the mentioned above