Primary data
Secondary data
Quantitative data
Qualitative data
B. Secondary data
Worth in information
Useless data
Useless information
None of the mentioned above
t-test
mean
standard deviation
range
Business organizations
System development
Employees development
All of the mentioned above
TYPE-1 Hypervisor
TYPE- 2 Hypervisor
Both A and B
None of the mentioned above
Quality or fidelity of data
Large size of the data that cannot be process
Small size of the data that can easily process
All of the mentioned above
Cloud-based
Data warehouse
System ingestion
All of the mentioned above
SQL
NoSQL
PL / SQL
None of the mentioned above
Warehouse
Map
Reduce
None of the mentioned above
SQL
DBMS
NoSQL
RDBMS
Tera
Giga
Peta
Meta
Default location of Hadoop configuration is in $HADOOP /conf/ HOME
If $HADOOP HOME is specified, Sqoop will utilise the default installation location
default location of Hadoop configuration is in $HADOOP HOME/conf/
Sqoop command-line tool serves as a wrapper for the bin/hadoop script that is included with Hadoop as a base.
Includes multiple formats and types of data
Includes structured data in the form of financial transactions,
Includes semi-structured data in the form of emails and unstructured data in the form of images
All of the mentioned above
Exhausting valuable resources on housing data that does not inform business decisions
Spending time sifting through unutilized data sets
Missing out on unique revenue streams and insights
All of the mentioned above
Reducer
Mapper
File system
All of these
Big data
Data science
Data integration
None of the mentioned above
Large-scale
Cloud-based
Data warehousing
All of the mentioned above
Hive
HBase
Analysis and reporting
All of the mentioned above
Analyzes data
Data storage
Data ingestion
None of the mentioned above
Data can arrive at fast speed
Enormous datasets can accumulate within very short periods of time
Velocity of data translates into the amount of time it takes for the data to be processed
All of the mentioned above
HBase
HDFS
Hive
Mapreduce
High availability through built-in replication and failover
Management tooling for automation, monitoring, and backup
Fully elastic database as a service with built-in best practices
All of the mentioned above
A statement that the researcher wishes to put to the test using the information gathered during a study.
A research question that will be answered as a result of the findings.
A theory that serves as the foundation for the research.
the application of statistics to determine the extent to which the outcomes could have been caused by chance
Healthcare,
Education and telecom
Telecom
All of the mentioned above
Cloud computing
Power BI
System development
None of the mentioned above
Public Cloud
Private Cloud
Hybrid Cloud
All of the mentioned above
Predicted variables
Descriptive variables
Prescriptive variables
All of the mentioned above
HDFS file system is well suited for storing data associated with applications that require low latency data access.
HDFS is well-suited for storing data connected to applications that require low-latency data access to be performed.
HDFS is not suited for instances in which multiple/simultaneous writes to the same file are required.
None of the mentioned above
Maptask
Task execution
Mapper
All of the mentioned above
Import data → Clean the data → Develop a predictive model → Integrate the model
Clean the data → Develop a predictive model → Import data → Integrate the model
Clean the data → Develop a predictive model → Import data → Integrate the model
None of the mentioned above
Big Data ingestion pipeline is divided into different layers
Each layer performs a particular function
Both A and B
None of the mentioned above