Home
Current Affairs January 2024

What is the correct answer?

4

Amongst which of the following is / are the correct workflow of predictive analytics,

A. Import data → Clean the data → Develop a predictive model → Integrate the model

B. Clean the data → Develop a predictive model → Import data → Integrate the model

C. Clean the data → Develop a predictive model → Import data → Integrate the model

D. None of the mentioned above

Correct Answer :

A. Import data → Clean the data → Develop a predictive model → Integrate the model


Before to do predictive analytics, we should follow some sequence. This is like Import data from varied sources, such as web archives, databases, and spreadsheets, Clean the data by removing outliers and combining data sources, develop an accurate predictive model based on the aggregated data using statistics, curve fitting tools, or machine learning and integrate the model into a load forecasting system.

Related Questions

What is the correct answer?

4

A hypervisor is a technology responsible for ensuring ____ takes place in an orderly and repeatable way.

A. Operating system

B. Resource sharing

C. System integration

D. None of the mentioned above

What is the correct answer?

4

Prescriptive analytics makes the use of machine learning to help ____ to decide a course of action based on a computer program's predictions.

A. Business organizations

B. System development

C. Employees development

D. All of the mentioned above

What is the correct answer?

4

The MapReduce framework is responsible for processing one or more pieces of data and producing the output results as ______.

A. Maptask

B. Task execution

C. Mapper

D. All of the mentioned above

What is the correct answer?

4

Amongst which of the following is not aligns as a characteristic of HDFS?

A. HDFS file system is well suited for storing data associated with applications that require low latency data access.

B. HDFS is well-suited for storing data connected to applications that require low-latency data access to be performed.

C. HDFS is not suited for instances in which multiple/simultaneous writes to the same file are required.

D. None of the mentioned above

What is the correct answer?

4

Amongst which of the following represents the Use of Hadoop,

A. Robust and Scalable

B. Affordable and Cost Effective

C. Adaptive and Flexible

D. All of the mentioned above

What is the correct answer?

4

Amongst which of the following is / are true to run MongoDB?

A. High availability through built-in replication and failover

B. Management tooling for automation, monitoring, and backup

C. Fully elastic database as a service with built-in best practices

D. All of the mentioned above

What is the correct answer?

4

Predictive analytics relies on capturing relationships between explanatory variables and the _____.

A. Predicted variables

B. Descriptive variables

C. Prescriptive variables

D. All of the mentioned above

What is the correct answer?

4

Amongst which of the following shows an example of unstructured data,

A. Students roll number, age

B. Videos

C. Audio files

D. Both B and C

What is the correct answer?

4

MongoDB support cross platform and is written in _____ language.

A. Python

B. C++

C. R

D. Java

What is the correct answer?

4

Data Visualization Layer,

A. To visualize the data

B. To access the data

C. To process the data

D. All of the mentioned above

What is the correct answer?

4

______ involves the simultaneous execution of multiple sub-tasks that collectively comprise a larger task.

A. Parallel data processing

B. Single channel processing

C. Multi data processing

D. None of the mentioned above

What is the correct answer?

4

MongoDB is a ____ database.

A. SQL

B. DBMS

C. NoSQL

D. RDBMS

What is the correct answer?

4

HDFS stores data in a distributed manner, the data can be processed in parallel on a _____ of nodes.

A. Cluster

B. Data Node

C. Master Node

D. None of the mentioned above

What is the correct answer?

4

Operational Database with distributed systems and ___ based system can harness the true potential with big data.

A. SQL

B. NoSQL

C. PL / SQL

D. None of the mentioned above

What is the correct answer?

4

HDFS operates in a ____ manner.

A. Master-slave architecture

B. Master-worker architecture

C. Worker-slave architecture

D. All of the mentioned above

What is the correct answer?

4

_____ are two techniques used in descriptive analytics to discover historical data.

A. Data ingestion and data mining

B. Data warehouse and data storage

C. Data aggregation and data mining

D. All of the mentioned above

What is the correct answer?

4

In the layered architecture of Big Data Stack, Interfaces and feeds,

A. Internally managed data

B. Data feeds from external sources.

C. It provides access to each and every layer & components of big data stack

D. All of the mentioned above

What is the correct answer?

4

______ node serves as the Slave and is responsible for carrying out the Tasks that have been assigned to it by the JobTracker.

A. TaskReduce

B. Mapreduce

C. TaskTracker

D. JobTracker

What is the correct answer?

4

Amongst which of the following is / are the correct workflow of predictive analytics,

A. Import data → Clean the data → Develop a predictive model → Integrate the model

B. Clean the data → Develop a predictive model → Import data → Integrate the model

C. Clean the data → Develop a predictive model → Import data → Integrate the model

D. None of the mentioned above

What is the correct answer?

4

_____ is the supporting physical infrastructure is fundamental to the operation and scalability of big data architecture.

A. Redundant physical infrastructure

B. Integrated System

C. Integrated Database

D. All of the mentioned above

What is the correct answer?

4

Hypervisors that run several virtual machines on one ____ resources also allow for more efficient utilization,

A. Physical machine

B. Abstract machine

C. System integration

D. All of the mentioned above

What is the correct answer?

4

To process large data sets quickly, big data architectures use.

A. Distributed computing

B. Cluster computing

C. Parallel computing

D. All of the mentioned above

What is the correct answer?

4

Predictive analytics is a process harnesses ____, often massive, data sets into models.

A. Heterogeneous

B. Storage

C. Network

D. None of the mentioned above

What is the correct answer?

4

In Big Data environments, Variety of data includes

A. Includes multiple formats and types of data

B. Includes structured data in the form of financial transactions,

C. Includes semi-structured data in the form of emails and unstructured data in the form of images

D. All of the mentioned above

What is the correct answer?

4

The Hadoop framework is built in Java, which means that MapReduce applications do not need to be written in _____.

A. C#

B. C

C. Java

D. None of the mentioned above

What is the correct answer?

4

Amongst which of the following is not a descriptive statistic?

A. t-test

B. mean

C. standard deviation

D. range

What is the correct answer?

4

Apache Hive is a data ______ infrastructure that is built on top of the Hadoop platform.

A. Warehouse

B. Map

C. Reduce

D. None of the mentioned above

What is the correct answer?

4

Volume, Velocity and Variety are _____ to Big Data,

A. Intrinsic

B. Extrinsic

C. Both A and B

D. None of the mentioned above

What is the correct answer?

4

Custom extensions built in the ____ programming language are also supported by Hive.

A. Java

B. C#

C. C

D. C++

What is the correct answer?

4

Amongst which of the following is / are true with reference to Decision trees,

A. A learning model

B. Uses observations

C. Develop conclusions

D. All of the mentioned above