UNIT 6: Machine Learning Algorithms

UNIT 5: Data Literacy – Data Collection to Data Analysis

September 13, 2024

UNIT 8: AI Ethics and Values

September 13, 2024

107

Machine Learning Algorithms

MCQs :

What is Machine Learning (ML)?
a) A process where machines develop emotions
b) A subset of AI that allows computers to learn from data
c) Programming computers with step-by-step instructions
d) A database management system

Answer: b

Which of the following is a type of Supervised Learning?
a) Clustering b) K-Means
c) Regression d) Reinforcement Learning

Answer: c

In Supervised Learning, data used for training is:
a) Labeled b) Unlabeled
c) Randomized d) Synthetic

Answer: a

What does Unsupervised Learning involve?
a) Learning from labeled data
b) Identifying patterns in unlabeled data
c) Using rewards to learn
d) Programming without data

Answer: b

Which of the following is an example of Reinforcement Learning?
a) Clustering customer data b) Spam filtering
c) Training a robot through trial and error d) Predicting house prices

Answer: c

What is the purpose of Regression in machine learning?
a) To classify data into categories b) To predict a continuous output c) To cluster data points d) To divide data into random groups

Answer: b

Which algorithm is typically used for Classification tasks?
a) K-Means b) Linear Regression c) k-Nearest Neighbors (KNN) d) Decision Tree Regression

Answer: c

Which of the following is an example of Supervised Learning?
a) k-Nearest Neighbors (KNN) b) K-Means Clustering
c) Principal Component Analysis (PCA) d) Self-driving cars learning
Answer: a
What is Clustering in machine learning?
a) Grouping labeled data b) Grouping unlabeled data based on similarities
c) Predicting continuous values d) Optimizing actions for rewards

Answer: b

What is the goal of Reinforcement Learning?
a) Predicting the output based on labeled data b) Grouping data into clusters
c) Learning through trial and error by maximizing rewards d) Reducing dimensionality of the dataset

Answer: c

In which type of learning does the machine learn from feedback through rewards or penalties?
a) Supervised Learning b) Unsupervised Learning
c) Reinforcement Learning d) Classification Learning

Answer: c

What does the K-Means algorithm do?
a) Classifies data based on the nearest neighbor
b) Clusters data into a predefined number of groups
‘c) Predicts a continuous output based on input variables
d) Makes decisions based on feedback loops

Answer: b

Which metric is used to measure the distance between points in K-Means clustering?
a) Manhattan Distance b) Euclidean Distance
c) Hamming Distance d) Chebyshev Distance

Answer: b

What is Pearson’s r used for?
a) Measuring the strength of the relationship between two categorical variables
b) Measuring the correlation between two continuous variables
c) Calculating the error in clustering
d) Evaluating classification accuracy

Answer: b

Which of the following is not a type of Machine Learning?
a) Supervised Learning b) Unsupervised Learning
c) Reinforcement Learning d) Sequential Learning

Answer: d

In which scenario is Regression used?
a) To classify spam or non-spam emails
b) To predict house prices based on square footage
c) To cluster customer segments
d) To train robots in a game-playing environment

Answer: b

Which algorithm is best for grouping customers based on similar purchasing behavior?
a) k-Nearest Neighbors b) K-Means Clustering
c) Linear Regression d) Q-Learning

Answer: b

What is the key difference between Supervised and Unsupervised Learning?
a) Supervised learning uses unlabeled data, and unsupervised learning uses labeled data
b) Supervised learning uses labeled data, and unsupervised learning uses unlabeled data
c) Both use trial and error to learn
d) There is no difference

Answer: b

Which type of data is required for Clustering?
a) Labeled data b) Unlabeled data
c) Numerical data d) Categorical data

Answer: b

What does the Linear Regression algorithm predict?
a) Discrete categories b) Grouping of data points
c) A continuous numerical value d) Text generation

Answer: c

Which of the following is a disadvantage of the K-Means algorithm?
a) Easy to implement b) Computationally efficient
c) Sensitive to outliers d) Effective with large datasets

Answer: c

In k-Nearest Neighbors, what does the ‘k’ represent?
a) The number of clusters b) The number of nearest neighbors to consider
c) The number of input features d) The number of output categories

Answer: b

Which of the following is an example of binary classification?
a) Predicting house prices b) Email spam detection (spam or not spam)
c) Grouping customers into segments d) Predicting weather patterns

Answer: b

What is the main use of Unsupervised Learning?
a) To learn from feedback and rewards
b) To make predictions on labeled data
c) To identify hidden patterns in unlabeled data
d) To classify data into binary categories

Answer: c

Which of the following is not a Supervised Learning algorithm?
a) Decision Tree b) Linear Regression
c) K-Means d) k-Nearest Neighbors

Answer: c

Which problem is addressed by Regression?
a) Predicting continuous values b) Predicting categories
c) Grouping data points d) Recognizing images

Answer: a

Which algorithm is used for image recognition and classification?
a) Regression b) K-Means Clustering
c) K-Nearest Neighbors d) Q-Learning

Answer: c

Which of the following represents a classification problem?
a) Predicting temperature for the next week
b) Predicting whether a patient has a disease or not
c) Predicting sales for a company
d) Predicting the clustering of data points

Answer: b

What is the primary objective of Clustering algorithms?
a) To predict the next word in a sentence
b) To classify new data points into predefined categories
c) To group data points based on their similarities
d) To reduce errors in labeled data

Answer: c

Which of the following is an example of a Reinforcement Learning algorithm?
a) Q-Learning b) Logistic Regression
c) k-Nearest Neighbors d) K-Means

Answer: a

What is the main purpose of Linear Regression?
a) To group data points into clusters
b) To classify data into discrete categories
c) To predict a continuous value based on input variables
d) To optimize reward-based actions
Answer: c
In K-Means Clustering, what happens after the centroids are updated?
a) The algorithm stops immediately
b) Each data point is reassigned to the closest centroid
c) The data points are discarded
d) The number of clusters is recalculated
Answer: b
What type of learning involves labeled data?
a) Unsupervised Learning
b) Supervised Learning
c) Reinforcement Learning
d) Clustering
Answer: b
Which of the following is not a type of Classification problem?
a) Binary Classification
b) Multi-Class Classification
c) Linear Regression
d) Multi-Label Classification
Answer: c
What does a clustering algorithm attempt to do?
a) Classify data into predefined categories
b) Minimize prediction error
c) Group similar data points together
d) Maximize reward in a learning environment
Answer: c
In Reinforcement Learning, what is used to guide the learning process?
a) Labeled data
b) Clusters
c) Rewards and penalties
d) Classification labels
Answer: c
What is the role of the decision boundary in Classification?
a) It separates different categories in the data
b) It calculates the regression line
c) It defines the number of clusters
d) It adjusts centroids in clustering
Answer: a
Which of the following is an example of Unsupervised Learning?
a) Predicting house prices
b) K-Means Clustering
c) Classifying emails as spam or not spam
d) Predicting customer churn
Answer: b
Which algorithm is sensitive to outliers?
a) Decision Tree
b) K-Means Clustering
c) Reinforcement Learning
d) Logistic Regression
Answer: b
What is the purpose of a reward function in Reinforcement Learning?
a) To predict the class label of a new data point
b) To maximize the accuracy of classification
c) To help the agent learn by providing feedback
d) To minimize the distance between clusters
Answer: c
Which metric is used to measure the linear relationship between two variables in Regression?
a) Pearson’s correlation coefficient
b) Mean Squared Error
c) Euclidean Distance
d) Accuracy
Answer: a
What is the key difference between Multi-Class and Multi-Label Classification?
a) Multi-Class allows only one class per instance, while Multi-Label allows multiple classes per instance
b) Multi-Class deals with continuous data, while Multi-Label handles discrete data
c) Multi-Class involves clustering, while Multi-Label involves regression
d) Multi-Class uses unsupervised learning, while Multi-Label uses reinforcement learning
Answer: a
What is the purpose of feature scaling in machine learning?
a) To improve the performance of linear regression
b) To normalize data for better performance in distance-based algorithms
c) To reduce the number of features in the dataset
d) To improve the clustering of data points
Answer: b
What does Q-Learning, a type of Reinforcement Learning algorithm, focus on?
a) Finding the regression line
b) Maximizing rewards through trial and error
c) Grouping data points into clusters
d) Predicting continuous values
Answer: b
In Classification, which of the following is a multi-class algorithm?
a) Linear Regression
b) k-Nearest Neighbors (KNN)
c) Logistic Regression
d) Q-Learning
Answer: b
What happens if the number of clusters (K) is chosen incorrectly in K-Means Clustering?
a) The algorithm will fail to complete
b) The resulting clusters may not reflect meaningful patterns in the data
c) The centroids will overlap
d) The model will stop learning
Answer: b
Which type of learning is primarily used for image recognition?
a) Reinforcement Learning
b) Supervised Learning
c) Unsupervised Learning
d) Clustering
Answer: b
In Regression analysis, what does the slope of the regression line represent?
a) The strength of the clustering
b) The rate of change in the dependent variable with respect to the independent variable
c) The distance between centroids in clustering
d) The optimal reward policy
Answer: b
What is the most significant limitation of the K-Nearest Neighbors algorithm?
a) It requires labeled data for training
b) It is difficult to implement
c) It does not work well with high-dimensional data
d) It cannot be used for regression tasks
Answer: c
Which of the following is a disadvantage of Linear Regression?
a) It can only be used for classification problems
b) It assumes a linear relationship between variables
c) It is computationally intensive
d) It works poorly with small datasets
Answer: b

ASSERTION-REASONING BASED QUESTIONS:

1. Assertion (A): In Supervised Learning, the model is trained using labeled data.

Reason (R): Supervised Learning algorithms find hidden patterns in the data without any prior knowledge of the output.