0% found this document useful (0 votes)

20 views5 pages

Programming Questions

Uploaded by

Hari Sree. M

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views5 pages

Programming Questions

Uploaded by

Hari Sree. M

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

For the Amazon ML Summer School assessment, the programming questions will likely focus on

implementing and understanding key machine learning concepts. Here are some example questions
and tasks you might encounter:

Example Programming Questions

Linear Regression Implementation

Task: Implement a simple linear regression model from scratch without using any machine learning
libraries.

Input: A dataset with input features and target values.

Output: Predicted target values for a test dataset.

Example:

python

Copy code

import numpy as np

def linear_regression(X, y):

# Adding a column of ones for the intercept term

X = np.hstack((np.ones((X.shape[0], 1)), X))

# Calculating weights using the Normal Equation

weights = np.linalg.inv(X.T @ X) @ X.T @ y

return weights

def predict(X, weights):

X = np.hstack((np.ones((X.shape[0], 1)), X))

return X @ weights

# Example usage

X_train = np.array([[1, 2], [2, 3], [3, 4]])

y_train = np.array([2, 3, 4])

weights = linear_regression(X_train, y_train)

X_test = np.array([[4, 5], [5, 6]])

predictions = predict(X_test, weights)

print(predictions)

Decision Tree Classifier

Task: Implement a decision tree classifier for a given dataset.

Input: A dataset with features and labels.

Output: Predictions for a test dataset.

Example:

python

Copy code

from sklearn.tree import DecisionTreeClassifier

def decision_tree_classifier(X_train, y_train, X_test):

clf = DecisionTreeClassifier()

clf.fit(X_train, y_train)

return clf.predict(X_test)

# Example usage

X_train = [[0, 0], [1, 1], [0, 1], [1, 0]]

y_train = [0, 1, 1, 0]

X_test = [[0, 0], [1, 1]]

predictions = decision_tree_classifier(X_train, y_train, X_test)

print(predictions)

K-means Clustering

Task: Implement the K-means clustering algorithm.

Input: A dataset and the number of clusters (K).

Output: Cluster assignments for each data point.

Example:

python

Copy code
import numpy as np

def kmeans(X, k, max_iters=100):

centroids = X[np.random.choice(X.shape[0], k, replace=False)]

for _ in range(max_iters):

clusters = [np.argmin([np.linalg.norm(x - centroid) for centroid in centroids]) for x in X]

new_centroids = [X[np.array(clusters) == i].mean(axis=0) for i in range(k)]

if np.all(centroids == new_centroids):

break

centroids = new_centroids

return clusters

# Example usage

X = np.array([[1, 2], [2, 3], [3, 4], [8, 9], [9, 10], [10, 11]])

clusters = kmeans(X, 2)

print(clusters)

Principal Component Analysis (PCA)

Task: Implement PCA for dimensionality reduction.

Input: A dataset and the number of principal components.

Output: Transformed dataset with reduced dimensions.

Example:

python

Copy code

import numpy as np

def pca(X, n_components):

X_mean = np.mean(X, axis=0)

X_centered = X - X_mean

covariance_matrix = np.cov(X_centered, rowvar=False)

eigenvalues, eigenvectors = np.linalg.eigh(covariance_matrix)

sorted_indices = np.argsort(eigenvalues)[::-1]

sorted_eigenvectors = eigenvectors[:, sorted_indices]

principal_components = sorted_eigenvectors[:, :n_components]

return X_centered @ principal_components

# Example usage

X = np.array([[1, 2], [3, 4], [5, 6], [7, 8]])

X_pca = pca(X, 1)

print(X_pca)

Text Preprocessing for NLP

Task: Preprocess text data for further analysis.

Input: A list of text documents.

Output: Cleaned and tokenized text.

Example:

python

Copy code

import re

from nltk.tokenize import word_tokenize

from nltk.corpus import stopwords

from nltk.stem import PorterStemmer

def preprocess_text(texts):

stop_words = set(stopwords.words('english'))

ps = PorterStemmer()

processed_texts = []

for text in texts:

text = re.sub(r'\W', ' ', text)

text = re.sub(r'\s+', ' ', text)

tokens = word_tokenize(text)

tokens = [ps.stem(word) for word in tokens if word not in stop_words]

processed_texts.append(tokens)

return processed_texts

# Example usage

texts = ["Hello, world! This is a test.", "Preprocessing text is important."]

processed_texts = preprocess_text(texts)

print(processed_texts)

Preparation Tips

Understand the Fundamentals: Make sure you have a solid understanding of the basic algorithms
and concepts in machine learning.

Practice Coding: Implement algorithms from scratch to deepen your understanding. Use libraries like
NumPy and pandas for data manipulation.

Work on Real Datasets: Use platforms like Kaggle to practice with real-world datasets and improve
your data preprocessing and model training skills.

Review Python Libraries: Familiarize yourself with machine learning libraries such as scikit-learn,
TensorFlow, and PyTorch.

By practicing these types of questions and understanding the underlying concepts, you'll be well-
prepared for the assessment. Good luck!

01 134192 066 9559671601 28052022 103753pm
No ratings yet
01 134192 066 9559671601 28052022 103753pm
1 page
ML Lab
No ratings yet
ML Lab
7 pages
ML Lab Manual Completed
No ratings yet
ML Lab Manual Completed
56 pages
Important Questions
No ratings yet
Important Questions
4 pages
Iml Practical Assignment
No ratings yet
Iml Practical Assignment
22 pages
ML Practical 205160694034
No ratings yet
ML Practical 205160694034
33 pages
DNN ALL Practical 28
No ratings yet
DNN ALL Practical 28
34 pages
AI Manual
No ratings yet
AI Manual
69 pages
Numpy Module
No ratings yet
Numpy Module
10 pages
ML Lab Manual
No ratings yet
ML Lab Manual
90 pages
ML File Syllabus
No ratings yet
ML File Syllabus
43 pages
Artificial Intellegence Lab Practical
No ratings yet
Artificial Intellegence Lab Practical
48 pages
20AI16 - ML Record
No ratings yet
20AI16 - ML Record
24 pages
Skill
No ratings yet
Skill
42 pages
ML Record
No ratings yet
ML Record
19 pages
Machine Learning Engineer Interview Preparation Guide
No ratings yet
Machine Learning Engineer Interview Preparation Guide
14 pages
UNIT2
No ratings yet
UNIT2
20 pages
HW46
No ratings yet
HW46
5 pages
Tushar ML
No ratings yet
Tushar ML
52 pages
IML Lab Manual
No ratings yet
IML Lab Manual
31 pages
Dhaapps Datascience With Gen AI-1
No ratings yet
Dhaapps Datascience With Gen AI-1
23 pages
Module 5.pptx - 20250608 - 201231 - 0000
No ratings yet
Module 5.pptx - 20250608 - 201231 - 0000
43 pages
Assignment 2
No ratings yet
Assignment 2
3 pages
ML Notion 1
No ratings yet
ML Notion 1
18 pages
DM Practical File
No ratings yet
DM Practical File
21 pages
ML Record Print
No ratings yet
ML Record Print
20 pages
Machine L-Lab-Manual
No ratings yet
Machine L-Lab-Manual
90 pages
ML File
No ratings yet
ML File
17 pages
AI Manual
No ratings yet
AI Manual
36 pages
Data Mining & Machine Learning Courseoutline
No ratings yet
Data Mining & Machine Learning Courseoutline
7 pages
ML LAB
No ratings yet
ML LAB
29 pages
Foundations of Python for AI
No ratings yet
Foundations of Python for AI
67 pages
Int 10
No ratings yet
Int 10
21 pages
ML Viva Practice (Answers)
No ratings yet
ML Viva Practice (Answers)
4 pages
Capstone Project - Jaro-Prof. Babji
No ratings yet
Capstone Project - Jaro-Prof. Babji
5 pages
Silver Oak College of Computer Application: Subject:Machine Learning
No ratings yet
Silver Oak College of Computer Application: Subject:Machine Learning
15 pages
To Study About Numpy, Pandas and Matplotlib Libraries in Python
No ratings yet
To Study About Numpy, Pandas and Matplotlib Libraries in Python
21 pages
It, Hardware Exp1
No ratings yet
It, Hardware Exp1
10 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
33 pages
Project - Machine Learning-Business Report: By: K Ravi Kumar PGP-Data Science and Business Analytics (PGPDSBA.O.MAR23.A)
No ratings yet
Project - Machine Learning-Business Report: By: K Ravi Kumar PGP-Data Science and Business Analytics (PGPDSBA.O.MAR23.A)
38 pages
Data Science Lab Exp Lis
No ratings yet
Data Science Lab Exp Lis
72 pages
ML 4 To 9 Keyur
No ratings yet
ML 4 To 9 Keyur
21 pages
Syl3 ML
No ratings yet
Syl3 ML
5 pages
Shobit Sharma (2124399) ML Lab File PDF
No ratings yet
Shobit Sharma (2124399) ML Lab File PDF
19 pages
AI&ML Lab Report
No ratings yet
AI&ML Lab Report
19 pages
ML Lab
No ratings yet
ML Lab
23 pages
Gradient Ascent
No ratings yet
Gradient Ascent
27 pages
24CSPC212-PIC Lab Manual
No ratings yet
24CSPC212-PIC Lab Manual
45 pages
ML Que
No ratings yet
ML Que
14 pages
Lab Module 1 - End To End ML Project
No ratings yet
Lab Module 1 - End To End ML Project
2 pages
Efficient Python Tricks and Tools For Data Scientists - by Khuyen Tran
No ratings yet
Efficient Python Tricks and Tools For Data Scientists - by Khuyen Tran
20 pages
Approaching (Almost) Any Machine Learning Problem - Abhishek Thakur - No Free Hunch
No ratings yet
Approaching (Almost) Any Machine Learning Problem - Abhishek Thakur - No Free Hunch
22 pages
Lab Manual-MLT
No ratings yet
Lab Manual-MLT
42 pages
Original ML Lab Manual
No ratings yet
Original ML Lab Manual
22 pages
178 DL
No ratings yet
178 DL
31 pages
AAIC Syllabus
No ratings yet
AAIC Syllabus
19 pages
Practical File of AI and ML
No ratings yet
Practical File of AI and ML
26 pages
List of Imported Libraries
No ratings yet
List of Imported Libraries
12 pages
B. Com. Hons Cbcs Syllabus
No ratings yet
B. Com. Hons Cbcs Syllabus
64 pages
P01 Arima
No ratings yet
P01 Arima
68 pages
RLB Contoh
No ratings yet
RLB Contoh
13 pages
PGCBA 6 Brochure
No ratings yet
PGCBA 6 Brochure
12 pages
Ebooks File Sex Differences in Labor Markets Routledge Research in Gender and Society 10 1st Edition David Neumark All Chapters
100% (10)
Ebooks File Sex Differences in Labor Markets Routledge Research in Gender and Society 10 1st Edition David Neumark All Chapters
75 pages
Charleston County Value Added 2014 15
No ratings yet
Charleston County Value Added 2014 15
45 pages
Logistics Management - Chapter 5 PPT NFJnK1J2IS
No ratings yet
Logistics Management - Chapter 5 PPT NFJnK1J2IS
50 pages
An Approach of Pig Weight Estimation Using Binocular Stereo System Based On LabVIEW
No ratings yet
An Approach of Pig Weight Estimation Using Binocular Stereo System Based On LabVIEW
7 pages
Multivariate Analysis
No ratings yet
Multivariate Analysis
7 pages
Econometrics I CH 3 MLR
No ratings yet
Econometrics I CH 3 MLR
30 pages
Chapter 3: Quantitative Demand Analysis Answers To Questions and Problems
No ratings yet
Chapter 3: Quantitative Demand Analysis Answers To Questions and Problems
14 pages
Zhao Liden 2011 JAPinternshiprecruitmentimpressionmanagement
No ratings yet
Zhao Liden 2011 JAPinternshiprecruitmentimpressionmanagement
10 pages
Jaya Sakthi Engineering College: (Approved by AICTE, New Delhi Affiliated To Anna University, Chennai.)
No ratings yet
Jaya Sakthi Engineering College: (Approved by AICTE, New Delhi Affiliated To Anna University, Chennai.)
194 pages
Different Regression Problems Stasticstiacl
No ratings yet
Different Regression Problems Stasticstiacl
11 pages
Book 2
No ratings yet
Book 2
15 pages
CH 01 PDF
No ratings yet
CH 01 PDF
28 pages
The Coefficient of Determination R-Squared Is More Informative Than SMAPE, MAE, MAPE, MSE, and RMSE in Regression Analysis Evaluation
No ratings yet
The Coefficient of Determination R-Squared Is More Informative Than SMAPE, MAE, MAPE, MSE, and RMSE in Regression Analysis Evaluation
28 pages
Logit and Probit Models
50% (2)
Logit and Probit Models
11 pages
Journal Reading II - Dr. Yuda Lutfiadi
No ratings yet
Journal Reading II - Dr. Yuda Lutfiadi
13 pages
Sample of Thesis About Student Satisfaction
100% (3)
Sample of Thesis About Student Satisfaction
5 pages
Thesis Outline - Impact of Capital Structure On Firm Value
No ratings yet
Thesis Outline - Impact of Capital Structure On Firm Value
6 pages
Josh Rombach Case 2
No ratings yet
Josh Rombach Case 2
5 pages
New Rap Mist Paper
No ratings yet
New Rap Mist Paper
28 pages
Estimation and Inference of Heterogeneous Treatment Effects Using Random Forests
No ratings yet
Estimation and Inference of Heterogeneous Treatment Effects Using Random Forests
16 pages
Bus Stop-Environmental Connection: Do Characteristics of The Built Environment Correlate With Bus Stop Crime?
No ratings yet
Bus Stop-Environmental Connection: Do Characteristics of The Built Environment Correlate With Bus Stop Crime?
13 pages
9 AIML Question Bank Updated 5 Units
No ratings yet
9 AIML Question Bank Updated 5 Units
21 pages
Multivariate Data Analysis in Pharmaceutics-A Tutorial Review PDF
No ratings yet
Multivariate Data Analysis in Pharmaceutics-A Tutorial Review PDF
11 pages
Regression Analysis and Linear Models Concepts Applications and Implementation 1st Edition Richard B. Darlington PHD Download
No ratings yet
Regression Analysis and Linear Models Concepts Applications and Implementation 1st Edition Richard B. Darlington PHD Download
52 pages
The Influence of Accounting Software in Achieving The Internationalaccounting Standard Boards Qualitative Characteristics of Financial Information
No ratings yet
The Influence of Accounting Software in Achieving The Internationalaccounting Standard Boards Qualitative Characteristics of Financial Information
13 pages
Group 4 CHM 812 Assgn.
No ratings yet
Group 4 CHM 812 Assgn.
7 pages

Programming Questions

Uploaded by

Programming Questions

Uploaded by

For the Amazon ML Summer School assessment, the programming questions will likely focus on

Example Programming Questions

Linear Regression Implementation

Input: A dataset with input features and target values.

Output: Predicted target values for a test dataset.

def linear_regression(X, y):

# Adding a column of ones for the intercept term

X = np.hstack((np.ones((X.shape[0], 1)), X))

# Calculating weights using the Normal Equation

weights = np.linalg.inv(X.T @ X) @ X.T @ y

def predict(X, weights):

X = np.hstack((np.ones((X.shape[0], 1)), X))

X_train = np.array([[1, 2], [2, 3], [3, 4]])

y_train = np.array([2, 3, 4])

weights = linear_regression(X_train, y_train)

X_test = np.array([[4, 5], [5, 6]])

Decision Tree Classifier

Task: Implement a decision tree classifier for a given dataset.

Input: A dataset with features and labels.

Output: Predictions for a test dataset.

from sklearn.tree import DecisionTreeClassifier

def decision_tree_classifier(X_train, y_train, X_test):

X_train = [[0, 0], [1, 1], [0, 1], [1, 0]]

X_test = [[0, 0], [1, 1]]

predictions = decision_tree_classifier(X_train, y_train, X_test)

Task: Implement the K-means clustering algorithm.

Input: A dataset and the number of clusters (K).

Output: Cluster assignments for each data point.

def kmeans(X, k, max_iters=100):

centroids = X[np.random.choice(X.shape[0], k, replace=False)]

clusters = [np.argmin([np.linalg.norm(x - centroid) for centroid in centroids]) for x in X]

new_centroids = [X[np.array(clusters) == i].mean(axis=0) for i in range(k)]

Principal Component Analysis (PCA)

Task: Implement PCA for dimensionality reduction.

Input: A dataset and the number of principal components.

Output: Transformed dataset with reduced dimensions.

def pca(X, n_components):

X_mean = np.mean(X, axis=0)

covariance_matrix = np.cov(X_centered, rowvar=False)

eigenvalues, eigenvectors = np.linalg.eigh(covariance_matrix)

sorted_eigenvectors = eigenvectors[:, sorted_indices]

principal_components = sorted_eigenvectors[:, :n_components]

return X_centered @ principal_components

X = np.array([[1, 2], [3, 4], [5, 6], [7, 8]])

Text Preprocessing for NLP

Task: Preprocess text data for further analysis.

Input: A list of text documents.

Output: Cleaned and tokenized text.

from nltk.tokenize import word_tokenize

from nltk.corpus import stopwords

from nltk.stem import PorterStemmer

for text in texts:

text = re.sub(r'\W', ' ', text)

text = re.sub(r'\s+', ' ', text)

tokens = [ps.stem(word) for word in tokens if word not in stop_words]

texts = ["Hello, world! This is a test.", "Preprocessing text is important."]

You might also like