HW 1 in 2015

NYU NYU Nahuatl Nikki John Hugo hjiitdd minutes huuutf

Uploaded by

neethuvijay10

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views

HW 1 in 2015

NYU NYU Nahuatl Nikki John Hugo hjiitdd minutes huuutf

Uploaded by

neethuvijay10

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

MACHINE LEARNING COMS 4771, HOMEWORK 1

Assigned September 15, 2015. Due October 1, 2015 before 10:00am.

Submit your work via courseworks.columbia.edu.
Please submit separate files for a) write-up, b) Matlab source files and c) figures (if you choose to
include them separately from the writeup). Do not include any other files. Your write-up should
be in ASCII plain text format (.txt) or Postscript (.ps) or the Adobe Portable Document Format
(.pdf). Please do not submit Microsoft Office documents, LaTeX source code, or something more
exotic since we will not be able to read it. LaTeX is preferred and highly recommended, but it is not
mandatory. You can use any document editing software you wish, as long as the final product is in
.ps or .pdf. Even if you do not use LaTeX to prepare your document, you can use LaTeX notation
to mark up complicated mathematical expressions, for example, in comments in your Matlab code.
See the Tutorials page for more information on LaTeX. All your code should be written in Matlab.
Please submit all your source files, each function in a separate file. Clearly denote what each function
does, which problem it belongs to, and what the inputs and the outputs are. Do not resubmit code or
data provided to you. Do not submit code written by others. Identical submissions will be detected
and both parties will get zero credit. In general, shorter code is better. Sample code is available
on the Tutorials web page. Datasets are available from the Handouts web page. You may include
figures directly in your write-up or include them separately as .jpg, .gif, .ps or .eps files, and refer
to them by filename.

1 Problem 1 (10 points)

Cross-validation for Polynomial Fitting: Consider the problem of fitting a polynomial function.
Assume we wish to find a one dimensional function that takes a scalar input and outputs a scalar
f : R → R. The function has the form

f (x; θ) = θ0 + θ1 x + θ2 x2 + . . . + θd xd

where d is the degree of the polynomial. Develop code that finds the θ which minimizes the risk
N
1 X1
Remp (θ) = (yi − f (x; θ))2
N i=1 2

on a data-set. To help you get started, download the Matlab code in “polyreg.m” (on the tutorial
web page) to do polynomial curve fitting. Use your code on the dataset “problem1.mat”. This should
include a matrix x, corresponding to the scalar features {x1 , . . . , xN }, and a matrix y, corresponding
to the scalar labels {y1 , . . . , yN }. Fit a polynomial model to this data for various choices for d, the
degree of the polynomial.
Which value(s) of d seems somewhat more reasonable? Please justify your answer using some
empirical measure.
It is easy to overfit the data when using polynomial regression. As a result, use cross-validation
by randomly splitting the data-set into two halves to select the complexity of the model (in this
case, the degree of the polynomial). Include a plot showing the training and testing risk across
various choices of d, and plot your f (x; θ) overlaid on the data for the best choice of d according to
cross-validation.
2 Problem 2 (10 points)
Regularized risk minimization: Modify the Matlab code for “polyreg.m” such that it learns a multi-
variate regression function f : R100 → R, where the basis functions are of the form
k
X
f (x; θ) = θ i xi
i=1

The data-set is available in “problem2.mat”. As before, the x variable contains {x1 , . . . , xN } and the
y variable contains their scalar labels {y1 , . . . , yN }.
Use an l2 loss function to penalize the complexity of the model, e.g. minimize the risk
N
1 X1 λ
Rreg (θ) = (yi − f (x; θ))2 + kθk2
N i=1 2 2N

Use two-fold cross validation (as in Problem 1) to find the best value for λ. Include a plot showing
training and testing risk across various choices of λ. A reasonable range for this data set would be
from λ = 0 to λ = 1000. Also, mark the λ which minimizes the testing error on the data set.
What do you notice about the training and testing error?

3 Problem 3 (10 points)

Logistic Squashing Function. The logistic squashing function is given by g(z) = 1/(1 + exp(−z)).
Show that it satisfies the property g(−z) = 1 − g(z). Also show that its inverse is given by g −1 (y) =
ln(y/(1 − y).

4 Problem 4 (20 points)

Logistic Regression: Implement a linear logistic regression algorithm for binary classification in
Matlab using gradient descent. Your code should accept a dataset {(x1 , y1 ), . . . , (xN , yN )} where
xi ∈ Rd and yi ∈ {0, 1} and find a parameter vector θ ∈ Rd for the classification function
−1
f (x; θ) = 1 + exp(−θ > x)

which minimizes the empirical risk with logistic loss

N
1 X
Remp (θ) = (yi − 1) log(1 − f (xi ; θ)) − yi log(f (xi ; θ)).
N i=1

Since you are using gradient descent, you will have to specify the step size η and the tolerance .
Pick reasonable values for η and to then use your code to learn a classification function for the
dataset in “dataset4.mat”. Type “load dataset4” and you will have the variables X (input vectors)
and Y (binary labels) in your Matlab environment which contain the dataset.
Show any derivations you need to make for this algorithm.
Use the whole data set as training. Show with figures the resulting linear decision boundary on the
2D X data. Show the binary classification error and the empirical risk you obtained throughout
the run from random initialization until convergence. Note the number of iterations needed for your
choice of η and .

IQ, OQ & PQ. (Rev-1) - USP - Renata
100% (4)
IQ, OQ & PQ. (Rev-1) - USP - Renata
15 pages
Matlab3 HW 220718
0% (1)
Matlab3 HW 220718
2 pages
CIS 419/519 Introduction To Machine Learning Assignment 2: Instructions
No ratings yet
CIS 419/519 Introduction To Machine Learning Assignment 2: Instructions
12 pages
hw01s
No ratings yet
hw01s
10 pages
HW 1
No ratings yet
HW 1
3 pages
ML Coursera Python Assignments
100% (1)
ML Coursera Python Assignments
20 pages
Department of Electrical Engineering School of Science and Engineering EE514/CS535 Machine Learning Homework 1
No ratings yet
Department of Electrical Engineering School of Science and Engineering EE514/CS535 Machine Learning Homework 1
11 pages
Sample Exam For ML YSZ: Question 1 (Linear Regression)
No ratings yet
Sample Exam For ML YSZ: Question 1 (Linear Regression)
4 pages
Matlab Homework Experts 2
No ratings yet
Matlab Homework Experts 2
10 pages
Assignment #3_handout
No ratings yet
Assignment #3_handout
3 pages
Homework 1
No ratings yet
Homework 1
8 pages
Assignment 1
No ratings yet
Assignment 1
3 pages
Sample Exam For ML YSZ Sample For Machine Lerning - CMNKNVMNCS."NMD, MN, MVN, MDNV, MNDV MC, MDN, MDCNVM, NDV, M Ccwdmnbnbew, Mwbe
No ratings yet
Sample Exam For ML YSZ Sample For Machine Lerning - CMNKNVMNCS."NMD, MN, MVN, MDNV, MNDV MC, MDN, MDCNVM, NDV, M Ccwdmnbnbew, Mwbe
4 pages
hw1
No ratings yet
hw1
12 pages
HW 1
No ratings yet
HW 1
4 pages
Question 1 (Linear Regression)
No ratings yet
Question 1 (Linear Regression)
18 pages
ELEN4903 hw1 Spring2018
No ratings yet
ELEN4903 hw1 Spring2018
2 pages
HW 23 P 4 Rie
No ratings yet
HW 23 P 4 Rie
5 pages
Machine Learning Assignments
No ratings yet
Machine Learning Assignments
3 pages
Homework 4
No ratings yet
Homework 4
3 pages
assgmt1
No ratings yet
assgmt1
7 pages
Assignment 1
No ratings yet
Assignment 1
3 pages
AML Assignment 1 (1)
No ratings yet
AML Assignment 1 (1)
3 pages
HW 2
No ratings yet
HW 2
5 pages
244 Cheat Sheet
No ratings yet
244 Cheat Sheet
4 pages
exercise01
No ratings yet
exercise01
3 pages
hw1_2025
No ratings yet
hw1_2025
2 pages
Tutorial_Bayesian Regression and Classifier
No ratings yet
Tutorial_Bayesian Regression and Classifier
5 pages
ML File - Merged
No ratings yet
ML File - Merged
24 pages
Qs ML
No ratings yet
Qs ML
8 pages
Assign 1
No ratings yet
Assign 1
5 pages
Lab Manual 05
No ratings yet
Lab Manual 05
13 pages
Discussion & Conclusion Group4
No ratings yet
Discussion & Conclusion Group4
1 page
Group6 - Laboratory Activity No.6
No ratings yet
Group6 - Laboratory Activity No.6
11 pages
MLDS_A1_Spring2025
No ratings yet
MLDS_A1_Spring2025
3 pages
Introduction To Matlab Tutorial 11
No ratings yet
Introduction To Matlab Tutorial 11
37 pages
178 hw3
No ratings yet
178 hw3
3 pages
CPSC 540 Assignment 1 (Due January 19)
100% (1)
CPSC 540 Assignment 1 (Due January 19)
9 pages
indexamc_merged
No ratings yet
indexamc_merged
16 pages
Assignment 1
No ratings yet
Assignment 1
2 pages
hw3
No ratings yet
hw3
7 pages
Matlab Prject
No ratings yet
Matlab Prject
10 pages
HW 4
No ratings yet
HW 4
6 pages
ECE 3040 Lecture 6: Programming Examples: © Prof. Mohamad Hassoun
No ratings yet
ECE 3040 Lecture 6: Programming Examples: © Prof. Mohamad Hassoun
17 pages
18-660: Numerical Methods For Engineering Design and Optimization
No ratings yet
18-660: Numerical Methods For Engineering Design and Optimization
27 pages
Solutions Manual Scientific Computing
0% (1)
Solutions Manual Scientific Computing
192 pages
S Ccs Answers
No ratings yet
S Ccs Answers
192 pages
CS4100 CS5100 CW1 20241001
No ratings yet
CS4100 CS5100 CW1 20241001
10 pages
CMP 1
No ratings yet
CMP 1
31 pages
A2 Linear Models From Scratch
No ratings yet
A2 Linear Models From Scratch
2 pages
Machine Learning Homework
No ratings yet
Machine Learning Homework
8 pages
Homework-2
No ratings yet
Homework-2
8 pages
Lecture 3 - Linear Regression
No ratings yet
Lecture 3 - Linear Regression
31 pages
DM Practice
No ratings yet
DM Practice
15 pages
Assignment 1 (1)
No ratings yet
Assignment 1 (1)
4 pages
R Programming Exam: Instructions
No ratings yet
R Programming Exam: Instructions
2 pages
Linear Regression
No ratings yet
Linear Regression
31 pages
Machine Learning Homework1 Solutions
No ratings yet
Machine Learning Homework1 Solutions
16 pages
Mathematical Optimization: Fundamentals and Applications
From Everand
Mathematical Optimization: Fundamentals and Applications
Fouad Sabry
No ratings yet
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
A Short Course in Discrete Mathematics
From Everand
A Short Course in Discrete Mathematics
Edward A. Bender
3/5 (1)
YEAR_12_EXAM_QUESTIONS_REVISION__1_
No ratings yet
YEAR_12_EXAM_QUESTIONS_REVISION__1_
53 pages
Learn Revit Parameters
No ratings yet
Learn Revit Parameters
16 pages
MACH1000 2-28 Ports Gigabit Ethernet PoE Switch
No ratings yet
MACH1000 2-28 Ports Gigabit Ethernet PoE Switch
3 pages
Fsi Divergence
No ratings yet
Fsi Divergence
10 pages
Molecular Graphics
No ratings yet
Molecular Graphics
13 pages
MagNet - Tutorials
No ratings yet
MagNet - Tutorials
241 pages
18 Dec 2021 - Database (SQL)
No ratings yet
18 Dec 2021 - Database (SQL)
58 pages
PrivyID API Documentation v1.9
No ratings yet
PrivyID API Documentation v1.9
31 pages
Projectile - Revision Sheet
No ratings yet
Projectile - Revision Sheet
3 pages
Class 10th Linear Equation
No ratings yet
Class 10th Linear Equation
3 pages
ملخص خصائص مكمنية
No ratings yet
ملخص خصائص مكمنية
5 pages
Drill Speeds and Feeds2
No ratings yet
Drill Speeds and Feeds2
13 pages
Maths Class 7 Question Bank
No ratings yet
Maths Class 7 Question Bank
117 pages
04 - Drop Call Presentation
No ratings yet
04 - Drop Call Presentation
32 pages
en Magneto Resistive Proximity Sensor SMTO
No ratings yet
en Magneto Resistive Proximity Sensor SMTO
2 pages
Gage R&R
No ratings yet
Gage R&R
23 pages
Measuring Volume
No ratings yet
Measuring Volume
8 pages
Lecturenote - 802493092HW I-Chap-4 - Handout
No ratings yet
Lecturenote - 802493092HW I-Chap-4 - Handout
34 pages
Minutes Parents Forum.
No ratings yet
Minutes Parents Forum.
16 pages
Electrical Energy Cambridge (CIE) IGCSE Physics Revision Notes 2021 3
No ratings yet
Electrical Energy Cambridge (CIE) IGCSE Physics Revision Notes 2021 3
1 page
Diffusion Phenomena in IN THIN FILMS AND MICROELECTRONIC MATERIALS
No ratings yet
Diffusion Phenomena in IN THIN FILMS AND MICROELECTRONIC MATERIALS
9 pages
Linear Algebra and Random Processes (CS6015)
No ratings yet
Linear Algebra and Random Processes (CS6015)
5 pages
Solving Partial Differential Equations: PIN Number: 239
No ratings yet
Solving Partial Differential Equations: PIN Number: 239
7 pages
Propositional Logic
No ratings yet
Propositional Logic
41 pages
Method Statement Sonic Integrity Testing
No ratings yet
Method Statement Sonic Integrity Testing
25 pages
l28-32h Project Guide
100% (1)
l28-32h Project Guide
286 pages
How To Mount Software RAID1 Member Using Mdadm
No ratings yet
How To Mount Software RAID1 Member Using Mdadm
4 pages
Filters
100% (2)
Filters
7 pages
LC377
No ratings yet
LC377
66 pages