Problem Set 9

Problem Set 9

This is to be completed by December 21st, 2017.

Exercises

1. Datacamp
• Complete the lesson:
a. Intermediate R: Practice
2. R Lab:
• Consider a two class classification problem with one class denoted positive.
• Given a list of probability predictions for the positive class, a list of the correct probabilities (0’s and 1’s), and a number N>=2 of data points, construct a function which produces an Nx2 matrix/dataframe whose ith row (starting at 1) is the pair (x,y) where x is the false positive rate and y is the true positive rate of a classifier which classifies to true if the probability is greater or equal to (i-1)/(N-1).
• Construct another function which produces the line graph associated to the points from the previous function .
• Finally, produce another function which estimates the area under the curve of the previous graph.