(CS/CNS/EE 155) Machine Learning & Data Mining

2018/2019 Winter Term (previous year)

Course Description

Prerequisite: background in algorithms, linear algebra, calculus, probability, and statistics (CS/CNS/EE/NB 154 or CS/CNS/EE 156a or instructor’s permission)

This course will cover popular methods in machine learning and data mining, with an emphasis on developing a working understanding of how to apply these methods in practice. This course will also cover core foundational concepts underpinning and motivating modern machine learning and data mining approaches. This course will also cover some recent research developments.

Course Details

Lectures on Tu/Th at 2:30pm-4pm in Beckman Institute Auditorium
Recitations on Th at 7pm (usually lasting 1 hour), in Annenberg 105
We will be using Moodle & Gradescope for managing homeworks and grades [Moodle link][Gradescope link]
We will be using Piazza for discussion forums and announcements [link]
Lectures will be video recorded [link (no Lecture 1, sorry)][last year's]
6 Homeworks (worth approximately 60% of final grade)
3 Miniprojects (worth approximately 30% of final grade)
Final Exam (worth approximately 10% of final grade)

Late Homework Policy

Assignments will be due at 9pm on Wednesday via Moodle. Students are allowed to use up to 48 late hours. Late hours must be used in units of hours. Specify the number of hours used when turning in the assignment. Late hours cannot be used on the final exam. There will be no TA support over the weekends.

Collaboration Policy

Detailed policy available here
TLDR;

Homeworks: Can discuss in groups, but write solutions on your own (e.g., 50 foot rule). Can consult online resources such as Wikipedia.
Miniprojects: Students are allowed to collaborate fully within their miniproject teams, but no collaboration is allowed between teams.
Final Exam: No collaboration of any kind is allowed. Can consult online resources such as Wikipedia.

Instructor

Yisong Yue yyue@caltech.edu

Teaching Assistants

Natalie Bernat	nbernat@caltech.edu
Nishanth Bhaskara	nbhaskar@caltech.edu
Julia Deacon	jcdeacon@caltech.edu
Marcus Dominguez-Kuhne	mddoming@caltech.edu
Rupesh Jeyaram	rjeyaram@caltech.edu
Cherie Jia	cjia@caltech.edu
Karthik Karnik	kkarnik@caltech.edu
Meera Krishnamoorthy	mkrishna@caltech.edu
Karthik Nair	knair@caltech.edu
Kapil Sinha	ksinha@caltech.edu
Vaishnavi Shrivastava	vshrivas@caltech.edu
Albet Tseng	atseng@caltech.edu
Michelle Zhao	mzhao@caltech.edu

Office Hours

Additional Textbooks and Resources

Machine Learning: a Probabilistic Perspective, by Kevin Murphy

Convex Optimization: Algorithms and Complexity (Free Version), by Sebastien Bubeck

Deep Learning, by Ian Goodfellow, Yoshua Bengio, and Aaron Courville

A Course in Machine Learning, by Hal Daume III

Matrix Cookbook

Probability Review

Maximum Entropy and Logistic Regression

A Beginner's Guide to Recurrent Networks and LSTMs [link]

Stochastic Gradient Descent Tricks [link]

Practical Bayesian Optimization for Efficient Grid Search of Tuning Parameters. [paper][software]

Overview of Topic Models. [paper]

Tutorial on Learning Reductions. [pdf]

Learning Reductions Overview. [paper]

Assignments

Homework LaTeX template (could be useful for writing homework solutions) [zip]
Homework 1, due 9pm on January 16th. [pdf][code]
Homework 2, due 9pm on January 23th. [files]
Homework 3, due 9pm on January 30th. [files]
Homework 4, due 9pm on February 6th. [files]
Miniproject 1, Kaggle submissions due 2pm on February 12th, report due 9pm on February 14th. [files]
Homework 5, due 9pm on February 20th. [files]
Miniproject 2, visualizations due 9pm on February 27nd, report due 9pm on February 28th. [pdf][files]
Homework 6, due 9pm on March 6th. [files]
Miniproject 3, submission due 9pm on March 13th, report due 9pm on March 14th. [files]

Lectures & Recitation Schedule

Note: schedule is subject to change.

				Further Reading:
1/08/2019	Lecture:	Administrivia, Basics, Bias/Variance, Overfitting	[slides]
1/10/2019	Lecture:	Perceptron, Gradient Descent	[slides]	Daume Chapter 3 Mistake Bounds for Perceptron [link] AdaGrad [link] Stochastic Gradient Descent Tricks [link] Bubeck Chaper 3
1/10/2019	Recitation:	Introduction to Python for Machine Learning
1/15/2019	Lecture:	SVMs, Logistic Regression, Neural Nets, Loss Functions, Evaluation Metrics	[slides]	Bounds on Error Expectation for SVMs [link]
1/17/2019	Lecture:	Regularization, Lasso	[slides]	Murphy 13.3
1/17/2019	Recitation:	Linear Algebra		The Matrix Cookbook [link]
1/22/2019	Lecture:	Decision Trees, Bagging, Random Forests	[slides]	Overview of Decision Trees [pdf] Overview of Bagging [pdf] Overview of Random Forests [pdf]
1/24/2019	Lecture:	Boosting, Ensemble Selection	[slides]	Schapire's Overview of Boosting [pdf] Papers on Ensemble Selection. [paper1][paper2]
1/29/2019	Lecture:	Deep Learning	[slides]	Deep Learning Book [html]
1/31/2019	Lecture:	Deep Learning Part 2	[slides]
1/31/2019	Recitation:	Keras Tutorial		[link]
2/5/2019	Lecture:	Unsupervised Learning, Clustering, Dimensionality Reduction	[slides]
2/7/2019	Lecture:	Latent Factor Models, Non-Negative Matrix Factorization	[slides]	Original Netflix Paper [link]
2/12/2019	Lecture:	Embeddings	[slides]	Locally Linear Embedding [link] Playlist Embedding [link] word2vec [link]
2/14/2019		NO LECTURE
2/19/2019	Lecture:	Recent Applications	[slides]
2/21/2019	Lecture:	Probabilistic Models, Naive Bayes	[slides]	Murphy 3.5
2/21/2019	Recitation:	Probability & Sampling
2/26/2019	Lecture:	Hidden Markov Models	[slides][notes]	Murphy 17.3--17.5
2/28/2019	Lecture:	Hidden Markov Models Part 2	(same material as above)
2/28/2019	Recitation:	Dynamic Programming
3/5/2019	Lecture:	Recent Applications: Deep Generative Models	[slides]
3/7/2019	Lecture:	Survey of Advanced Topics	[slides]
3/12/2019	Lecture:	Review & Q/A