CS 4803 / 7643 Deep Learning
Spring 2021, WF 2:00 - 3:15 pm, Remote
This is an exciting time to be studying (Deep) Machine Learning, or Representation Learning, or for lack of a better term, simply Deep Learning!
Deep Learning is rapidly emerging as one of the most successful and widely applicable set of techniques across a range of domains (vision, language, speech, reasoning, robotics, AI in general), leading to some pretty significant commercial success and exciting new directions that may previously have seemed out of reach.
This course will introduce students to the basics of Neural Networks (NNs) and expose them to some cutting-edge research. It is structured in modules (background, Convolutional NNs, Recurrent NNs, Deep Reinforcement Learning, Deep Structured Prediction). Modules will be presented via instructor lectures and reinforced with homeworks that teach theoretical and practical aspects. The course will also include a project which will allow students to explore an area of Deep Learning that interests them in more depth.
- Class meets
- Wednesday, Friday 2:00 - 3:15 pm; Remote
- Class link
- https://piazza.com/class/kjsselshfiz18c (Code: DL2021)
- CS4803: gatech.instructure.com/courses/172518
- CS7643: gatech.instructure.com/courses/172536
- CS4803: gradescope.com/courses/228228 (Code: 2RPWYG)
- CS7643: gradescope.com/courses/229744 (Code: 4PG38J)
|W1: Jan 15||
Intro lecture + class logistics.
PS0 out: PDF ZIP
|W2: Jan 20||
Supervised Learning, Linear Classification, Loss functions
|W2: Jan 22||
Gradient Descent, Linear Algebra View.
|W3: Jan 27||
Gradient Descent, Neural Networks.
|W3: Jan 29||
Neural Networks and Backpropagation.
|W4: Feb 3||
Neural Networks, Backpropagation, and Automatic Differentiation.
Slides (pdf) [Updated]
Gradients notes and MLP/ReLU Jacobian notes.
|W4: Feb 5||
Jacobians, AutoDiff, Activation Functions.
|W5: Feb 10||
Optimization, Regularization, Data Augmentation.
OMSCS Video: Data Wrangling (see dropbox link piazza @8 M1L4 directory).
PS/HW1 due Feb 10th 11:59pm, PS/HW2 out
|W5: Feb 12||
Optimization, Imbalance, Convolution
|W6: Feb 17||
CNNs notes, CNNs Backprop notes.
|W6: Feb 19||
Convolutional Neural Networks (CNNs)
Slides (pdf) (NOTE: Watch M2L6 in dropbox, see piazza)
|W7: Feb 24||
|W7: Feb 26||
Visualization cont. Advanced CNN architectures.
PS/HW2 due Feb. 28th, PS/HW3 out
|W8: Mar 3||
Advanced CNN architectures, Bias and Fairness Video (M2L10 on dropbox)
|W8: Mar 5||
Bias and Fairness, Introduction to Structured Neural Representations, Language Models
|W9: Mar 10||
Recurrent Neural Networks 2, Long Short-Term Memory
|W9: Mar 12||
Neural Attention Models (Transformers). Guest Lecture by Arjun Majumdar
PS/HW3 due Mar. 18th, PS/HW4 out
|W10: Mar 17||Project Planning session|
|W10: Mar 19||
Masked Language Model details, Embeddings.
Project Proposal due (March 22nd)
|W11: Mar 24||No Class (Spring Break Day)|
|W11: Mar 26||
Annotated Slides (pdf).
|W12: Mar 31||
RL: Dynamic Programming (Policy Iteration), Q-Learning, DQN.
|W12: Apr 2||
RL: Policy Gradients, REINFORCE, Actor-Critic.
PS/HW4 due Apr 8th
|W13: Apr 7||
Guest Lecture by Michael Auli (FB): Self-supervised learning for speech processing
|W13: Apr 9||
Guest Lecture by Ishan Misra (FB): Self-Supervised Learning
|W14: Apr 14||
Guest Lecture by Mahaveer Jain (FB): RNN-T Based Automatic Speech Recognition Systems
Slides (pdf), Source Slides (pptx).
|W14: Apr 16||
Generative Adversarial Networks (GANs).
|W15: Apr 21||
|W15: Apr 23||
Few-shot learning, architecture search, wrap-up.
Project due on May 3rd
- 80% Homework (4 homeworks)
- 20% Final Project
- 3% (potential bonus) Class Participation
Late policy for deliverables
- No penalties for medical reasons or emergencies. Please see GT Catalog for rules about contacting the office of the Dean of Students. DO NOT send us any private health or other information, and make sure to contact them as soon as you are able after the emergency.
- All assignments are due as mentioned on Canvas/Piazza. Every homework and project deliverable will have a 48-hour grace period (EXCEPT FOR PS0) during which no penalty will apply. This is intended for you as time to verify that your submission has been submitted (we recommend you re-download it and look it over to make sure all questions/deliverables have been answered). Canvas will show your submission as late, but you do not have to ask for this grace period. Deliverables after the grace period will receive a grade of 0.
- To re-emphasize: Any assignments submitted after grace period will receive a 0 (no exceptions).
CS 4803/7643 should not be your first exposure to machine learning. Ideally, you need:
- Intro-level Machine Learning
- CS 3600 for the undergraduate section and CS 7641/ISYE 6740/CSE 6740 or equivalent for the graduate section.
- Dynamic programming, basic data structures, complexity (NP-hardness)
- Calculus and Linear Algebra
- positive semi-definiteness, multivariate derivates (be prepared for lots and lots of gradients!)
- This is a demanding class in terms of programming skills.
- HWs will involve Python and a mix of libraries (PyTorch, NumPy).
- Your language of choice for project.
- Ability to deal with abstract mathematical concepts
Project Details (20% of course grade)
The class project is meant for students to (1) gain experience implementing deep models and (2) try Deep Learning on problems that interest them. The amount of effort should be at the level of one homework assignment per group member (2-5 people per group, 3 recommended).
What is the delivery format?
Due to Covid-19, this Spring 21 instance is “Remote Sync”, meaning that lectures are delivered over VC at the scheduled class time. All other deliverables are accepted online. The class has no in-person interaction.
We will be following these Covid-19 guidelines and procedures, with the exception of homework submission, as all homeworks for this course are submitted through Gradescope.
The class is full. Can I still get in?
Sorry. The course admins in CoC control this process. Please talk to them.
Unregistered Students who intend to register:
Whether you are registered or plan to register, if you want to take the course I expect you to attend the classes and submit any deliverables that are assigned in the first or second weeks of class.
If you are not registered for this course, you will not have access to Gradescope for submission of PS0. Please email the instructor (zkira at gatech) to be added to Gradescope and be able to submit anything due on the first day of class. Make sure to mention which course (CS 4803DL or 7643) you plan to enroll in.
Registered students who are not able to access Gradescope:
This will happen if you were registered to the course very recently. Gradescope rosters are synced periodically and it may take some time for you to receive a Gradescope sign-up notification. If you still face problems with accessing Gradescope, please email the course email (below).
I am graduating this Fall and I need this class to complete my degree requirements. What should I do?
Talk to the advisor or graduate coordinator for your academic program. They are keeping track of your degree requirements and will work with you if you need a specific course.
Can I audit this class or take it pass/fail?
No. Due to the large demand for this class, we will not be allowing audits or pass/fail. Letter grades only. This is to make sure students who want to take the class for credit can.
I have a question. What is the best way to reach the course staff?
Registered students – your first point of contact is Piazza (so that other students may benefit from your questions and our answers). You can also use private messages as well for things that might include answers or other information you do not want others to have (do not send private information such as health information though). If you have a personal matter, email the instructor (zkira at gatech)</p>
Related Classes / Online Resources
- CS231n Convolutional Neural Networks for Visual Recognition, Stanford
- Machine Learning, Oxford
- Deep Learning, New York University
- Deep Learning, CMU
- Deep Learning, University of Maryland
- Hugo Larochelle’s Neural Networks class
Note to people outside Georgia Tech
Feel free to use the slides and materials available online here. If you use our slides, an appropriate attribution is requested. Please email the instructor with any corrections or improvements.