Syllabus
-
Markov Decision Processes
- Introduction
- Sensor Networks
- Supply Chain Management
- Energy Efficiency
- Reading Assignment (Policy Functions)
- Homework Assignment (Bellman Equation)
- Markov Decision Processes
- The Bellman Equations
-
Dynamic Programming
- Route Planning
- Options Pricing
- Scheduling
- Operating Systems
- Reading Assignment (History of DP)
- Homework Assignment (Value Iteration)
- Dynamic Programming
-
Monte Carlo Methods
- Medical Diagnosis
- Network Routing Optimization
- Physics Research
- Reading Assignment (Exploration vs Exploitation)
- Homework Assignment (Greedy Policies)
- MC Prediction and MC Control
-
Model Free Learning
- Delivery Management
- Automated Trading
- Backgammon
- Dopamine in Neuroscience
- Reading Assignment (SARSA)
- Homework Assignment (Q Learning)
- Temporal Difference Learning
-
RL in Continuous Spaces
- Self Driving Cars
- Delivery Drones
- Rescue Robots
- Assembly Robots
- Reading Assignment (Control Theory)
- Midterm Assignment (Make a Bipedal Robot Walk )
- Continuous Space Techniques
-
Deep Reinforcement Learning
- Traffic Optimization
- Gaming
- Meta Learning
- Homework Assignment (Deep Q Learning)
- Reading Assignment (DQN Improvements)
- The Evolution of Deep Q Learning
-
Policy Based Methods
- Web System Configuration
- Text Summarization
- AI Assisted Design
- Portfolio Optimization
- Reading Assignment (Evolutionary Algorithms)
- Homework Assignment (REINFORCE)
- Stochastic Policy Search
-
Policy Gradient Methods
- Dialogue Systems
- Photo Editing
- Language Translation
- Tutoring Systems
- Reading Assignment (Evolved Policy Gradients)
- Homework Assignment (TRPO)
- Generalized Advatange Estimation
-
Actor Critic Methods
- Advanced Trading Techniques
- Human-Machine Cooperation
- Insurance Cost Analysis
- Reading Assignment (Actor Critic Algorithms)
- Homework Assignment (Bayesian Actor Critic)
- Asynchronous Advantage Actor Critic
-
Multi Agent RL
- Move 37
- Transporation Networks
- Decentralized Autonomous Organizations
- The Future of AI
- Reading Assignment (Cooperative Agents)
- Inverse Reinforcement Learning
- Final Project (Multi Agent Research Project)