Reinforcement Learning

Reinforcement Learning

МООК
en
Английский
This content is rated 0 out of 5
Source
  • From www.udacity.com
Conditions
  • Self-paced
  • Free Access
More info
  • 16 Sequences
  • Introductive Level

Their employees are learning daily with Edflex

  • Safran
  • Air France
  • TotalEnergies
  • Generali
Learn more

Course details

Syllabus

* Reinforcement Learning Basics* Introduction to BURLAP* TD Lambda* Convergence of Value and Policy Iteration* Reward Shaping* Exploration* Generalization* Partially Observable MDPs* Options* Topics in Game Theory* Further Topics in RL Models

Prerequisite

None.

Instructors

  • Charles Isbell - Charles Isbell is a Professor and Senior Associate Dean at the School of Interactive Computing at Georgia Tech. His research passion is artificial intelligence, particularly on building autonomous agents that must live and interact with large numbers of other intelligent agents, some of whom may be human. Lately, he has turned his energies toward adaptive modeling, especially activity discovery (as distinct from activity recognition), scalable coordination, and development environments that support the rapid prototyping of adaptive agents. He is developing adaptive programming languages, and trying to understand what it means to bring machine learning tools to non-expert authors, designers and developers. He sometimes interacts with the physical world through racquetball, weight-lifting and Ultimate Frisbee.
  • Michael Littman - Michael Littman is a Professor of Computer Science at Brown University. He also teaches Udacity’s Algorithms course (CS215) on crunching social networks. Prior to joining Brown in 2012, he led the Rutgers Laboratory for Real-Life Reinforcement Learning (RL3) at Rutgers, where he served as the Computer Science Department Chair from 2009-2012. He is a Fellow of the Association for the Advancement of Artificial Intelligence (AAAI), served as program chair for AAAI's 2013 conference and the International Conference on Machine Learning in 2009, and received university-level teaching awards at both Duke and Rutgers. Charles Isbell taught him about racquetball, weight-lifting and Ultimate Frisbee, but he's not that great at any of them. He's pretty good at singing and juggling, though.

Editor

Le Georgia Institute of Technology, connu aussi sous le nom de Georgia Tech ou GT, est une université de recherche mixte publique, et située à Atlanta (Géorgie), aux États-Unis. Elle fait partie du réseau plus large du Système universitaire de Géorgie (en anglais, University System of Georgia). Georgia Tech possède des antennes à Savannah (Géorgie, États-Unis), Metz (France), Athlone (Irlande), Shanghai (Chine), et Singapour.

Georgia Tech a acquis sa réputation grâce à ses programmes d'ingénierie et d'informatique, ceux-ci figurant parmi les meilleurs du monde5,6. L'offre de formation est complétée par des programmes dans les domaines des sciences, de l'architecture, des sciences humaines et du management.

Platform

Udacity est une entreprise fondé par Sebastian Thrun, David Stavens, et Mike Sokolsky offrant cours en ligne ouvert et massif.

Selon Thrun, l'origine du nom Udacity vient de la volonté de l'entreprise d'être "audacieux pour vous, l'étudiant ". Bien que Udacity se concentrait à l'origine sur une offre de cours universitaires, la plateforme se concentre désormais plus sur de formations destinés aux professionnels.

This content is rated 4.5 out of 5
(no review)
This content is rated 4.5 out of 5
(no review)
Complete this resource to write a review