(대학원생 마일리지 적용) 학과 세미나 안내 (7/12(금) 1시 30분, 프린스턴대 이동헌 교수 / Seminar Notice : July 12 at 13:30 pm, Dr. Donghun Lee, Princeton Univ.)
산업및시스템공학과 세미나가 다음과 같이 진행될 예정입니다.
1. 일시및장소 : 7/12(금) 오후 1시 30분, 산업경영학동 2층멀티미디어실(2501호)
2. 주제 : Max-Bias in Q-learning: Consequences and Corrections
3. 연사 : 이동헌 박사(Princeton University)
4. 언어: 한국어
김 민 경 드림
Dear ISysE Professors and Students,
ISysE dept. office invites you to attend the following semiar.
DATE & TIME: Friday, July 12 at 1:30 pm
VENUE: E2 Bld., #2501
TITLE: Max-Bias in Q-learning: Consequences and Corrections
SPEAKER : Donghun Lee, Princeton University
With kind regards,
Min Kyugn Kim
## Abstract Q-learning is a classic reinforcement learning algorithm that influenced many modern algorithms found in successful AI applications.
Its asymptotic convergence properties are remarkable, but the practitioners should use caution as Q-learning contains systematic max-bias. This talk will illuminate the consequence of naive application of Q-learning and present bias-corrected Q-learning, using two different problems:
the game of Roulette, and the control problem of intelligent batteries in smart grids.
## Key Reference