기본 정보
연구 분야
프로젝트
발행물
구성원
book-chapter|
인용수 1
·2023
Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error
Bumgeun Park, Taeyoung Kim, Woohyeon Moon, Sarvar Hussain Nengroo, Dongsoo Har
Lecture notes in computer science
키워드
Reinforcement learningComputer scienceConvergence (economics)WeightingTemporal difference learningFunction (biology)Rate of convergenceSuiteAlgorithmMathematical optimization
타입
book-chapter
IF / 인용수
- / 1
게재 연도
2023