Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error | Prof. Taeyoung Kim's Lab | Seokyeong University, Department of Electronics and Computer Engineering

book-chapter | Citations: 1 · 2023

Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error

Bumgeun Park, Taeyoung Kim, Woohyeon Moon, Sarvar Hussain Nengroo, Dongsoo Har

Lecture Notes in Computer Science

Keywords

Reinforcement learning, Computer science, Convergence (economics), Weighting, Temporal difference learning, Function (biology), Rate of convergence, Suite, Algorithm, Mathematical optimization

Type

book-chapter

IF / Citations

- / 1

Original text

https://doi.org/10.1007/978-981-99-4761-4_51

Publication year

2023