기본 정보
연구 분야
프로젝트
논문
구성원
article|
인용수 0
·2025
Generalized Self-Play Reinforcement Learning for Othello under Dynamic Board Constraints
Bong‐Joong Kim, Y. L. Lee, Euiseok Hwang
초록

This research presents a self-play reinforcement learning framework for the game of Othello, enabling generalization across diverse board constraints. These constraints include variable board sizes, blocked cells, and total inference time limitations. FastOthelloNet incorporates a lightweight convolutional input architecture and employs Monte Carlo Tree Search (MCTS) for efficient planning. Unlike AlphaZero, which assumes a fixed board structure, or MuZero, which explicitly models latent dynamics, FastOthelloNet is trained directly on a randomized Othello environment with dynamic constraints, eliminating the need for a complex world model.

키워드
Reinforcement learningGeneralizationTemporal difference learningMonte Carlo tree searchInferenceTree (set theory)Latent variable
타입
article
IF / 인용수
- / 0
게재 연도
2025

주식회사 디써클

대표 장재우,이윤구서울특별시 강남구 역삼로 169, 명우빌딩 2층 (TIPS타운 S2)대표 전화 0507-1312-6417이메일 info@rndcircle.io사업자등록번호 458-87-03380호스팅제공자 구글 클라우드 플랫폼(GCP)

© 2026 RnDcircle. All Rights Reserved.