Generalized Self-Play Reinforcement Learning for Othello under Dynamic Board Constraints | Prof. Bong-Joong Kim's Lab | Department of Materials Science and Engineering, Gwangju Institute of Science and Technology

Article · 2025 · Citations: 0

Generalized Self-Play Reinforcement Learning for Othello under Dynamic Board Constraints

Bong‐Joong Kim, Y. L. Lee, Euiseok Hwang

Abstract

This research presents a self-play reinforcement learning framework for the game of Othello that generalizes across diverse board constraints: variable board sizes, blocked cells, and limits on total inference time. FastOthelloNet incorporates a lightweight convolutional input architecture and employs Monte Carlo Tree Search (MCTS) for efficient planning. Unlike AlphaZero, which assumes a fixed board structure, or MuZero, which explicitly models latent dynamics, FastOthelloNet is trained directly on a randomized Othello environment with dynamic constraints, eliminating the need for a complex world model.
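
To make the idea of a randomized Othello environment concrete, the following is a minimal Python sketch of how a board with a random size and randomly blocked cells might be sampled, and how legal moves can be computed on it. It is not the authors' implementation: the names `sample_board`, `flips`, and `legal_moves`, the candidate sizes `(6, 8, 10)`, and the `max_blocked` limit are all assumptions chosen for illustration, and the inference-time budget mentioned in the abstract is not modeled here.

```python
import random

# Cell codes (hypothetical encoding, not taken from the paper).
EMPTY, BLACK, WHITE, BLOCKED = 0, 1, -1, 2
DIRECTIONS = [(-1, -1), (-1, 0), (-1, 1), (0, -1),
              (0, 1), (1, -1), (1, 0), (1, 1)]


def sample_board(rng, sizes=(6, 8, 10), max_blocked=4):
    """Sample a square board with a random size and random blocked cells."""
    n = rng.choice(sizes)
    board = [[EMPTY] * n for _ in range(n)]
    m = n // 2
    # Standard Othello opening: four discs in the center.
    board[m - 1][m - 1] = WHITE
    board[m][m] = WHITE
    board[m - 1][m] = BLACK
    board[m][m - 1] = BLACK
    # Block a random subset of the remaining empty cells.
    free = [(r, c) for r in range(n) for c in range(n)
            if board[r][c] == EMPTY]
    for r, c in rng.sample(free, rng.randint(0, max_blocked)):
        board[r][c] = BLOCKED
    return board


def flips(board, r, c, player):
    """Return the opponent discs flipped if `player` moves at (r, c)."""
    n = len(board)
    if board[r][c] != EMPTY:  # occupied or blocked cells are unplayable
        return []
    flipped = []
    for dr, dc in DIRECTIONS:
        line = []
        rr, cc = r + dr, c + dc
        # Walk over a run of opponent discs; a blocked cell ends the run.
        while 0 <= rr < n and 0 <= cc < n and board[rr][cc] == -player:
            line.append((rr, cc))
            rr, cc = rr + dr, cc + dc
        # The run counts only if it is closed by one of the player's discs.
        if line and 0 <= rr < n and 0 <= cc < n and board[rr][cc] == player:
            flipped.extend(line)
    return flipped


def legal_moves(board, player):
    """List every cell where `player` would flip at least one disc."""
    n = len(board)
    return [(r, c) for r in range(n) for c in range(n)
            if flips(board, r, c, player)]
```

In this sketch, calling `sample_board(random.Random(0))` at the start of each self-play episode would give a fresh board configuration. With `sizes=(8,)` and `max_blocked=0` it reduces to the standard 8×8 opening, where Black has the usual four legal moves.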

Keywords

Reinforcement learning, Generalization, Temporal difference learning, Monte Carlo tree search, Inference, Tree (set theory), Latent variable

Source

https://doi.org/10.1109/ictc66702.2025.11388216
