RnDCircle Logo
arrow left icon

Artificial Intelligence & Probabilistic Reasoning Lab

한국과학기술원 본교(제1캠퍼스) 김재철AI대학원

김기응 교수

Multi-Agent Systems

Policy Optimization

Reinforcement Learning

발행물

전체 논문

130

121

Exploration in Gradient-based Reinforcement Learning
Nicolas Meuleau, Leonid Peshkin, Kee-Eung Kim
MIT, AI Memo(2001-003), 2001

122

Approximate Solutions to Factored Markov Decision Processes via Greedy Search in the Space of Finite State Controllers
Kee-Eung Kim, Thomas Dean, Nicolas Meuleau
Proceedings of the Fifth International Conference on Artificial Intelligence in Planning and Scheduling (AIPS), 2000

123

Learning to Cooperate via Policy Search
Leonid Peshkin, Kee-Eung Kim, Nicolas Meuleau, Leslie Pack Kaelbling
Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI), 2000

124

Linear Algebra in Very High-Dimension Vector Spaces With an Application to Solving Markov Decision Processes
Kee-Eung Kim, Thomas Dean, Samuel Hazlehurst
Neural Computing Surveys, 2000

125

Linear Algebra in Very High-Dimension Vector Spaces: Algorithms and Data Structures for Implementing Exact and Approximate Solution Methods
Kee-Eung Kim, Thomas Dean, Samuel Hazlehurst
Department of Computer Science, Brown University, Technical Report(CS-00-02), 2000

126

High-Dimension Vector Spaces: Algorithms and Data Structures for Implementing Exact and Approximate Solution Methods
Department of Computer Science, Brown University
Technical Report(CS-00-02), 2000

127

Learning Finite-State Controllers for Partially Observable Environments
Nicolas Meuleau, Leonid Peshkin, Kee-Eung Kim, Leslie Pack Kaelbling
UAI, 1999

128

Solving POMDPs by Searching the Space of Finite Policies
Nicolas Meuleau, Kee-Eung Kim, Leslie Pack Kaelbling, Anthony R. Cassandra
UAI, 1999

129

Solving Very Large Weakly Coupled Markov Decision Processes
Nicolas Meuleau, Milos Hauskrecht, Kee-Eung Kim, Leonid Peshkin, Leslie Pack Kaelbling, Thomas Dean, Craig Boutilier
AAAI, 1998

130

Solving Planning Problems with Large State and Action Spaces
Thomas Dean, Kee-Eung Kim, Robert Givan
AIPS, 1998