확률적 플래닝의 기초 (Foundations of Probabilistic Planning)

1. 개요

확률적 플래닝(probabilistic planning)은 행동의 결과가 확률적(stochastic)인 환경에서의 계획을 다룬다. 고전적 계획의 결정론적 가정을 완화하여, 실제 로봇 환경의 불확실성을 반영한다.

$P(s' \mid s, a) : \text{상태 } s \text{에서 행동 } a \text{를 실행한 후 상태 } s' \text{에 도달할 확률}$

확률적 환경에서는 고정된 행동 시퀀스가 아닌, 상태에 따라 행동을 결정하는 **정책(policy)**이 계획의 형태이다.

$\pi : S \rightarrow A$

Ghallab, M., Nau, D., & Traverso, P. (2016). Automated Planning and Acting. Cambridge University Press.
Kaelbling, L. P., Littman, M. L., & Cassandra, A. R. (1998). “Planning and Acting in Partially Observable Stochastic Domains.” Artificial Intelligence, 101(1-2), 99-134.

버전	날짜	변경 사항
v0.1	2026-04-05	초안 작성