10.3 Solving the Sparse Reward Problem: Goal-Conditioned RL

10.3 Solving the Sparse Reward Problem: Goal-Conditioned RL
10.3.1 The Reward Design Challenge in Robot Manipulation
10.3.2 Hindsight Experience Replay (HER): Reinterpreting Failure as Success
10.3.3 Combining Dynamic Goal Setting with Curriculum Learning
