Protected: Model-free reinforcement learning(1) – Value iteration methods (Monte Carlo, TD, TD(λ))

オンライン学習

2024.07.05 2022.01.20

AIシステム設計・意思決定構造の設計を専門としています。
Ontology・DSL・Behavior Treeによる判断の外部化、マルチエージェント構築に取り組んでいます。

Specialized in AI system design and decision-making architecture.
Focused on externalizing decision logic using Ontology, DSL, and Behavior Trees, and building multi-agent systems.

タイトルとURLをコピーしました