Protected: Model-free reinforcement learning (2) – Method iteration (Q-learning, SARSA, Actor-click method)

IOT技術:IOT Technology

2024.07.05 2022.01.21

AIシステム設計・意思決定構造の設計を専門としています。
Ontology・DSL・Behavior Treeによる判断の外部化、マルチエージェント構築に取り組んでいます。

Specialized in AI system design and decision-making architecture.
Focused on externalizing decision logic using Ontology, DSL, and Behavior Trees, and building multi-agent systems.

Exit mobile version

タイトルとURLをコピーしました