Reinforcement Learning

Protected: Trade-off between exploration and utilization -Regret and stochastic optimal measures, heuristics

Reinforcement learning with regrets, stochastic optimal measures, and heuristics

2022.01.19

オンライン学習強化学習微分積分:Calculus最適化:Optimization機械学習:Machine Learning確率・統計:Probability and Statistics

Protected: Planning Problems (2) Implementation of Dynamic Programming (Value Iterative Method and Measure Iterative Method)

Implementation of Dynamic Programming (Value Iteration and Policy Iteration) for Planning Problems as Reinforcement Learning for Digital Transformation , Artificial Intelligence and Machine Learning Tasks

2022.01.18

オンライン学習強化学習微分積分:Calculus最適化:Optimization機械学習:Machine Learning確率・統計:Probability and Statistics

Protected: Planning Problems(1) – Approaches Using Dynamic Programming and Theoretical Underpinnings

Reinforcement learning by planning problems (dynamic programming and linear programming) for sequential decision problems in known environments used for digital transformation , artificial intelligence and machine learning tasks.

2022.01.17

強化学習微分積分:Calculus最適化:Optimization機械学習:Machine Learning確率・統計:Probability and Statistics

Machine Learning Professional Series “Reinforcement Learning” Reading Memo

A reference book on reinforcement learning to observe the current situation and determine what action to take, used in digital transformation ,artificial intelligence , and machine learning tasks.

2022.01.10

機械学習:Machine Learning深層学習:Deep Learning

Online learning and online prediction

Online learning is a sequential machine learning technique used in digital transformation , artificial intelligence , and machine learning tasks, and online prediction combines these techniques with decision-making problems.

2022.01.05

オンライン学習強化学習微分積分:Calculus最適化:Optimization機械学習:Machine Learning確率・統計:Probability and Statistics

This is a good introduction to deep learning (Machine Learning Startup Series)Reading Notes

Overview of deep learning for digital transformation and artificial intelligence tasks, including machine learning, gradient descent, regularization, error back propagation, self-encoders, convolutional neural networks, recurrent neural networks, Boltzmann machines, and reinforcement learning.

2021.11.15

微分積分:Calculus数理論理学:Mathematical logic最適化:Optimization機械学習:Machine Learning深層学習:Deep Learning確率・統計:Probability and Statistics線形代数:Linear Algebra

Decision Theory and Mathematical Decision Making Techniques

We will discuss mathematical decision-making techniques used in reinforcement learning, online prediction, and algorithms for high-speed automated stock trading. After describing the four decision strategies, I will introduce subjective probability, Bayesian theory, multiple concept theory, and supply rise theory.

2021.06.05

数学:Mathematics確率・統計:Probability and Statistics

Protected: Reinforcement Learning Overview

An overview of reinforcement learning for learning sequential decision rules

2021.05.20

アルゴリズム:Algorithms最適化:Optimization機械学習:Machine Learning深層学習:Deep Learning