NumPy

Slicing

Categorical

Binomial

TensorFlow

Random

Jupyter

Python

Reinforcement learning

None

Pandas

Batch size

Dynamic programming

Format

Financial

Stock

Yield

Early stopping

Regression problem

Psycopg2

PostgreSQL

TensorFlow Lite

Policy

Reward

Return

Value function

Optimal value function

Optimal policy

Policy improvement

Policy evaluation

Policy iteration