Skip to content

Latest commit

 

History

History
5 lines (3 loc) · 149 Bytes

File metadata and controls

5 lines (3 loc) · 149 Bytes

A common choice is to choose the lastst policy to sample from the environment.

Before collecting more data you can take K steps. ![[q_iter.png]]