vault backup: 2024-11-24 02:29:04

Marco Realacci 2024-11-24 02:29:04 +01:00
parent ae69c5fb2b
commit 97f314dbb4
11 changed files with 162 additions and 41 deletions


@@ -0,0 +1,16 @@
...
application of the RL methodology to the addressed problem
agent collaboration
- no
- proactive
- reactive
reward function: is the reward immediate?
environment
model-free or model-based? (model-based if the agent learns the transition matrix)
exploration vs exploitation: policy
convergence studies
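
A minimal sketch of how some of the checklist items above fit together, under assumptions that are not in the notes: a toy 1-D chain environment, tabular Q-learning as the model-free method, and an epsilon-greedy policy for exploration vs exploitation. The names `epsilon_greedy` and `q_learning` and all hyperparameter values are hypothetical, chosen only for illustration.

```python
import random


def epsilon_greedy(q_row, epsilon, rng):
    """Explore with probability epsilon, otherwise exploit the best-known action."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))  # exploration: uniform random action
    best = max(q_row)
    # exploitation: greedy action, ties broken at random
    return rng.choice([a for a, v in enumerate(q_row) if v == best])


def q_learning(n_states=5, episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical chain: action 0 = left, 1 = right.

    The reward is immediate: 1.0 on the step that reaches the last state,
    0.0 on every other step.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    goal = n_states - 1
    for _ in range(episodes):
        s = 0
        while s != goal:
            a = epsilon_greedy(q[s], epsilon, rng)
            s_next = max(0, s - 1) if a == 0 else s + 1
            r = 1.0 if s_next == goal else 0.0
            # Model-free update: only the sampled transition (s, a, r, s_next)
            # is used; no transition matrix is estimated.
            q[s][a] += alpha * (r + gamma * max(q[s_next]) - q[s][a])
            s = s_next
    return q


if __name__ == "__main__":
    q = q_learning()
    greedy_policy = [max(range(2), key=lambda a: q[s][a]) for s in range(len(q) - 1)]
    print(greedy_policy)
```

A model-based variant would additionally estimate the transition matrix from the observed `(s, a, s_next)` samples and plan with it. For a convergence study, one could log the Q-table or the episode length per episode and check that both stabilize.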