« Apprentissage par renforcement hors-ligne » : différence entre les versions
(Page créée avec « ==en construction== == Définition == XXXXXXXXX == Français == ''' XXXXXXXXX ''' == Anglais == ''' Offline Reinforcement Learning''' '''Offline RL''' Offline RL is... ») |
(Aucune différence)
|
Version du 27 mars 2023 à 08:46
en construction
Définition
XXXXXXXXX
Français
XXXXXXXXX
Anglais
Offline Reinforcement Learning
Offline RL
Offline RL is a paradigm that learns exclusively from static datasets of previously collected interactions, making it feasible to extract policies from large and diverse training datasets. Effective offline RL algorithms have a much wider range of applications than online RL, being particularly appealing for real-world applications such as education, healthcare, and robotics.
Contributeurs: Patrick Drouin, wiki