Reinforcement learning from human preferences

Page de redirection

Rediriger vers :

Apprentissage par renforcement et rétroaction humaine

Récupérée de « https://datafranca.org/wiki/index.php?title=Reinforcement_learning_from_human_preferences&oldid=78943 »

ENGLISH

Contributeurs: wiki