Reinforcement Learning from Human Feedback (Q115570683)

From Wikidata
Jump to navigation Jump to search
variant of reinforcement learning
  • RLHF
  • Reinforcement learning from human feedback
  • reinforcement learning from human preferences
edit
Language Label Description Also known as
English
Reinforcement Learning from Human Feedback
variant of reinforcement learning
  • RLHF
  • Reinforcement learning from human feedback
  • reinforcement learning from human preferences

Statements