AI Research Wiki

Tag: instruction-following

1 item with this tag.

  • Apr 11, 2026

    InstructGPT: Training Language Models to Follow Instructions with Human Feedback

    • rlhf
    • alignment
    • instruction-following
    • language-model

Created with Quartz v4.5.2 © 2026

  • GitHub
  • Discord Community