AI Research Wiki

Tag: language-model

7 items with this tag.

  • Apr 11, 2026

    Generative Pre-trained Transformer (GPT)

    • architecture
    • language-model
    • decoder-only
    • autoregressive
  • Apr 11, 2026

    Chinchilla: Training Compute-Optimal Large Language Models

    • scaling-laws
    • compute-optimal
    • language-model
  • Apr 11, 2026

    Improving Language Understanding by Generative Pre-Training

    • pretraining
    • fine-tuning
    • transfer-learning
    • language-model
    • transformer
  • Apr 11, 2026

    Language Models are Unsupervised Multitask Learners

    • language-model
    • zero-shot
    • transfer-learning
    • scaling
    • transformer
  • Apr 11, 2026

    Language Models are Few-Shot Learners

    • few-shot-learning
    • in-context-learning
    • language-model
    • scaling
  • Apr 11, 2026

    InstructGPT: Training Language Models to Follow Instructions with Human Feedback

    • rlhf
    • alignment
    • instruction-following
    • language-model
  • Apr 11, 2026

    LLaMA: Open and Efficient Foundation Language Models

    • open-source
    • efficient-training
    • language-model
    • scaling-laws
