AI Research Wiki

Tag: scaling

7 items with this tag.

  • Apr 11, 2026

    Mixture of Experts (MoE)

    • architecture
    • sparse-models
    • scaling
    • efficiency
  • Apr 11, 2026

    Mixture of Experts

    • architecture
    • efficiency
    • scaling
  • Apr 11, 2026

    Scaling Laws

    • scaling
    • empirical-laws
    • compute
  • Apr 11, 2026

    Language Models are Unsupervised Multitask Learners

    • language-model
    • zero-shot
    • transfer-learning
    • scaling
    • transformer
  • Apr 11, 2026

    Language Models are Few-Shot Learners

    • few-shot-learning
    • in-context-learning
    • language-model
    • scaling
  • Apr 11, 2026

    GPT-4 Technical Report

    • multimodal
    • frontier-model
    • scaling
    • alignment
  • Apr 11, 2026

    Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity

    • mixture-of-experts
    • sparsity
    • scaling
    • efficiency

Created with Quartz v4.5.2 © 2026

  • GitHub
  • Discord Community