AI Research Wiki
Tag: scaling
7 items with this tag.
Apr 11, 2026
Mixture of Experts (MoE)
architecture
sparse-models
scaling
efficiency
Apr 11, 2026
Mixture of Experts
architecture
efficiency
scaling
Apr 11, 2026
Scaling Laws
scaling
empirical-laws
compute
Apr 11, 2026
Language Models are Unsupervised Multitask Learners
language-model
zero-shot
transfer-learning
scaling
transformer
Apr 11, 2026
Language Models are Few-Shot Learners
few-shot-learning
in-context-learning
language-model
scaling
Apr 11, 2026
GPT-4 Technical Report
multimodal
frontier-model
scaling
alignment
Apr 11, 2026
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
mixture-of-experts
sparsity
scaling
efficiency