AI Research Wiki
Tag: efficiency
5 items with this tag.
Apr 11, 2026
Mixture of Experts (MoE)
architecture
sparse-models
scaling
efficiency
Apr 11, 2026
FlashAttention
efficiency
attention
systems
Apr 11, 2026
Mixture of Experts
architecture
efficiency
scaling
Apr 11, 2026
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
attention
efficiency
gpu-optimization
systems
Apr 11, 2026
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
mixture-of-experts
sparsity
scaling
efficiency