AI Research Wiki
Folder: papers
21 items under this folder.
Last updated: Apr 11, 2026

- Attention Is All You Need (tags: transformer, self-attention, machine-translation, architecture)
- Neural Machine Translation by Jointly Learning to Align and Translate (tags: attention, machine-translation, encoder-decoder)
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (tags: pretraining, bidirectional, masked-language-model, fine-tuning, transformer)
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models (tags: chain-of-thought, prompting, reasoning, in-context-learning)
- Chinchilla: Training Compute-Optimal Large Language Models (tags: scaling-laws, compute-optimal, language-model)
- Constitutional AI: Harmlessness from AI Feedback (tags: constitutional-ai, alignment, rlhf, ai-feedback)
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model (tags: direct-preference-optimization, alignment, rlhf, preference-learning)
- FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness (tags: attention, efficiency, gpu-optimization, systems)
- Improving Language Understanding by Generative Pre-Training (tags: pretraining, fine-tuning, transfer-learning, language-model, transformer)
- Language Models are Unsupervised Multitask Learners (tags: language-model, zero-shot, transfer-learning, scaling, transformer)
- Language Models are Few-Shot Learners (tags: few-shot-learning, in-context-learning, language-model, scaling)
- GPT-4 Technical Report (tags: multimodal, frontier-model, scaling, alignment)
- InstructGPT: Training Language Models to Follow Instructions with Human Feedback (tags: rlhf, alignment, instruction-following, language-model)
- LLaMA: Open and Efficient Foundation Language Models (tags: open-source, efficient-training, language-model, scaling-laws)
- LoRA: Low-Rank Adaptation of Large Language Models (tags: fine-tuning, parameter-efficiency, low-rank-adaptation)
- RoFormer: Enhanced Transformer with Rotary Position Embedding (tags: positional-encoding, self-attention, transformer)
- Scaling Laws for Neural Language Models (tags: scaling-laws, language-modeling, compute-efficiency)
- Sequence to Sequence Learning with Neural Networks (tags: sequence-to-sequence, machine-translation, encoder-decoder, lstm)
- Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity (tags: mixture-of-experts, sparsity, scaling, efficiency)
- Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer (tags: transfer-learning, encoder-decoder, pretraining, nlp-benchmark)
- Efficient Estimation of Word Representations in Vector Space (tags: word-embeddings, representation-learning, nlp)