Collections
Discover the best community collections!

Collections including paper arxiv:1706.03762

- Attention Is All You Need
  Paper • 1706.03762 • Published • 108
- Language Models are Few-Shot Learners
  Paper • 2005.14165 • Published • 18
- LLaMA: Open and Efficient Foundation Language Models
  Paper • 2302.13971 • Published • 20
- Llama 2: Open Foundation and Fine-Tuned Chat Models
  Paper • 2307.09288 • Published • 248

- High-Resolution Image Synthesis with Latent Diffusion Models
  Paper • 2112.10752 • Published • 15
- Adding Conditional Control to Text-to-Image Diffusion Models
  Paper • 2302.05543 • Published • 58
- Proximal Policy Optimization Algorithms
  Paper • 1707.06347 • Published • 11
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model
  Paper • 2305.18290 • Published • 64

- Language Models are Few-Shot Learners
  Paper • 2005.14165 • Published • 18
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
  Paper • 1810.04805 • Published • 25
- Attention Is All You Need
  Paper • 1706.03762 • Published • 108
- Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation
  Paper • 2510.23581 • Published • 41

- Neural Machine Translation by Jointly Learning to Align and Translate
  Paper • 1409.0473 • Published • 7
- Attention Is All You Need
  Paper • 1706.03762 • Published • 108
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
  Paper • 1810.04805 • Published • 25
- Hierarchical Reasoning Model
  Paper • 2506.21734 • Published • 46

- NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
  Paper • 2003.08934 • Published • 2
- Learning Transferable Visual Models From Natural Language Supervision
  Paper • 2103.00020 • Published • 19
- Emerging Properties in Self-Supervised Vision Transformers
  Paper • 2104.14294 • Published • 5
- Segment Anything
  Paper • 2304.02643 • Published • 6