Transformer
Jun 12, 2017

Attention Is All You Need

Ashish Vaswani et al.

Introduced the Transformer architecture, which is the foundation for most modern large language models.

Transformer
NLP
Architecture
Research Paper
8 authors
GANs
Jun 10, 2014

Generative Adversarial Nets

Ian J. Goodfellow et al.

Proposed the GAN framework, in which a generator and a discriminator are trained against each other, leading to breakthroughs in image generation.

GANs
Generative Models
Computer Vision
Research Paper
8 authors
ResNet
Dec 10, 2015

Deep Residual Learning for Image Recognition

Kaiming He et al.

Introduced residual networks (ResNets), enabling the training of much deeper neural networks than previously possible.

ResNet
Computer Vision
Deep Learning
Research Paper
4 authors
BERT
Oct 11, 2018

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin et al.

A language representation model that uses the full context of a word, conditioning on the text both before and after it.

BERT
NLP
Language Model
Research Paper
4 authors
GPT-3
May 28, 2020

Language Models are Few-Shot Learners

Tom B. Brown et al.

Introduced GPT-3, a 175-billion-parameter model, and demonstrated that large language models can perform a variety of tasks from a few in-context examples, without fine-tuning.

GPT-3
LLM
Few-Shot Learning
Research Paper
31 authors
AlphaGo
Jan 27, 2016

Mastering the game of Go with deep neural networks and tree search

David Silver et al.

Detailed the AlphaGo system, which defeated the European Go champion Fan Hui 5-0, the first time a program beat a professional player on a full-sized board and a landmark achievement for AI.

AlphaGo
Reinforcement Learning
Game AI
Research Paper
20 authors
Diffusion Models
Jun 19, 2020

Denoising Diffusion Probabilistic Models

Jonathan Ho et al.

A foundational paper on diffusion models, which have become state-of-the-art for high-quality image generation.

Diffusion Models
Generative Models
Image Generation
Research Paper
3 authors
CLIP
Feb 26, 2021

Learning Transferable Visual Models From Natural Language Supervision

Alec Radford et al.

Introduced CLIP, a model that learns visual concepts from natural language, enabling powerful zero-shot image classification.

CLIP
Multimodal AI
Computer Vision
Research Paper
12 authors
Vision Transformer
Oct 22, 2020

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Alexey Dosovitskiy et al.

Applied the Transformer architecture directly to images, challenging the dominance of CNNs in computer vision.

Vision Transformer
ViT
Computer Vision
Research Paper
12 authors
DQN
Feb 25, 2015

Human-level control through deep reinforcement learning

Volodymyr Mnih et al.

The Deep Q-Network (DQN) paper that demonstrated an AI learning to play Atari games from raw pixel data.

DQN
Reinforcement Learning
Deep Learning
Research Paper
19 authors
Prompt Engineering
Jan 28, 2022

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Jason Wei et al.

Showed that prompting LLMs to generate a series of intermediate reasoning steps improves their performance on complex reasoning tasks such as arithmetic and commonsense questions.

Prompt Engineering
LLM
Reasoning
Research Paper
8 authors
AlexNet
Dec 3, 2012

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky et al.

The AlexNet paper, which kickstarted the deep learning revolution by winning the 2012 ImageNet competition (ILSVRC-2012) by a wide margin.

AlexNet
CNN
Computer Vision
Research Paper
3 authors