AI Discovery Boards

Showing 12 of 32 research papers

Transformer

Jun 12, 2017

Attention Is All You Need

Ashish Vaswani et al.

Introduced the Transformer, an architecture built entirely on attention mechanisms without recurrence or convolution; it is the foundation for most modern large language models.

Transformer

NLP

Architecture

Research Paper

8 authors

GANs

Jun 10, 2014

Generative Adversarial Nets

Ian J. Goodfellow et al.

Proposed the GAN framework, which trains a generator against a discriminator in an adversarial game, leading to breakthroughs in image generation.

GANs

Generative Models

Computer Vision

Research Paper

8 authors

ResNet

Dec 10, 2015

Deep Residual Learning for Image Recognition

Kaiming He et al.

Introduced residual networks (ResNets), whose skip connections enable the training of much deeper neural networks than previously possible.

ResNet

Computer Vision

Deep Learning

Research Paper

4 authors

BERT

Oct 11, 2018

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin et al.

Introduced BERT, a language representation model that captures the full context of a word by conditioning on the text both before and after it.

BERT

NLP

Language Model

Research Paper

4 authors

GPT-3

May 28, 2020

Language Models are Few-Shot Learners

Tom B. Brown et al.

Introduced GPT-3, a 175-billion-parameter model, and demonstrated that large language models can perform a variety of tasks from a few in-context examples, without fine-tuning.

GPT-3

LLM

Few-Shot Learning

Research Paper

31 authors

AlphaGo

Jan 27, 2016

Mastering the game of Go with deep neural networks and tree search

David Silver et al.

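Detailed the AlphaGo system, the first program to defeat a professional Go player (European champion Fan Hui) without handicap on a full-sized board; it went on to defeat world champion Lee Sedol in March 2016, a landmark achievement for AI.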

AlphaGo

Reinforcement Learning

Game AI

Research Paper

20 authors

Diffusion Models

Jun 16, 2020

Denoising Diffusion Probabilistic Models

Jonathan Ho et al.

A foundational paper on diffusion models, which have become state-of-the-art for high-quality image generation.

Diffusion Models

Generative Models

Image Generation

Research Paper

3 authors

CLIP

Feb 26, 2021

Learning Transferable Visual Models From Natural Language Supervision

Alec Radford et al.

Introduced CLIP, a model that learns visual concepts from natural language, enabling powerful zero-shot image classification.

CLIP

Multimodal AI

Computer Vision

Research Paper

12 authors

Vision Transformer

Oct 22, 2020

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Alexey Dosovitskiy et al.

Applied the Transformer architecture directly to sequences of image patches, challenging the dominance of CNNs in computer vision.

Vision Transformer

ViT

Computer Vision

Research Paper

12 authors

DQN

Feb 25, 2015

Human-level control through deep reinforcement learning

Volodymyr Mnih et al.

Introduced the Deep Q-Network (DQN), an agent that learned to play Atari 2600 games directly from raw pixel data, reaching human-level performance on many of them.

DQN

Reinforcement Learning

Deep Learning

Research Paper

19 authors

Prompt Engineering

Jan 28, 2022

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Jason Wei et al.

Showed that prompting LLMs to generate a series of intermediate reasoning steps improves their performance on complex tasks.

Prompt Engineering

LLM

Reasoning

Research Paper

9 authors

AlexNet

Dec 3, 2012

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky et al.

Introduced AlexNet, a deep convolutional network that kickstarted the deep learning revolution by winning the 2012 ImageNet competition (ILSVRC-2012) by a wide margin.

AlexNet

CNN

Computer Vision

Research Paper

3 authors
