This is a curated list of ChatGPT-related papers, organized into the sections below. Any feedback is welcome.
- Survey paper
- Instruction tuning
- Reinforcement learning from human feedback
- Reinforcement learning with verifiable rewards
- Reinforcement learning without verifiable rewards
- Evaluation
- Large Language Model
- External tools
- Agent
- MoE/Routing
- Technical report of open/proprietary model
- Misc.

## Instruction tuning
- Finetuned Language Models Are Zero-Shot Learners
- Scaling Instruction-Finetuned Language Models
- Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
- Self-Instruct: Aligning Language Models with Self-Generated Instructions [github]
- Stanford Alpaca: An Instruction-following LLaMA Model [github]
- Dolly: Democratizing the magic of ChatGPT with open models [blog] [blog]
- Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90% ChatGPT Quality [github] [website]
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions [github]
- Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
- LIMA: Less Is More for Alignment
- Enhancing Chat Language Models by Scaling High-quality Instructional Conversations [github]
- How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources [github]
- Faith and Fate: Limits of Transformers on Compositionality
- SAIL: Search-Augmented Instruction Learning
- The False Promise of Imitating Proprietary LLMs
- Instruction Mining: High-Quality Instruction Data Selection for Large Language Models
- SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF (EMNLP2023 Findings)
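
Most of the instruction-tuning papers above share the same basic recipe: supervised fine-tuning (SFT) on (instruction, response) pairs, with the loss computed only over the response tokens. Below is a minimal PyTorch sketch of that loss masking; the shapes and the toy inputs are illustrative, not taken from any specific paper.

```python
import torch
import torch.nn.functional as F

def sft_loss(logits, input_ids, prompt_len):
    """Cross-entropy over response tokens only, Alpaca-style SFT.

    logits:     (seq_len, vocab_size) model outputs for one example
    input_ids:  (seq_len,) prompt tokens followed by response tokens
    prompt_len: number of prompt tokens to exclude from the loss
    """
    # Shift so position t predicts token t+1.
    shift_logits = logits[:-1]
    shift_labels = input_ids[1:].clone()
    # Mask prompt positions; -100 is ignored by cross_entropy.
    shift_labels[: prompt_len - 1] = -100
    return F.cross_entropy(shift_logits, shift_labels, ignore_index=-100)

# Toy usage with random tensors standing in for a real model.
vocab, seq = 100, 12
logits = torch.randn(seq, vocab)
ids = torch.randint(0, vocab, (seq,))
print(sft_loss(logits, ids, prompt_len=5))
```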

## Reinforcement learning from human feedback
- Fine-Tuning Language Models from Human Preferences [github] [blog]
- Training language models to follow instructions with human feedback [github] [blog]
- WebGPT: Browser-assisted question-answering with human feedback [blog]
- Improving alignment of dialogue agents via targeted human judgements
- Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
- OpenAssistant Conversations -- Democratizing Large Language Model Alignment [github]
- Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
- Preference Ranking Optimization for Human Alignment
- Training Language Models with Language Feedback (ACL2022 WS)
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model
- Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
- HybridFlow: A Flexible and Efficient RLHF Framework
- A General Theoretical Paradigm to Understand Learning from Human Preferences
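
DPO (above) replaces the RL step of RLHF with a classification-style loss over preference pairs. A minimal sketch of the DPO objective, assuming per-sequence log-probabilities have already been computed for the policy and a frozen reference model:

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Direct Preference Optimization loss (Rafailov et al., 2023).

    Each argument is a tensor of summed log-probs log pi(y|x) for a
    batch of preference pairs; beta sets the implicit KL penalty.
    """
    chosen_ratio = policy_chosen_logp - ref_chosen_logp
    rejected_ratio = policy_rejected_logp - ref_rejected_logp
    # Maximize the margin between chosen and rejected implicit rewards.
    return -F.logsigmoid(beta * (chosen_ratio - rejected_ratio)).mean()

# Toy usage with made-up log-probabilities.
t = torch.tensor
print(dpo_loss(t([-5.0]), t([-9.0]), t([-6.0]), t([-8.0])))
```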

## Reinforcement learning with verifiable rewards
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
- Group Sequence Policy Optimization
- Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback
- Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning
- Absolute Zero: Reinforced Self-play Reasoning with Zero Data
- Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers
- Reinforcement Learning for Reasoning in Large Language Models with One Training Example
- ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
- Reasoning Gym: Reasoning Environments for Reinforcement Learning with Verifiable Rewards
- Magistral
- R-Zero: Self-Evolving Reasoning LLM from Zero Data
- Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning
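
Several entries in this section (DeepSeekMath, DeepSeek-R1, GSPO, and their descendants) build on GRPO, which drops the value network and instead normalizes verifiable rewards within a group of samples for the same prompt. A sketch of the group-relative advantage computation under the common formulation; the group size and rewards below are illustrative:

```python
import torch

def grpo_advantages(rewards, eps=1e-6):
    """Group-relative advantages as in GRPO (DeepSeekMath).

    rewards: (group_size,) verifiable rewards (e.g. 1.0 if the final
             answer checks out, else 0.0) for G rollouts of one prompt.
    """
    mean, std = rewards.mean(), rewards.std()
    # eps guards against zero variance when all rollouts tie.
    return (rewards - mean) / (std + eps)

# 8 rollouts for one math problem: 3 correct, 5 wrong.
rewards = torch.tensor([1.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0, 0.0])
print(grpo_advantages(rewards))  # correct rollouts get positive advantage
```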

## Reinforcement learning without verifiable rewards
- Reinforcing General Reasoning without Verifiers
- Learning to Reason without External Rewards
- Can Large Reasoning Models Self-Train?
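
The verifier-free papers above replace ground-truth checkers with signals derived from the model itself, for example agreement with a majority vote over its own samples. A minimal sketch of that idea; the exact reward designs (confidence, self-certainty, etc.) differ per paper:

```python
from collections import Counter

def majority_vote_rewards(answers):
    """Self-consistency pseudo-reward: 1.0 for samples that agree with
    the most common answer, 0.0 otherwise. A stand-in for the
    paper-specific verifier-free signals, which differ in detail."""
    majority, _ = Counter(answers).most_common(1)[0]
    return [1.0 if a == majority else 0.0 for a in answers]

print(majority_vote_rewards(["42", "42", "17", "42"]))  # [1.0, 1.0, 0.0, 1.0]
```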

## Evaluation
- How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection
- Is ChatGPT a General-Purpose Natural Language Processing Task Solver?
- A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity
- Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents
- Is ChatGPT a Good Causal Reasoner? A Comprehensive Evaluation
- Is ChatGPT a Good Recommender? A Preliminary Study
- Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness
- Semantic Compression With Large Language Models
- Human-like Summarization Evaluation with ChatGPT
- Sentence Simplification via Large Language Models
- Capabilities of GPT-4 on Medical Challenge Problems
- Do Multilingual Language Models Think Better in English?
- ChatGPT or Grammarly? Evaluating ChatGPT on Grammatical Error Correction Benchmark
- ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks
- Open-Source Large Language Models Outperform Crowd Workers and Approach ChatGPT in Text-Annotation Tasks
- Can ChatGPT Reproduce Human-Generated Labels? A Study of Social Computing Tasks
- Artificial Artificial Artificial Intelligence: Crowd Workers Widely Use Large Language Models for Text Production Tasks
- Is GPT-3 a Good Data Annotator? (ACL2023)
- Measuring Massive Multitask Language Understanding (ICLR2021)
- MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark
- Are We Done with MMLU? (NAACL2025)
- Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation (ACL2025)
- VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models
- CMMLU: Measuring massive multitask language understanding in Chinese
- HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models
- KMMLU: Measuring Massive Multitask Language Understanding in Korean (NAACL2025)
- From KMMLU-Redux to KMMLU-Pro: A Professional Korean Benchmark Suite for LLM Evaluation
- Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU (EMNLP2023)
- Typhoon: Thai Large Language Models
- ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic (ACL2024 Findings)
- Khayyam Challenge (PersianMMLU): Is Your LLM Truly Wise to The Persian Language?
- IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models
- TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish (EMNLP2024 Findings)
- Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
- Holistic Evaluation of Language Models
- AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models
- Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
- MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
- The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models (NAACL2025)
- GPQA: A Graduate-Level Google-Proof Q&A Benchmark
- SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines
- LiveBench: A Challenging, Contamination-Limited LLM Benchmark
- Measuring short-form factuality in large language models
- Humanity's Last Exam
- MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs
- Evaluating Large Language Models Trained on Code
- HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization (LREC-COLING2024)
- Program Synthesis with Large Language Models
- MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation
- SWE-bench: Can Language Models Resolve Real-World GitHub Issues? (ICLR2024)
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving
- LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code
- OJBench: A Competition Level Code Benchmark For Large Language Models
- Measuring Mathematical Problem Solving With the MATH Dataset
- Training Verifiers to Solve Math Word Problems
- Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
- MathArena: Evaluating LLMs on Uncontaminated Math Competitions
- Instruction-Following Evaluation for Large Language Models
- Can Large Language Models Understand Real-World Complex Instructions?
- FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models (ACL2024)
- Generalizing Verifiable Instruction Following
- CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models (ACL2024)
- Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following
- MaXIFE: Multilingual and Cross-lingual Instruction Following Evaluation (ACL2025)
- RULER: What's the Real Context Size of Your Long-Context Language Models? (COLM2024)
- LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks (ACL2025)
- One ruler to measure them all: Benchmarking multilingual long-context language models
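
For the code benchmarks above (HumanEval and its descendants), results are usually reported as pass@k, estimated with the unbiased formula from "Evaluating Large Language Models Trained on Code": generate n samples per problem, count the c that pass the tests, and compute 1 - C(n-c, k)/C(n, k) in a numerically stable product form:

```python
import numpy as np

def pass_at_k(n, c, k):
    """Unbiased pass@k estimator from the HumanEval paper.

    n: total samples generated per problem
    c: samples that passed the unit tests
    k: evaluation budget being scored
    """
    if n - c < k:
        return 1.0
    # 1 - C(n-c, k) / C(n, k), expanded as a stable running product.
    return 1.0 - np.prod(1.0 - k / np.arange(n - c + 1, n + 1))

print(pass_at_k(n=200, c=10, k=1))   # 0.05
print(pass_at_k(n=200, c=10, k=10))  # well above 0.05
```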

## External tools
- Toolformer: Language Models Can Teach Themselves to Use Tools
- Large Language Models as Tool Makers
- CREATOR: Disentangling Abstract and Concrete Reasonings of Large Language Models through Tool Creation
- ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
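
The tool-use papers above all reduce to the same control loop: the model emits a structured tool call, the runtime executes it, and the observation is appended to the context for the next step. A minimal, framework-free sketch; the tool registry and the JSON call format are illustrative, not any specific paper's API:

```python
import json

# Hypothetical tool registry; real systems (e.g. ToolLLM) expose
# thousands of APIs, each with a schema.
TOOLS = {
    "calculator": lambda expr: str(eval(expr, {"__builtins__": {}})),
}

def run_tool_call(model_output):
    """Parse a JSON tool call emitted by the model and execute it."""
    call = json.loads(model_output)
    result = TOOLS[call["tool"]](call["input"])
    # This string would be appended to the prompt before the next step.
    return f"Observation: {result}"

print(run_tool_call('{"tool": "calculator", "input": "(3 + 4) * 2"}'))
```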

## Agent
- A Survey on Large Language Model based Autonomous Agents
- The Rise and Potential of Large Language Model Based Agents: A Survey
- Large Language Model Agent: A Survey on Methodology, Applications and Challenges
- A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
- A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
- Survey on Evaluation of LLM-based Agents
- RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents

## MoE/Routing
- Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models
- Mixtral of Experts
- Knowledge Fusion of Large Language Models (ICLR2024)
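
Mixtral (above) routes each token through 2 of 8 feed-forward experts, weighting their outputs by a softmax over the selected gate logits. A minimal sketch of that top-k routing step, with toy dimensions in place of real model sizes:

```python
import torch
import torch.nn.functional as F

def top2_moe(x, gate, experts, k=2):
    """Sparse MoE layer in the style of Mixtral: per token, pick the
    top-k experts and mix their outputs with renormalized gate weights.

    x:       (tokens, d_model)
    gate:    linear map from d_model to n_experts
    experts: list of n_experts feed-forward modules
    """
    logits = gate(x)                              # (T, E)
    weights, idx = torch.topk(logits, k, dim=-1)  # (T, k)
    weights = F.softmax(weights, dim=-1)          # softmax over top-k only
    out = torch.zeros_like(x)
    for slot in range(k):
        for e, expert in enumerate(experts):
            mask = idx[:, slot] == e
            if mask.any():
                out[mask] += weights[mask, slot, None] * expert(x[mask])
    return out

# Toy usage: 4 tokens, d_model=8, 4 experts.
d, E = 8, 4
gate = torch.nn.Linear(d, E)
experts = [torch.nn.Linear(d, d) for _ in range(E)]
print(top2_moe(torch.randn(4, d), gate, experts).shape)  # torch.Size([4, 8])
```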

## Technical report of open/proprietary model
- LLaMA: Open and Efficient Foundation Language Models
- Llama 2: Open Foundation and Fine-Tuned Chat Models
- The Llama 3 Herd of Models
- Qwen Technical Report
- Qwen2.5 Technical Report
- Nemotron-4 15B Technical Report
- Nemotron-4 340B Technical Report
- PaLM 2 Technical Report
- Kimi k1.5: Scaling Reinforcement Learning with LLMs
- Kimi K2: Open Agentic Intelligence
- Hunyuan-A13B Technical Report
- ERNIE 4.5 Technical Report
- Estimating Worst-Case Frontier Risks of Open-Weight LLMs
- GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models