If anyone is curious, here is my run on the Alpaca dataset using another decoder model (codegen-16B-nl). The dataset doesn't appear to be very diverse; it contains many closely related answers. I don't believe this dataset is capable of generalizing well to new data.
The loss in the original Alpaca training script follows a pattern similar to OPT-IML: loss is computed only on the label (response) tokens, not on the prompt tokens.
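This label-based loss can be sketched as follows (a minimal illustration, not the actual Alpaca code; `build_labels` and the token ids are made up for this example). Prompt tokens are masked with `-100`, the ignore index that PyTorch's cross-entropy loss skips, so only the response contributes to the loss:

```python
# Sketch of label masking for instruction tuning (assumption: this
# mirrors the Alpaca-style setup, but is not the original script).
IGNORE_INDEX = -100  # tokens with this label are skipped by cross-entropy

def build_labels(prompt_ids, response_ids):
    """Concatenate prompt and response token ids, masking the prompt
    portion so loss is computed only on the response (the "label")."""
    input_ids = list(prompt_ids) + list(response_ids)
    labels = [IGNORE_INDEX] * len(prompt_ids) + list(response_ids)
    return input_ids, labels

# Illustrative token ids (hypothetical, not from a real tokenizer).
prompt = [101, 2023, 2003]
response = [1037, 3231, 102]
input_ids, labels = build_labels(prompt, response)
print(input_ids)  # [101, 2023, 2003, 1037, 3231, 102]
print(labels)     # [-100, -100, -100, 1037, 3231, 102]
```

The masked positions still participate in the forward pass (the model attends to the prompt), but gradients only flow from response-token predictions.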
[Loss curve image] My run on codegen-16B-nl
[Loss curve image] Another user's run on LLaMA 7B
Some more discussion: https://twitter.com/abacaj/status/1637310768780648448