Support serve + chat + generate with Mixtral teacher model #1002

Conversation
"--model-name", | ||
default="instructlab/merlinite-7b-lab", | ||
show_default=True, | ||
help="model name to use in training", |
chore: Might want to change this to something like "huggingface path to use in training", just so users don't put only `merlinite-7b-lab` instead of `instructlab/merlinite-7b-lab`.
Can we extrapolate any of this data from other args? If not, let's update the help text so it's less confusing for users.
@jaideepr97 @cdoern could you help clarify the intention of the PR with the changes to the title/description you folks mentioned in the call, please? It'll help with the review process, especially grokking the failing test cases.
```python
@@ -302,6 +337,7 @@ def generate_data(
    logger,
    api_base,
    tls_insecure,
    model_family: str,
```
Do we need to introduce a new `model_family`? Can we not interpret the model family from `model_name`?

- Name: `merlinite-4b-asdf.gguf`. Family: merlinite
- Name: `granite-4b-asdf.gguf`. Family: merlinite
- Name: `mixtral-4b-asdf.gguf`. Family: mixtral

This solves for what @mairin commented too, without any additional code changes.
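The inference suggested above could be sketched as follows. This is a hypothetical illustration, not the actual instructlab implementation: the function name, the prefix map, and the default are assumptions, with granite intentionally mapped to the merlinite template per the examples in this thread.

```python
import os

# Assumed prefix-to-family map; granite deliberately maps to merlinite,
# matching the examples given in the review comment above.
FAMILY_PREFIXES = {
    "merlinite": "merlinite",
    "granite": "merlinite",
    "mixtral": "mixtral",
}
DEFAULT_FAMILY = "merlinite"


def infer_model_family(model_name: str) -> str:
    """Guess the template family from a model file name or path."""
    base = os.path.basename(model_name).lower()
    for prefix, family in FAMILY_PREFIXES.items():
        if base.startswith(prefix):
            return family
    # Unknown names fall back to the current hardcoded default
    return DEFAULT_FAMILY
```

With this, `infer_model_family("granite-4b-asdf.gguf")` yields `"merlinite"`, so no extra flag would be needed for the common cases.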
Two workflows to test, need to run an end-to-end test:
- run this with a granite model, ensure that it correctly loads the merlinite template and works correctly
- run this with a mixtral model, ensure that it correctly loads the mixtral template and works correctly
Was there a resolution to this? I'm curious about this point as well
Nah, I was told to ignore this for this PR; it'll be a follow-up 🤷🏽
Looks like @russel has more context and can help move this forward
@anik120 updated the PR title to make `ilab` model agnostic. The changes included in this PR do the following:
```python
@click.option(
    "--model-family",
    default="merlinite",
    help="model family to use when picking a generation template",
)
```
help="model family to use when picking a generation template", | |
help="Model family generation template to serve with, e.g. 'merlinite', 'granite', etc.", |
"--model-name", | ||
default="instructlab/merlinite-7b-lab", | ||
show_default=True, | ||
help="model name to use in training", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we extrapolate any of this data from other args? If not let's update the help text so it's less confusing for users
Both generate and serve already had a flag for the context size, meaning the user can override the default of 4096 in those two scenarios.

Added a new flag called `model-family` to serve, which currently supports `merlinite` or `mixtral`. This is then used to load the proper chat template from a hardcoded map of supported templates. If nothing is specified, we default to the current one hardcoded in instructlab. Any function that can start a server needs this flag as well, which means chat and generate need it too. Added the same flag to generate, since the generate prompt template needs extra padding for mixtral.

Also, added `model-name` to `ilab train`, as linux train had a hardcoded merlinite reference for the model name. Most other "hardcoded" references are really just defaults which can be overridden by the user.

Signed-off-by: Charlie Doern <cdoern@redhat.com>
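The "hardcoded map of supported templates" described in the commit message could look something like this. The template strings and helper name here are illustrative assumptions, not the real instructlab code.

```python
from typing import Optional

# Assumed template strings for illustration only; the real project's
# templates live in its own source, not here.
CHAT_TEMPLATES = {
    "merlinite": "<|system|>\n{system}\n<|user|>\n{user}\n<|assistant|>\n",
    "mixtral": "<s>[INST] {system} {user} [/INST]",
}


def select_chat_template(model_family: Optional[str]) -> str:
    """Pick a chat template by family, defaulting to merlinite."""
    if not model_family:
        # No --model-family given: keep the current default behavior
        return CHAT_TEMPLATES["merlinite"]
    try:
        return CHAT_TEMPLATES[model_family]
    except KeyError:
        raise ValueError(f"unsupported model family: {model_family}") from None
```

Raising on an unknown family (rather than silently defaulting) surfaces typos in the `--model-family` flag early, which matches the "currently supports merlinite or mixtral" constraint.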
I think this is good, and we can continue with follow-up PRs to improve UX, help text, and so forth. I'd like to go ahead and merge, since some others are waiting for the functional part to be merged.
Comments will be addressed in a follow-up.
Should we be using the chat templates as specified in the models' own metadata? This would get us out of having to hardcode the templates at all, and instead we'll just use what the model's metadata says it requires. This would mean expecting models to maintain appropriate chat templates, but I believe that's a pretty common practice.

Definitely sounds like a good follow-up issue to look at! Thanks, @bbrowning!
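The metadata-driven alternative discussed above could be sketched as a simple fallback lookup. GGUF files can carry an embedded template under the `tokenizer.chat_template` metadata key; the metadata dict here is a stand-in for illustration, as real code would read it from the model file with a GGUF parser.

```python
def template_from_metadata(metadata: dict, fallback: str) -> str:
    """Prefer the chat template the model ships in its own metadata.

    Falls back to a hardcoded default only when the model carries none,
    which removes the need to maintain a per-family template map.
    """
    return metadata.get("tokenizer.chat_template", fallback)
```

Usage: `template_from_metadata(loaded_gguf_metadata, CHAT_TEMPLATES_DEFAULT)`, where both arguments are supplied by the caller.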
#973 covers semantic fixes for serve and chat using the wrong model paths/names, and also speaks to hardcoded references to merlinite. #29 is the tracker for actually making sure serve uses the proper template.
resolves #973
resolves #29