llm-gen-ai

Model quantization

llama-7b.py covers how to run the llama-2-7b-hf model in CPU-based machines and how to apply quantization configurations to CPU machines.
Note: Please clone the repo https://huggingface.co/meta-llama/Llama-2-7b-hf/tree/main to Llama-2-7b-hf folder.
Hugging face docs - https://huggingface.co/docs/transformers/index
Model used for the experiment - https://huggingface.co/meta-llama/Llama-2-7b-hf
Model related information - https://huggingface.co/meta-llama/Llama-2-7b-hf

Prompt_Guardrails_Classifier.ipynb Jupyter Notebook covers the implementation of prompt guardrails.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.gitignore		.gitignore
LICENSE		LICENSE
Prompt_Guardrails_Classifier.ipynb		Prompt_Guardrails_Classifier.ipynb
README.md		README.md
llama-7b.py		llama-7b.py
requirements.txt		requirements.txt