Official Pytorch Implementation of Boosting Lossless Speculative Decoding via Feature Sampling and Partial Alignment Distillation
You are encouraged to modify/distribute this code. However, please acknowledge this code and cite the paper appropriately.
@article{gui2024boosting,
title={Boosting Lossless Speculative Decoding via Feature Sampling and Partial Alignment Distillation},
author={Gui, Lujun and Xiao, Bin and Su, Lei and Chen, Weipeng},
journal={arXiv preprint arXiv:2408.15562},
year={2024}
}