[Integration] Upload tutorial for making a bitnet ckpt for vLLM #135

LeiWang1999 · 2024-08-09T09:05:15Z

This pull request updates the integration/BitNet directory in three areas: documentation, model-checkpoint scripts, and quantization utilities. The main changes are new README sections, new scripts for generating model checkpoints, and updated quantization functions.

Documentation Updates:

integration/BitNet/README.md: Added sections for "Latest News" and "Make Checkpoints for vLLM" with instructions and scripts for generating model checkpoints.

Script Enhancements:

integration/BitNet/maint/generate_bitnet_model_bitblas_format.sh: Added a new script for generating BitNet model checkpoints in BitBLAS format.
integration/BitNet/maint/generate_bitnet_model_native_format.sh: Added a new script for generating BitNet model checkpoints in native format.
integration/BitNet/maint/create_bitblas_ckpt.py: Renamed and updated the script to use argparse for handling input arguments.
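
For readers unfamiliar with the pattern, here is a minimal sketch of what argparse-based argument handling in a checkpoint-conversion script can look like. It is not the actual content of create_bitblas_ckpt.py: the flag names `--model_name_or_path` and `--saved_model_path`, the `models/` output directory, and the `_bitblas` suffix are assumptions chosen for illustration.

```python
import argparse
import os


def build_parser() -> argparse.ArgumentParser:
    """Build the CLI parser. All flag names here are hypothetical."""
    parser = argparse.ArgumentParser(
        description="Convert a BitNet checkpoint to another on-disk format (sketch)."
    )
    parser.add_argument(
        "--model_name_or_path",
        type=str,
        required=True,
        help="Model identifier or local directory of the source checkpoint.",
    )
    parser.add_argument(
        "--saved_model_path",
        type=str,
        default=None,
        help="Output directory; derived from the model name when omitted.",
    )
    return parser


def parse_args(argv=None) -> argparse.Namespace:
    """Parse arguments and fill in the default output directory."""
    args = build_parser().parse_args(argv)
    if args.saved_model_path is None:
        # Assumed convention: models/<model basename>_bitblas
        model_dir = os.path.basename(args.model_name_or_path.rstrip("/"))
        args.saved_model_path = os.path.join("models", model_dir + "_bitblas")
    return args
```

Passing `argv` explicitly, as in `parse_args(["--model_name_or_path", "org/some-model"])`, keeps the parser testable without touching `sys.argv`; a real script would call `parse_args()` with no arguments.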

Code Improvements:

integration/BitNet/utils_quant.py: Added @torch.compile decorator to quantization functions and refactored the forward method to use post_quant_process.

Miscellaneous:

integration/BitNet/vllm_workspace/inference_with_compress_format.py: Added a script for inference with compressed format models.
integration/BitNet/vllm_workspace/inference_with_native_format.py: Added a script for inference with native format models.
integration/BitNet/vllm_workspace/utils.py: Added utility functions for comparing model outputs and log probabilities.
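
As an illustration of the kind of check such utilities perform, the sketch below compares generated token ids and per-token log probabilities from two runs (for example a reference model and a converted checkpoint). The function names, the plain-list inputs, and the `atol` tolerance are assumptions for this example, not the API of vllm_workspace/utils.py.

```python
from typing import Optional, Sequence


def first_token_mismatch(reference: Sequence[int],
                         candidate: Sequence[int]) -> Optional[int]:
    """Return the index of the first differing token, or None if identical."""
    for i, (ref_tok, cand_tok) in enumerate(zip(reference, candidate)):
        if ref_tok != cand_tok:
            return i
    if len(reference) != len(candidate):
        # One sequence is a strict prefix of the other.
        return min(len(reference), len(candidate))
    return None


def max_logprob_gap(reference: Sequence[float],
                    candidate: Sequence[float]) -> float:
    """Return the largest absolute per-token log-probability difference."""
    if len(reference) != len(candidate):
        raise ValueError("log-probability sequences must have equal length")
    return max((abs(r - c) for r, c in zip(reference, candidate)), default=0.0)


def outputs_match(ref_ids: Sequence[int], cand_ids: Sequence[int],
                  ref_logprobs: Sequence[float], cand_logprobs: Sequence[float],
                  atol: float = 1e-2) -> bool:
    """True when tokens are identical and log probabilities agree within atol."""
    if first_token_mismatch(ref_ids, cand_ids) is not None:
        return False
    return max_logprob_gap(ref_logprobs, cand_logprobs) <= atol
```

Checking tokens before log probabilities is a deliberate choice in this sketch: quantized kernels can shift log probabilities slightly while still producing the same tokens, so the tolerance only matters once the token sequences already agree.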

LeiWang1999 added 3 commits August 9, 2024 07:55

fix install with absolute path

7aee8f4

efficient inference with torch compile

5cb5349

update vllm ckpt tutorial for bitnet

e73c563

LeiWang1999 merged commit 7c6bccf into microsoft:main Aug 9, 2024
3 of 4 checks passed
