-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[MoE/ZeRO] fix .github conflict with main branch. #5827
Commits on May 30, 2024
-
[Fix/Example] Fix Llama Inference Loading Data Type (hpcaitech#5763)
* [fix/example] fix llama inference loading dtype * revise loading dtype of benchmark llama3
Configuration menu - View commit details
-
Copy full SHA for 677cbfa - Browse repository at this point
Copy the full SHA 677cbfaView commit details
Commits on May 31, 2024
-
[release] update version (hpcaitech#5752)
* [release] update version * [devops] update compatibility test * [devops] update compatibility test * [devops] update compatibility test * [devops] update compatibility test * [test] fix ddp plugin test * [test] fix gptj and rpc test * [devops] fix cuda ext compatibility * [inference] fix flash decoding test * [inference] fix flash decoding test
Configuration menu - View commit details
-
Copy full SHA for 68359ed - Browse repository at this point
Copy the full SHA 68359edView commit details
Commits on Jun 3, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 3f2be80 - Browse repository at this point
Copy the full SHA 3f2be80View commit details -
[test] Fix/fix testcase (hpcaitech#5770)
* [fix] branch for fix testcase; * [fix] fix test_analyzer & test_auto_parallel; * [fix] remove local change about moe; * [fix] rm local change moe;
Configuration menu - View commit details
-
Copy full SHA for 1b76564 - Browse repository at this point
Copy the full SHA 1b76564View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4064432 - Browse repository at this point
Copy the full SHA 4064432View commit details
Commits on Jun 4, 2024
-
[CI/tests] simplify some test case to reduce testing time (hpcaitech#…
…5755) * [ci/tests] simplify some test case to reduce testing time * [ci/tests] continue to remove test case to reduce ci time cost * restore some test config * [ci/tests] continue to reduce ci time cost
Configuration menu - View commit details
-
Copy full SHA for e22b827 - Browse repository at this point
Copy the full SHA e22b827View commit details -
[misc] update dockerfile (hpcaitech#5776)
* [misc] update dockerfile * [misc] update dockerfile
Configuration menu - View commit details
-
Copy full SHA for 32f4187 - Browse repository at this point
Copy the full SHA 32f4187View commit details -
Configuration menu - View commit details
-
Copy full SHA for ee6fd38 - Browse repository at this point
Copy the full SHA ee6fd38View commit details
Commits on Jun 5, 2024
-
[Inference]Add Streaming LLM (hpcaitech#5745)
* Add Streaming LLM * add some parameters to llama_generation.py * verify streamingllm config * add test_streamingllm.py * modified according to the opinions of review * add Citation * change _block_tables tolist
Configuration menu - View commit details
-
Copy full SHA for b45000f - Browse repository at this point
Copy the full SHA b45000fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 50b4c8e - Browse repository at this point
Copy the full SHA 50b4c8eView commit details -
[misc] Accelerate CI for zero and dist optim (hpcaitech#5758)
* remove fp16 from lamb * remove d2h copy in checking states --------- Co-authored-by: Edenzzzz <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 79f7a7b - Browse repository at this point
Copy the full SHA 79f7a7bView commit details -
[Test/CI] remove test cases to reduce CI duration (hpcaitech#5753)
* [test] smaller gpt2 test case * [test] reduce test cases: tests/test_zero/test_gemini/test_zeroddp_state_dict.py * [test] reduce test cases: tests/test_zero/test_gemini/test_grad_accum.py * [test] reduce test cases tests/test_zero/test_gemini/test_optim.py * Revert "[test] smaller gpt2 test case" Some tests might depend on the size of model (num of chunks) This reverts commit df705a5. * [test] reduce test cases: tests/test_checkpoint_io/test_gemini_checkpoint_io.py * [CI] smaller test model for two mwo the two modifid cases * [CI] hardcode gpt model for tests/test_zero/test_gemini/test_search.py since we need a fixed answer there
Configuration menu - View commit details
-
Copy full SHA for 80c3c87 - Browse repository at this point
Copy the full SHA 80c3c87View commit details -
[hotfix] fix testcase in test_fx/test_tracer (hpcaitech#5779)
* [fix] branch for fix testcase; * [fix] fix test_analyzer & test_auto_parallel; * [fix] remove local change about moe; * [fix] rm local change moe; * [fix] fix test_deepfm_model & test_dlrf_model; * [fix] fix test_hf_albert & test_hf_gpt;
Configuration menu - View commit details
-
Copy full SHA for 10a19e2 - Browse repository at this point
Copy the full SHA 10a19e2View commit details -
[gemini] optimize reduce scatter d2h copy (hpcaitech#5760)
* [gemini] optimize reduce scatter d2h copy * [fix] fix missing reduce variable * [refactor] remove legacy async reduce scatter code * [gemini] missing sync * Revert "[refactor] remove legacy async reduce scatter code" This reverts commit 58ad76d. * [gemini] further optimize with async all reduce * [fix] pass flag from manager to chunk
Configuration menu - View commit details
-
Copy full SHA for 3f7e313 - Browse repository at this point
Copy the full SHA 3f7e313View commit details -
Allow building cuda extension without a device. (hpcaitech#5535)
Added FORCE_CUDA environment variable support, to enable building extensions where a GPU device is not present but cuda libraries are.
Configuration menu - View commit details
-
Copy full SHA for c46e097 - Browse repository at this point
Copy the full SHA c46e097View commit details -
Configuration menu - View commit details
-
Copy full SHA for b9d646f - Browse repository at this point
Copy the full SHA b9d646fView commit details
Commits on Jun 6, 2024
-
[install]fix setup (hpcaitech#5786)
* fix * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for a1e39f4 - Browse repository at this point
Copy the full SHA a1e39f4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5ead00f - Browse repository at this point
Copy the full SHA 5ead00fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 73e88a5 - Browse repository at this point
Copy the full SHA 73e88a5View commit details
Commits on Jun 7, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 7a7e869 - Browse repository at this point
Copy the full SHA 7a7e869View commit details -
Configuration menu - View commit details
-
Copy full SHA for 929e1e3 - Browse repository at this point
Copy the full SHA 929e1e3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7e65b71 - Browse repository at this point
Copy the full SHA 7e65b71View commit details -
moupdate ci tests, st ci test cases passed, tp failed in generation f…
…or ppo, sp is buggy
Configuration menu - View commit details
-
Copy full SHA for 0b4a335 - Browse repository at this point
Copy the full SHA 0b4a335View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7ae87b3 - Browse repository at this point
Copy the full SHA 7ae87b3View commit details -
Configuration menu - View commit details
-
Copy full SHA for b1031f7 - Browse repository at this point
Copy the full SHA b1031f7View commit details -
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Configuration menu - View commit details
-
Copy full SHA for 1b880ce - Browse repository at this point
Copy the full SHA 1b880ceView commit details -
Configuration menu - View commit details
-
Copy full SHA for b8b5cac - Browse repository at this point
Copy the full SHA b8b5cacView commit details -
Configuration menu - View commit details
-
Copy full SHA for 62eb28b - Browse repository at this point
Copy the full SHA 62eb28bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0bbac15 - Browse repository at this point
Copy the full SHA 0bbac15View commit details -
Configuration menu - View commit details
-
Copy full SHA for bf57b13 - Browse repository at this point
Copy the full SHA bf57b13View commit details -
Configuration menu - View commit details
-
Copy full SHA for 45195ac - Browse repository at this point
Copy the full SHA 45195acView commit details -
Configuration menu - View commit details
-
Copy full SHA for e16ccc2 - Browse repository at this point
Copy the full SHA e16ccc2View commit details -
Configuration menu - View commit details
-
Copy full SHA for ac1520c - Browse repository at this point
Copy the full SHA ac1520cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 790e136 - Browse repository at this point
Copy the full SHA 790e136View commit details -
Refactor modeling by adding attention backend
Signed-off-by: char-1ee <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 04386d9 - Browse repository at this point
Copy the full SHA 04386d9View commit details -
Configuration menu - View commit details
-
Copy full SHA for eec77e5 - Browse repository at this point
Copy the full SHA eec77e5View commit details -
Pass inference model shard configs for module init
Signed-off-by: char-1ee <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5f398fc - Browse repository at this point
Copy the full SHA 5f398fcView commit details -
Configuration menu - View commit details
-
Copy full SHA for ceba662 - Browse repository at this point
Copy the full SHA ceba662View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0d7ff10 - Browse repository at this point
Copy the full SHA 0d7ff10View commit details -
Configuration menu - View commit details
-
Copy full SHA for 77db216 - Browse repository at this point
Copy the full SHA 77db216View commit details -
Remove flash attention backend
Signed-off-by: char-1ee <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f5981e8 - Browse repository at this point
Copy the full SHA f5981e8View commit details
Commits on Jun 10, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2abdede - Browse repository at this point
Copy the full SHA 2abdedeView commit details -
Configuration menu - View commit details
-
Copy full SHA for b303976 - Browse repository at this point
Copy the full SHA b303976View commit details -
Merge pull request hpcaitech#5771 from char-1ee/refactor/modeling
[Inference] Refactor modeling attention layer by abstracting attention backends
Configuration menu - View commit details
-
Copy full SHA for 77a219a - Browse repository at this point
Copy the full SHA 77a219aView commit details
Commits on Jun 11, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 84eab13 - Browse repository at this point
Copy the full SHA 84eab13View commit details -
[Inference]refactor baichuan (hpcaitech#5791)
* refactor baichuan * remove unused code and add TODO for lazyinit
Configuration menu - View commit details
-
Copy full SHA for c0948af - Browse repository at this point
Copy the full SHA c0948afView commit details -
Merge pull request hpcaitech#5759 from hpcaitech/colossalchat_upgrade
[ColossalChat] Colossalchat upgrade
Configuration menu - View commit details
-
Copy full SHA for 74f4a29 - Browse repository at this point
Copy the full SHA 74f4a29View commit details -
Configuration menu - View commit details
-
Copy full SHA for 587bbf4 - Browse repository at this point
Copy the full SHA 587bbf4View commit details -
Configuration menu - View commit details
-
Copy full SHA for aa125bc - Browse repository at this point
Copy the full SHA aa125bcView commit details
Commits on Jun 12, 2024
-
Configuration menu - View commit details
-
Copy full SHA for aac941e - Browse repository at this point
Copy the full SHA aac941eView commit details -
Configuration menu - View commit details
-
Copy full SHA for b6ea9e7 - Browse repository at this point
Copy the full SHA b6ea9e7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 79d63ec - Browse repository at this point
Copy the full SHA 79d63ecView commit details -
[Inference] Fix flash-attn import and add model test (hpcaitech#5794)
* Fix torch int32 dtype Signed-off-by: char-1ee <[email protected]> * Fix flash-attn import Signed-off-by: char-1ee <[email protected]> * Add generalized model test Signed-off-by: char-1ee <[email protected]> * Remove exposed path to model Signed-off-by: char-1ee <[email protected]> * Add default value for use_flash_attn Signed-off-by: char-1ee <[email protected]> * Rename model test Signed-off-by: char-1ee <[email protected]> --------- Signed-off-by: char-1ee <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 8554585 - Browse repository at this point
Copy the full SHA 8554585View commit details -
Configuration menu - View commit details
-
Copy full SHA for ec99700 - Browse repository at this point
Copy the full SHA ec99700View commit details -
[Gemini] Use async stream to prefetch and h2d data moving (hpcaitech#…
…5781) * use async stream to prefetch and h2d data moving * Remove redundant code
Configuration menu - View commit details
-
Copy full SHA for d9dddf5 - Browse repository at this point
Copy the full SHA d9dddf5View commit details
Commits on Jun 13, 2024
-
[gemini] quick fix on possible async operation (hpcaitech#5803)
* [gemini] quick fix on possible async operation * [gemini] quick fix on possible async operation
Configuration menu - View commit details
-
Copy full SHA for 3bcbba9 - Browse repository at this point
Copy the full SHA 3bcbba9View commit details
Commits on Jun 14, 2024
-
[shardformer] upgrade transformers to 4.39.3 (hpcaitech#5815)
* [shardformer]upgrade transformers for gpt2/gptj/whisper (hpcaitech#5807) * [shardformer] fix modeling of gpt2 and gptj * [shardformer] fix whisper modeling * [misc] update requirements --------- Co-authored-by: ver217 <[email protected]> * [shardformer]upgrade transformers for mistral (hpcaitech#5808) * upgrade transformers for mistral * fix * fix * [shardformer]upgrade transformers for llama (hpcaitech#5809) * update transformers fix * fix * fix * [inference] upgrade transformers (hpcaitech#5810) * update transformers fix * fix * fix * fix * fix * [gemini] update transformers for gemini (hpcaitech#5814) --------- Co-authored-by: ver217 <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 2ddf624 - Browse repository at this point
Copy the full SHA 2ddf624View commit details -
Configuration menu - View commit details
-
Copy full SHA for be92747 - Browse repository at this point
Copy the full SHA be92747View commit details -
Merge branches 'feature/moe' and 'feature/moe' of https://github.com/…
…Hz188/ColossalAI into feature/moe
Configuration menu - View commit details
-
Copy full SHA for 76aeec3 - Browse repository at this point
Copy the full SHA 76aeec3View commit details -
update moe hybrid parallel plugin with newest version of zero & fix z…
…ero working/master params bug
Configuration menu - View commit details
-
Copy full SHA for 64fc0f7 - Browse repository at this point
Copy the full SHA 64fc0f7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8b277cc - Browse repository at this point
Copy the full SHA 8b277ccView commit details -
Configuration menu - View commit details
-
Copy full SHA for ed42193 - Browse repository at this point
Copy the full SHA ed42193View commit details -
Configuration menu - View commit details
-
Copy full SHA for 88b78fa - Browse repository at this point
Copy the full SHA 88b78faView commit details
Commits on Jun 17, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 419d25e - Browse repository at this point
Copy the full SHA 419d25eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3364ac9 - Browse repository at this point
Copy the full SHA 3364ac9View commit details -
Configuration menu - View commit details
-
Copy full SHA for f7298bc - Browse repository at this point
Copy the full SHA f7298bcView commit details -
Configuration menu - View commit details
-
Copy full SHA for e6839fb - Browse repository at this point
Copy the full SHA e6839fbView commit details -
Configuration menu - View commit details
-
Copy full SHA for cc9d0bb - Browse repository at this point
Copy the full SHA cc9d0bbView commit details -
Support 4d parallel + flash attention (hpcaitech#5789)
* support tp + sp + pp * remove comments --------- Co-authored-by: Edenzzzz <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 8795bb2 - Browse repository at this point
Copy the full SHA 8795bb2View commit details
Commits on Jun 18, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 1405cf1 - Browse repository at this point
Copy the full SHA 1405cf1View commit details