Release Notes
Changelog
- c415087 chore: bump version: 0.37.0-rc4 -> 0.37.0
- 736fba6 docs: add release notes for 0.37.0 (#9995)
- 73dee98 docs: fix broken links (#9996)
- ecf8ac7 chore: bump version: 0.37.0-rc3 -> 0.37.0-rc4
- 1b50305 fix: fix default id search for runs (#9988)
- 0990c11 chore: bump version: 0.37.0-rc2 -> 0.37.0-rc3
- a78b190 fix: fix hf on_save raise exception (#9977)
- 0560939 fix: bring in handleEmptyCell from #9963 (#9984)
- 7caf18a chore: bump version: 0.37.0-rc1 -> 0.37.0-rc2
- 08d782a fix: show search progress in run table (#9976)
- 478c78f fix: Cluster page height (#9975)
- 2772a3c fix: correct
dataPath
for hyperparameters (#9971) - 94f2d95 chore: bump version: 0.37.0-rc0 -> 0.37.0-rc1
- 63e7df0 chore: 0.37.0 environment images (#9967)
- b2267d1 chore: bump version: 0.37.0-dev0 -> 0.37.0-rc0
- f758303 chore: lock published urls to preserve redirects
- 2a8e7dd chore: lock api state for backward compatibility check
- 3f54d07 chore: bump version: 0.36.1-dev0 -> 0.37.0-dev0
- baf451f chore: do not log error for resource pools with zero agents (#9960)
- 6a8606e docs: Add hpc installation guide (#9945)
- 3241edb fix: fix flaky generic task pause test (#9962)
- 43556e9 fix: Remove CSS rule for hiding the Form.Item error message (#9872)
- 5906001 perf: improve the initial page load speed (#9939)
- eb1b0de docs: Add workload alerting (#9938)
- cedfcfe chore: refactor and test RBAC config policies work [CM-530] (#9943)
- 2d884b9 docs: Add cluster overview (#9936)
- e17d12c feat: release notes and improvements for workload alerting (#9944)
- 0db2e3b ci: deflake make slurmcluster, hopefully (#9957)
- 95f079d feat: add GET global config policies API (#9952)
- d943d85 chore: fix global PUT for task config policies (#9941)
- 410edf6 fix: broken MNIST download in e2e tests (#9937)
- 004c194 ci: fix flaky test_allocation_csv tests (#9953)
- 88a4c67 feat: add Config Policies GET API and modify CRUD functions to accept both Workload types (#9946)
- a73c8db test: debug auth [TESTENG-95] (#9942)
- 13db674 test: experiment list show archived filter [ET-753] (#9932)
- 02e302f chore: remove unused languages from code editor (#9898)
- f6d874d docs: Replace slack links (#9919)
- 26b0954 chore: implement Delete config policies API handlers (#9927)
- 2d12be1 test: add projects tests [CM-467] (#9928)
- 062cb52 fix: use different modules for Trial and Cluster topology (#9917)
- 0928958 chore: change log level for log retention policies (#9935)
- b559467 chore: bump coverage target (#9920)
- 3a2ea56 fix: do not filter slots for mixed-slot-type pools (#9902)
- a58ed7c chore: reassign RM code to CM in CODEOWNERS (#9926)
- cb3515e fix: update LogRetentionDays from master config when master starts/upgrades (#9930)
- 13b7b3f ci: increase timeout for k8s intg tests (#9929)
- 6f36969 fix: flaky workspace test (#9931)
- 867eb31 fix: update huggingface example (#9925)
- 5b2275f fix: Refactor sorting logic in WorkspaceProjects for filtering projects (#9903)
- fd7f77a fix: move validation dataloader check in PyTorchTrial [MD-515] (#9923)
- db2881f chore: fix config policy unmarshal tests (#9924)
- 3900742 chore: update test log pattern webhook cache (#9922)
- f44687d chore: create config policies table and add NTSC CRUD operations (#9915)
- de89f68 feat: support updating web hook url [MD-482] (#9890)
- 02fbdbb fix: huggingface callback raise process preempted exception (#9913)
- 8c799b8 chore: prune cruft out of no_op fixture (#9912)
- 11de119 chore(deps): bump path-to-regexp and express in /webui/react (#9909)
- 03961b5 test: add workspace tests (#9905)
- c877383 fix: GetTrialRemainingLogRetentionDays should take global log retention days into account [CM-518] (#9914)
- fb0d5f9 fix: change workspace name and set resource quota simultaneously (#9847)
- 8fb9f6b docs: Update ROCM support (#9893)
- 481bddb chore(deps): bump github.com/docker/docker from 24.0.9+incompatible to 25.0.6+incompatible (#9780)
- c1499ac chore: removing model_hub references from Makefile (#9901)
- c961dbd feat: new run object for Run Centric API (#9897)
- bfeb418 feat: Implement custom trigger for webhooks (#9879)
- b6eb05e chore: Remove model hub (#9869)
- 4a28c10 chore: add unmarshal functions for task config policies (#9896)
- d842383 fix: timezone handling error in queued allocation time update (#9892)
- 55b3f9b test: cover project id filtering on bulk actions [ET-138] (#9870)
- 036477b chore: stub new APIs for task config policies [CM-485] (#9880)
- be2622a test: Delete workspace after webhook test (#9891)
- a30bc25 feat: Add rbac for config policies (#9873)
- 8c83d31 chore: create WorkloadType enum and Go config + constraints structs (#9885)
- 0a18c5a fix: add backwards compatibility for Pods to Jobs for k8s <v1.27 [CM-461] (#9878)
- 8e6bba8 ci: fix master-config syntax (#9889)
- d5d647a fix: inconsistent timezone handling in daily allocation aggregation (#9888)
- b4209ef test: login redirect with nested route (#9881)
- 8cacba6 ci: add e2e bulk kill test (#9868)
- 590c362 fix: Hf callback metric naming (#9887)
- 61fd26b fix: reset Model Registry page number on pageload [ET-640] (#9876)
- ce27f81 fix: show
-
for empty data in run table (#9871) - b1c0814 fix: prevent
hyperparameter search modal
submitting the same request multiple times (#9883) - d54713c fix: use new ruamel yaml APIs (#9882)
- ad5fe5a fix: prevent out of bounds navigation on new list views (#9875)
- a605f00 fix: reject reconnecting agents with different resource pool configuration (#9815)
- db92bad feat: Support RBAC in webhook (#9859)
- 0ef81aa fix: sorting by arbitrary metadata (#9874)
- c1b7767 feat: Auto-Populate POSIX Information on sign in using SSO [CM-399] (#9755)
- 54b6165 feat: Logic of different modes for webhook (#9865)
- a773551 fix: allow for objects inside array metadata to be typed properly (#9864)
- ee269c8 test: successful login with weak or strong password (#9858)
- e21fc6f ci: pin chromadb version to avoid incompatibility (#9849)
- a1234a1 chore: bump version: 0.36.0-dev0 -> 0.36.1-dev0
- d79c90d chore: add docs dropdown link for new version
- ce6da74 docs: add release notes for 0.36.0 (#9854)
- a55af74 fix: use task sessions in Core API [MD-509] (#9860)
- 3ee88bb fix: replace tree with code mirror for metadata view (#9853)
- 8dd46d5 chore: Improve CompareTrials perfomance (#9807)
- 6e08303 fix: fix error toast popping up in Workpace Creator view (#9855)
- fb95df8 chore: add backport github action (#9835)
- a37e6e7 fix: prevent loading issues with ipynb files (#9850)
- 9de4f72 feat: configurable preemption timeout [MD-500] (#9833)
- 640126b feat: Add workspaceId, mode, name to webhook (#9820)
- d436c23 fix: reset pinned column state when resetting columns (#9852)
- 3a91552 fix: fix fallback logic for partially provided custom logos (#9842)
- 707ad07 Revert "chore: add tracing info to some backend APIs" (#9843)
- 73a756a fix: update broken tensorflow & certbot links (#9846)
- 771bbe4 ci: sequential metric count sweep test [Scale-35] (#9791)
- 32fafdd perf: remove duplicate ids in
ExpMetricNames
api (#9848) - a8fa015 docs: Fix broken links (#9845)
- 2b1856a fix: model version name overflow on mobile [ET-384] (#9827)
- e13de20 docs: Document rbac editorprojectrestricted role (#9844)
- 2838af4 chore: add tracing info to some backend APIs (#9841)
- e3dfb0a fix: change filter form to say "Show runs" in flat runs view [ET-740] (#9840)
- 52f2b9f chore: add release notes for PR 9822 (#9837)
- a37d482 fix: experiment single trial tabs don't scroll on load (#9831)
- aff486c feat: Rocm bumpenvs (#9830)
- 13622ad feat: Add
report_progress
toTrainContext
(#9826) - d831461 fix: replace rawsource attribute with node directly, due to removal of rawsource in Docutil 2.0 (#9838)
- 7ed9e83 feat: add EOL notice regarding Aurora V1 & Postgres 12 along with Master Log warnings for Postgres <=12 [CM-413] [CM-416] (#9832)
- 5c5f107 docs: Minor docs enhancements (#9836)