1.2.5 - fix scheduler bug
What's Changed
Fix bug that was causing requests to get matched to the wrong model instance.
- fix(scheduler): reinstate model filter temporarily to ensure running models only get what they ask for by @philwinder in #533
- Make context length smaller on 70b model so it fits in 48G by @lukemarsden in #534
Full Changelog: 1.2.4...1.2.5