Bring cog demo up to date with rewritten llava code in upstream #4

Draft
wants to merge 63 commits into
base: main

@jd7h (Member) commented Apr 2, 2024

Tested with cog predict
Not yet tested with cog train

ChunyuanLI and others added 30 commits November 10, 2023 06:22
- this allows it to e.g. take a ``cache_dir`` argument
…oading

🩹 make ``load_pretrained_model`` accept kwargs
…-name-fix

Revert "use images instead of image to match model.forward kwarg"
Extention -> Extension
T-VQA should be TextVQA to avoid confusion.

(The first result for T-VQA is [TVQA](https://arxiv.org/abs/1809.01696), not TextVQA.)
Pass image_size to model.generate() in cli.py
[Fix] Add image_sizes for cli.py for LLaVA-v1.6
@jd7h jd7h marked this pull request as draft April 2, 2024 15:14
9 participants