-
Notifications
You must be signed in to change notification settings - Fork 608
[llava] Use huggingface LLaVA instead of depending on third-party/LLaVa #4687
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. [ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/4687
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit e6d1553 with merge base 2117c1a ( This comment was automatically generated by Dr. CI and updates every 15 minutes. |
Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. ghstack-source-id: 08a04f7 Pull Request resolved: #4687
…d-party/LLaVa" Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. [ghstack-poisoned]
Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. ghstack-source-id: fafdedd Pull Request resolved: #4687
…d-party/LLaVa" Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. [ghstack-poisoned]
Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. ghstack-source-id: 5fa9c37 Pull Request resolved: #4687
…d-party/LLaVa" Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. [ghstack-poisoned]
Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. ghstack-source-id: 3c11fa5 Pull Request resolved: #4687
@larryliu0820 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
…d-party/LLaVa" Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. Differential Revision: [D61200610](https://our.internmc.facebook.com/intern/diff/D61200610) [ghstack-poisoned]
Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. ghstack-source-id: 93b881f Pull Request resolved: #4687
@larryliu0820 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
…d-party/LLaVa" Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. Differential Revision: [D61200610](https://our.internmc.facebook.com/intern/diff/D61200610) [ghstack-poisoned]
Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. ghstack-source-id: 4714a93 Pull Request resolved: #4687
# Newer transformer (4.38) will give TypeError: LlavaLlamaForCausalLM.forward() got an unexpected keyword argument 'cache_position' | ||
pip install timm==0.6.13 | ||
pip install transformers==4.37.2 | ||
pip install transformers |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should we also run
bash ./install_requirements
and
bash examples/models/llama2/install_requirements.sh
as well
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think user should call it outside
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I see, the ./install_requirements makes sense, but at least for the llama2, I don't think its immediately obvious that you have to install the requirements for llama2 in order for llava to work.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@larryliu0820 great to me we are moving to the HF model. I think we should have no blocker to clean up all the ad-hoc setup requirements #4320?
@@ -204,28 +204,14 @@ jobs: | |||
conda activate "${CONDA_ENV}" | |||
|
|||
PYTHON_EXECUTABLE=python bash .ci/scripts/setup-linux.sh "cmake" | |||
|
|||
# install pybind | |||
bash install_requirements.sh --pybind xnnpack | |||
|
|||
# install Llava requirements | |||
bash examples/models/llama2/install_requirements.sh | |||
bash examples/models/llava/install_requirements.sh | |||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why don't we run export_llava.py anymore? looks like test_llava still requires the model to have already been exported?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The last test in test_llava.py exports and tests llava
…d-party/LLaVa" Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. Differential Revision: [D61200610](https://our.internmc.facebook.com/intern/diff/D61200610) [ghstack-poisoned]
Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. ghstack-source-id: a3e4c25 Pull Request resolved: #4687
…d-party/LLaVa" Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. Differential Revision: [D61200610](https://our.internmc.facebook.com/intern/diff/D61200610) [ghstack-poisoned]
Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. ghstack-source-id: 432b90e Pull Request resolved: #4687
…d-party/LLaVa" Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. Differential Revision: [D61200610](https://our.internmc.facebook.com/intern/diff/D61200610) [ghstack-poisoned]
Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. ghstack-source-id: 61ec08e Pull Request resolved: #4687
…d-party/LLaVa" Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. Differential Revision: [D61200610](https://our.internmc.facebook.com/intern/diff/D61200610) [ghstack-poisoned]
Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. ghstack-source-id: 02cc0b1 Pull Request resolved: #4687
@larryliu0820 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
…d-party/LLaVa" Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. Differential Revision: [D61200610](https://our.internmc.facebook.com/intern/diff/D61200610) [ghstack-poisoned]
Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. ghstack-source-id: 9fb3fff Pull Request resolved: #4687
@larryliu0820 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
…d-party/LLaVa" Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. Differential Revision: [D61200610](https://our.internmc.facebook.com/intern/diff/D61200610) [ghstack-poisoned]
Currently we depend on third-party/LLaVA for llava model definition. This is hard to use because we have to pull LLaVA in as a git submodule and install from there. It also breaks a lot of dependency assumptions. This PR removes `third-party/LLaVA`, in favor of huggingface llava model definition. ghstack-source-id: 0f3f7a1 Pull Request resolved: #4687
@larryliu0820 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
Differential Revision: D61200610 Pull Request resolved: pytorch#4687
Stack from ghstack (oldest at bottom):
Currently we depend on third-party/LLaVA for llava model definition.
This is hard to use because we have to pull LLaVA in as a git submodule
and install from there. It also breaks a lot of dependency assumptions.
This PR removes
third-party/LLaVA
, in favor of huggingface llava modeldefinition.
Differential Revision: D61200610