Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PAB超参数如何确定 #151

Open
nebuladream opened this issue Jul 29, 2024 · 2 comments
Open

PAB超参数如何确定 #151

nebuladream opened this issue Jul 29, 2024 · 2 comments

Comments

@nebuladream
Copy link

对于PAB的threshold、gap应该如何确定合适的超参数,需要对比不同step的att数值变化么,有什么经验?以及如果使用full attention是否仍然适用?

@nebuladream
Copy link
Author

另外这些模型的threshold都是几百,timestep只有训练的时候才会生效吧

@oahzxl
Copy link
Collaborator

oahzxl commented Jul 29, 2024

  1. the timestep in both training and inference are from 1000 to 0. so the default value is workable for inference.
  2. to determine the hyper param, we will quantilize the difference of attention outputs for adjacent diffusion timesteps. Based on the difference, we can then finetune the threshold and gap by visualizing the results.
  3. suitable for any attention

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants