Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Doc2doc search to improve index performance on hard queries. #417

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

khaiwang
Copy link

  • add compile command in CMAKELSITS
  • data.ipynb is a demo to show the recall distribution of the query set
  • buildtext2image10m.ini is the build config for text2image ood dataset, see https://github.com/harsha-simhadri/big-ann-benchmarks neurips23 ood track for dataset information
  • Set following args to use doc2doc in a build config file (see buildtext2image10m as an example):
    • spreadSearch is the option for doc2doc, spreadSearch = true will enable doc2doc
    • doc2docRounds can decide the max doc2doc search round
    • doc2docResults is the max number of doc used for doc2doc search in each round

* add compile command in CMAKELSITS
* data.ipynb is a demo to show the recall distribution of the query set
* buildtext2image10m.ini is the build config for text2image dataset, see https://github.com/harsha-simhadri/big-ann-benchmarks neurips23 ood track for dataset information
	* spreadSearch is the option for doc2doc, spreadSearch = true will enable doc2doc
	* doc2docRounds can decide the max doc2doc search round
	* doc2docResults is the max number of doc used for doc2doc search in each round
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant