Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

adding connectome sideloading #26

Merged
merged 16 commits into from
Mar 21, 2022
Merged

adding connectome sideloading #26

merged 16 commits into from
Mar 21, 2022

Conversation

Robbie1977
Copy link
Contributor

No description provided.

Copy link
Contributor

@hkir-dev hkir-dev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we have connectome_fafb, connectome_l1em and connectome_hemibrain related sparql queries

@dosumis
Copy link
Member

dosumis commented Jan 25, 2022

do we have connectome_fafb, connectome_l1em and connectome_hemibrain related sparql queries

Nope. Probably better to hack in as direct (robot) merge of relevant OWL files (assuming the are accessible from here) in the goal. If you make these into dependencies, this should be outside of the patsub.

@Robbie1977
Copy link
Contributor Author

Ref #24 (comment)

@Robbie1977
Copy link
Contributor Author

I've linked the owl files previously loaded in at collectdata stage. Test still running ATM.

Copy link
Member

@dosumis dosumis left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think your edit will cause the job to fail because it will look for non-existent SPARQL queries to run here:

$(RAW_DUMPS_DIR)/%.ttl:
	curl -G --data-urlencode "query=`cat $(SPARQL_DIR)/construct_$*.sparql`" $(SPARQL_ENDPOINT) -o $@

I assume the connectomics OWL files are no longer loaded into the triplestore. They need to be accessible to this build and the dumps goals need to be edited to merge them.

e.g.

CONNECTOMICS=connectome_fafb.owl connectome_l1em.owl connectome_hemibrain.owl

$(FINAL_DUMPS_DIR)/pdb.owl: $(patsubst %, $(RAW_DUMPS_DIR)/construct_%.owl, $(DUMPS_PDB)) $(RAW_DUMPS_DIR)/inferred_annotation.owl ${CONNECTOMICS}
	$(ROBOT) merge $(patsubst %, -i %, $^) -o $@ $(STDOUT_FILTER)

@hkir-dev
Copy link
Contributor

hkir-dev commented Jan 26, 2022

If current approach doesn't work, I can implement this function both in this branch and generic_pipeline. I can read EXTERNAL_ONTOLOGIES (aka. CONNECTOMICS) from environment variables and directly feed to the robot.

@Robbie1977
Copy link
Contributor Author

Well, we've officially made it slower but as we are now doing the merge twice it's not unexpected but it is running very inefficiently as using only 5% of a single CPU and 138G RAM for the last few days but still seems to be going. Could do with --vvv but I'll give it the weekend to finish.

@dosumis
Copy link
Member

dosumis commented Jan 31, 2022

Well, we've officially made it slower but as we are now doing the merge twice it's not unexpected but it is running very inefficiently as using only 5% of a single CPU and 138G RAM for the last few days but still seems to be going. Could do with --vvv but I'll give it the weekend to finish.

@Robbie1977 - do you have edits outside this PR that pull in the additional files and merge them? Not seeing anything here that would what we planned. This would #26 (review) (see suggested code in second box).

BTW - the merge shouldn't take long. I think it would be best to sprinkle echo statements through the Makefile that print time so that we have a clear idea which steps are slow.

Copy link
Member

@dosumis dosumis left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Might be useful to sprinkle some 'echo date' statements in the build to track how long each stage takes.

@Robbie1977 Robbie1977 merged commit 43ddfcd into vfb_pipeline Mar 21, 2022
@Robbie1977 Robbie1977 deleted the vfb_sideloading branch March 21, 2022 16:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants