Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GOC / GOA Joint Pipeline #93

Open
kltm opened this issue Jul 23, 2024 · 3 comments
Open

GOC / GOA Joint Pipeline #93

kltm opened this issue Jul 23, 2024 · 3 comments
Assignees
Labels

Comments

@kltm
Copy link
Member

kltm commented Jul 23, 2024

Project link

https://github.com/orgs/geneontology/projects/159

Project description

As part of the discussion from the UNC GO meeting, we decided to go ahead with a joint GO/GOA pipeline. The intended division of labor would be:

GO Central produces:

  • Ontology
  • User, groups, dbxrefs, and other metadata
  • Curation tool resources
    • PAINT annotations
    • Noctua-derived annotations (standard and GO-CAM)
    • "Automated upstreams" (MGI)
  • Derived products to drive GO Central interfaces and downloads
    • Solr index, Blazegraph, GO API and required files, etc.

GO Central consumes from GOA:

  • GAFs and GPAD/GPI(?) files from GOA
  • GOA annotation error reports

GOA produces:

  • GAFs and GPAD/GPI(?) files from GOA
  • GOA annotation error reports

GOA consumes from GOC:

  • Ontology
  • Metadata
  • Noctua GPADs
  • PAINT
  • "Automated upstreams" (MGI)

This joint pipeline would solve several ongoing issues that GO faces wrt the pipeline

  • speed and stability of releases
  • detailed error reporting
  • ontology synchronization
  • consistence of data products for the GO community
  • infrastructure expense (leveraging EBI compute)
PI

Chris

Product owner (PO)

Pascale

Technical lead (TL)

Seth/Alex

Other personnel (OP)

Dustin (at need)

Technical specs

Rolling technical discussions w/Alex and Pascale:
https://docs.google.com/document/d/1Jxl2WaOCpxmiuPrTZZB3LFs4lYzXQtjDxqAjvuidxRE/edit
Explanatory slides: (IN PROGRESS)

Other comments

N/A

@kltm kltm added Needs LA approval Needs final approval from the Lead Architect Needs PM approval Needs final approval from the Project Manager Needs tech doc Needs PI Needs PO Needs TL labels Jul 23, 2024
@kltm kltm added Ready and removed Needs LA approval Needs final approval from the Lead Architect Needs PM approval Needs final approval from the Project Manager Needs tech doc Needs PI Needs PO Needs TL labels Jul 23, 2024
@kltm
Copy link
Member Author

kltm commented Jul 23, 2024

Tagging @pgaudet, noting that this not "officially" exists.

@kltm kltm changed the title GO/GOA Joint Pipeline GOC/GOA Joint Pipeline Jul 23, 2024
@kltm kltm changed the title GOC/GOA Joint Pipeline GOC / GOA Joint Pipeline Jul 23, 2024
@kltm
Copy link
Member Author

kltm commented Jul 23, 2024

Items to be added to project at need.

We still need to work out a lot of technical details (e.g. file granularity, downloads, timing and freezes wrt ontology and curation tools for release processing, etc.). Early days.

@kltm
Copy link
Member Author

kltm commented Oct 3, 2024

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: Active
Development

No branches or pull requests

2 participants