Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update Image Extraction Microservices Description #1378

Open
darshad-github opened this issue Nov 15, 2022 · 1 comment
Open

Update Image Extraction Microservices Description #1378

darshad-github opened this issue Nov 15, 2022 · 1 comment
Assignees
Labels
documentation Improvements or additions to documentation good first issue Good for newcomers

Comments

@darshad-github
Copy link
Contributor

darshad-github commented Nov 15, 2022

SMI Readme - IsIdentifiable and ExtractImages are not displayed in the visualisation.

@darshad-github darshad-github self-assigned this Nov 15, 2022
@rkm
Copy link
Member

rkm commented Nov 30, 2022

Standard anonymisation extraction pipeline

The current extraction services look like:

  • ExtractImages: Reads UIDs from a CSV file. Sends
  • CohortExtractor: Matches DICOM UIDs in the ExtractionRequestMessage to the physical file location(s). Sends
  • CTPAnonymiser: Microservice wrapper for CTP. Anonymises images specified in the input ExtractFileMessage and copies the output to specified directory. Sends
    • ExtractedFileStatusMessage to IsIdentifiable when anonymisation was successful, and to CohortPackager when the anonymisation fails
  • IsIdentifiable: Checks for PII in the file specified by ExtractedFileStatusMessage and sends
  • CohortPackager: Receives ExtractionRequestInfoMessages, ExtractFileCollectionInfoMessages, ExtractedFileStatusMessage, and ExtractedFileVerificationMessages, and stores them in MongoDB. Once it detects all messages for an extraction have been received, it generates a set of extraction reports and marks the job as completed.

Identifiable extraction pipeline

We also have the ability to run an "identifiable" (non anonymised) extraction, which looks similar to above except:

  • FileCopier is used in place of of CTPAnonymiser. This sends
  • IsIdentifiable is not part of the pipeline

Both extractions use RabbitMQ routing keys to direct messages to the appropriate services.

ToDo

Steps to fix this:

  • Update/replace the diagram to include the missing services as above
    • I can't find the original version of the current diagram, so let's create a new one using Draw.io.
  • Update the table with the information above

@rkm rkm added documentation Improvements or additions to documentation good first issue Good for newcomers labels Jan 12, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

2 participants