Bystro Paper Data

This repo contains data pertaining to Bystro's publication.

The preprint may be found @ https://www.biorxiv.org/content/early/2017/08/09/146514

The Bystro command line software and database is available @ https://github.com/akotlar/bystro

Bystro is free to use online, even for large (terabyte-sized) datasets, @ https://bystro.io

Datasets used

  1. 1000G Phase3 Chr1 50K lines (8MB)
  2. 1000G Phase3 Chr1 100K lines (17MB)
  3. 1000G Phase3 Chr1 150K lines (24MB)
  4. 1000G Phase3 Chr1 200K lines (33MB)
  5. 1000G Phase3 Chr1 250K lines (40MB)
  6. 1000G Phase3 Chr1 300K lines (50MB)
  7. 1000G Phase3 Chr1 1M lines (166MB)
  8. 1000G Phase3 Chr1 2M lines (327MB)
  9. 1000G Phase3 Chr1 4M lines (650MB)
  10. 1000G Phase3 Chr1 (1GB)
  11. 1000G Phase3 (14.5GB)
    • Warning: 853GB uncompressed
  12. 1000G Phase1 (129GB)
    • Warning: 890GB uncompressed
  13. Yen et al. 2017 accuracy test data
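
Because the larger archives expand to hundreds of gigabytes, it can help to read them as a stream instead of decompressing them to disk. The following is a minimal sketch, assuming the files are gzip-compressed text (the compression format is not stated here). The helper name `head_lines` and the file name in the usage note are hypothetical and not part of this repo.

```python
import gzip
from itertools import islice


def head_lines(path, n):
    """Return the first n lines of a gzip-compressed text file.

    The file is decompressed on the fly, so only the requested
    lines are ever held in memory. Assumes gzip compression.
    """
    with gzip.open(path, "rt") as handle:
        # islice stops reading after n lines, so the rest of the
        # archive is never decompressed.
        return list(islice(handle, n))
```

For example, `head_lines("chr1.vcf.gz", 50000)` would return the first 50,000 lines of a hypothetical `chr1.vcf.gz` while leaving the archive itself untouched.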

Query Accuracy

Compares Bystro to Perl scripts in matching various annotation features
  1. Results and scripts used: Bystro_query_accuracy_comparisons
  2. Full results + raw annotations: Bystro_query_accuracy_comparisons.tar.gz
    • Warning: 1.4GB

Bystro/GEMINI de novo query comparison

Identifying de novo variants using Bystro, compared with GEMINI
  1. Bystro_GEMINI_denovo_comparison
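
For readers unfamiliar with the term, a de novo variant is one carried by a child but by neither parent. The sketch below illustrates that general definition only; it is not the query used by Bystro or GEMINI in this comparison. The function name `is_de_novo` and the genotype encoding (tuples of allele indices, with 0 as the reference allele and `None` as a missing call) are assumptions made for illustration.

```python
def is_de_novo(child, father, mother):
    """Return True if the child carries a non-reference allele
    that is absent from both parents.

    Each genotype is a tuple of allele indices (0 = reference).
    A missing genotype, or a genotype containing None, makes the
    site uninformative, so the function returns False.
    """
    calls = (child, father, mother)
    if any(g is None or None in g for g in calls):
        return False
    parental_alleles = set(father) | set(mother)
    # De novo: some alternate allele in the child that neither parent has.
    return any(a != 0 and a not in parental_alleles for a in child)
```

Under this encoding, a heterozygous child `(0, 1)` with two homozygous-reference parents `(0, 0)` is flagged, while the same child with one heterozygous parent is not.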