FreeGenes Auto-annotation scripts

Steps:

1. Fill a PostgreSQL database with translation data. (Note: this can be substituted with a list of IDs and translations; modify json_db.py accordingly.)
2. Download xml-to-json-fast. This program is needed to convert XML files to JSON files.
3. Run get_uniprot.sh (this will take a while). It downloads full UniProt dumps and converts them to JSON.
4. Run json_db.py (this will also take a while). It iterates through the UniProt data dumps and saves potentially relevant annotations.
5. Run update_db.py (this takes a little while). It chooses the proper annotations from the potentially relevant ones and, if nothing is there already, applies them to the Postgres database. This step can be replaced with scripts that write other data types, such as CSV files. The important part is the sorting of potentially relevant annotations.
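
The last two steps can be sketched roughly as follows, using the "list of IDs and translations" and CSV alternatives mentioned above instead of Postgres. This is only an illustration, not the code in json_db.py or update_db.py: the entry fields (accession, sequence, name, description, reviewed), the one-JSON-object-per-line input, and the ranking rule (reviewed entries first, then the longest description) are all assumptions made for the example.

```python
import csv
import io
import json


def collect_candidates(translations, uniprot_lines):
    """Map each gene ID to the UniProt entries whose sequence equals its translation.

    translations: dict of gene ID -> protein sequence (hypothetical input format).
    uniprot_lines: iterable of strings, one JSON object per line (assumed layout).
    """
    # Index gene IDs by protein sequence so each UniProt entry is a single lookup.
    by_sequence = {}
    for gene_id, protein in translations.items():
        by_sequence.setdefault(protein, []).append(gene_id)

    candidates = {gene_id: [] for gene_id in translations}
    for line in uniprot_lines:
        line = line.strip()
        if not line:
            continue
        entry = json.loads(line)
        for gene_id in by_sequence.get(entry["sequence"], []):
            candidates[gene_id].append(entry)
    return candidates


def choose_annotation(entries):
    """Pick one entry: reviewed first, then longest description (placeholder rule)."""
    if not entries:
        return None
    return max(
        entries,
        key=lambda e: (e.get("reviewed", False), len(e.get("description", ""))),
    )


def write_csv(candidates, out):
    """Write the chosen annotation for each gene ID that has at least one candidate."""
    writer = csv.writer(out)
    writer.writerow(["gene_id", "accession", "name", "description"])
    for gene_id in sorted(candidates):
        best = choose_annotation(candidates[gene_id])
        if best is None:
            continue  # no potentially relevant annotation was found
        writer.writerow(
            [gene_id, best["accession"], best["name"], best.get("description", "")]
        )
```

Because the ranking lives in choose_annotation alone, the output target (Postgres, CSV, or something else) can be swapped without touching the sorting of candidates.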

An example Postgres dump (toolkit-2020-09-18.sql) is provided to show the schema.

Data out

Run gene_table_generator.sql on the postgres database (as provided in toolkit-2020-09-18.sql) to get a table with names, descriptions, and references.

biobricks/freegenes_autoannotations
