Skip to content

Metadata extraction

Kevina-Zeni edited this page May 13, 2022 · 5 revisions

We host research manuscripts on currently six (6) partner repositories, namely Open Science Framework (OSF), Zenodo, ScienceOpen, PubPub, Qeios, Figshare. Here is how we extract the data:

Extract from OSF

AfricArXiv submissions on OSF: osf.io/preprints/africarxiv/

Extract from Zenodo

AfricArXiv submissions on Zenodo: zenodo.org/communities/africarxiv/

This workflow extracts metadata from our AfricArXiv.org Zenodo community: https://zenodo.org/communities/africarxiv/ for the joint project with Masakhane.io called Decolonise Science: https://www.masakhane.io/ongoing-projects/masakhane-mt-decolonise-science.

Workflow

  • navigate to the Google Colab notebook and run the three code blocks (hover over and hit the 'play' button)
  • click the files tab (folder icon on the left-hand side)
  • download the 'decolsci_zenodo-extract_...' CSV file by clicking the 'three-dot' icon on the right and selecting 'Download'

Previous workflow

  • download the decolsci_zenodo-stats.sh shell script to a linux machine or virtual environment
  • run the script: bash zenodo-community-stats.sh
  • upload the generated .csv file to the GitHub repository

Links of interest

Extract from ScienceOpen

AfricArXiv submissions on ScienceOpen: https://www.scienceopen.com/collection/africarxiv