Skip to content

Extract structured datasets and buildout the Stanford Open Data Portal

License

Notifications You must be signed in to change notification settings

TheStanfordDaily/open-data-pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

OpenDataPipeline

Extract structured datasets for the Stanford Open Data Project

Use Python 3.

pip3 install awscli boto3
aws configure
# enter access key id and secret access key
pip3 install --upgrade google-api-python-client google-auth-httplib2 google-auth-oauthlib
python3 gspreadsheet.py
# This will query the google sheet API, bring in the file as csv, convert to JSON, and upload to AWS

gspreadsheet.py pulls metada from the google sheet as a csv and converts to JSON. To upload to an S3 bucket in AWS requires configuring S3 with boto3.

About

Extract structured datasets and buildout the Stanford Open Data Portal

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages