Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Include examples of a full ETL pipeline #1

Open
trosel opened this issue Jul 16, 2018 · 1 comment
Open

Include examples of a full ETL pipeline #1

trosel opened this issue Jul 16, 2018 · 1 comment

Comments

@trosel
Copy link

trosel commented Jul 16, 2018

This may be outside the scope of the booklet, but there is a lack of information on how to obtain data from APIs and then format that JSON data into dataframes, and then send that off somewhere.

For example, most APIs for getting data would require some kind of oauth1 or oauth2 passed in the client.

And these APIs would most certainly give you JSON which you would need to serialize into Objects or straight into the Dataframe object.

Because in order to visualize data, we need to get it first :)

And then what do you do with the dataframe or visualization afterwards? Do you export the dataframe into a CSV and email that? Do you convert the visualization into a PDF and email that?

@olekscode
Copy link
Member

olekscode commented Aug 17, 2018

Thank you for the issue and interesting questions. I will try to cover these topics in the booklet.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants