MartinSeeler/python-data-exploration
Python Data Exploration

This is an example of how to use Python and Jupyter Notebook to explore a given dataset with Pandas and Seaborn.

Requirements

To get started, create a virtual environment and install the requirements in it.

$ python3 -m venv venv
$ source venv/bin/activate
$ pip install pandas seaborn jupyterlab

After that, you can launch your notebook server by running

$ jupyter-lab

You can use your own environment as well; the only hard requirement is Python. I'd recommend installing Anaconda, which ships with most of the common Python data-science libraries.

If you already have Python installed, make sure to install Jupyter Notebook as well as Pandas and Seaborn via pip.

You can see the steps in Exploring Datasets.ipynb. GitHub renders Jupyter notebooks, so you don't have to download it. If you want to try it yourself, feel free to work inside Exploring Datasets Yourself.ipynb.
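
To give a feel for what a first exploration pass looks like, here is a minimal sketch using Pandas, with an optional Seaborn plot. It is not taken from the notebook: the `SAMPLE` rows, the column names, and the helper functions `overview`, `mean_by_group`, and `plot_pairwise` are all made up for illustration, so swap in the real dataset and its columns when you try it.

```python
import pandas as pd

# Hypothetical sample rows; the column names are illustrative only
# and are not the schema used in the notebook.
SAMPLE = pd.DataFrame(
    {
        "age": [22, 27, 32, 37, 42, 27],
        "years_married": [1.5, 4.0, 7.0, 10.0, 15.0, 4.0],
        "children": ["no", "no", "yes", "yes", "yes", "yes"],
        "rating": [5, 4, 4, 3, 2, 5],
    }
)


def overview(df: pd.DataFrame) -> dict:
    """Collect the basic facts usually checked first in an exploration."""
    numeric = df.select_dtypes("number")  # keep only numeric columns
    return {
        "shape": df.shape,                       # (rows, columns)
        "missing": int(df.isna().sum().sum()),   # total count of missing cells
        "numeric_summary": numeric.describe(),   # count, mean, std, quartiles
        "correlations": numeric.corr(),          # pairwise Pearson correlations
    }


def mean_by_group(df: pd.DataFrame, group_col: str, value_col: str) -> pd.Series:
    """Average of one column within each category of another."""
    return df.groupby(group_col)[value_col].mean()


def plot_pairwise(df: pd.DataFrame, hue=None):
    """Scatter-plot matrix of the numeric columns, optionally colored by `hue`."""
    # Imported here so the summary helpers above work without Seaborn installed.
    import seaborn as sns

    return sns.pairplot(df, hue=hue)


if __name__ == "__main__":
    info = overview(SAMPLE)
    print(info["shape"])
    print(info["numeric_summary"])
    print(mean_by_group(SAMPLE, "children", "rating"))
```

In a notebook cell you would typically call `plot_pairwise(SAMPLE, hue="children")` after the summaries, since the plot renders inline there.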

Source

Fair, Ray. 1978. “A Theory of Extramarital Affairs,” Journal of Political Economy, February, 45-61.

The data is available at http://fairmodel.econ.yale.edu/rayfair/pdf/2011b.htm
