Accompanying blog post
python3 -m venv venv
source venv/Scripts/activate   # Windows (Git Bash); on Linux/macOS use: source venv/bin/activate
pip install -r requirements.txt
mkdir driver profile_html scraped_data clean_data
Download the Selenium Chrome webdriver (ChromeDriver) that matches your installed Chrome version and store it in the "driver" folder.
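Before moving on, it can help to confirm that the folders and the driver are in place. The snippet below is a minimal sketch, not part of this repository: `check_setup` is a hypothetical helper, and it assumes the driver file name starts with "chromedriver" (for example "chromedriver" or "chromedriver.exe").

```python
from pathlib import Path

# Folder names taken from the mkdir step above.
REQUIRED_DIRS = ("driver", "profile_html", "scraped_data", "clean_data")


def check_setup(root="."):
    """Return a list of human-readable problems found under root."""
    root = Path(root)
    problems = [
        f"missing folder: {name}"
        for name in REQUIRED_DIRS
        if not (root / name).is_dir()
    ]
    driver_dir = root / "driver"
    # Assumption: the downloaded driver file name starts with "chromedriver".
    if driver_dir.is_dir() and not any(
        p.is_file() and p.name.lower().startswith("chromedriver")
        for p in driver_dir.iterdir()
    ):
        problems.append("no chromedriver file found in driver/")
    return problems


if __name__ == "__main__":
    for problem in check_setup():
        print(problem)
```

Run it from the project root; no output means nothing was flagged.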
- Rename "sample.ini" to "config.ini"
- Update username and password config variables with your LinkedIn profile credentials
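To check that "config.ini" is readable before starting the scraper, something like the following could be used. This is a sketch under assumptions: the section name "linkedin" and the helper `load_credentials` are hypothetical, so substitute the section header that "sample.ini" actually contains. Interpolation is turned off so that a password containing "%" is read literally.

```python
import configparser


def load_credentials(path="config.ini", section="linkedin"):
    """Read username and password from an INI file.

    The default section name "linkedin" is an assumption, not taken
    from sample.ini; pass the real section name if it differs.
    """
    parser = configparser.ConfigParser(interpolation=None)
    # read() returns the list of files it managed to parse.
    if not parser.read(path):
        raise FileNotFoundError(f"could not read {path}")
    username = parser.get(section, "username")
    password = parser.get(section, "password")
    if not username or not password:
        raise ValueError("username and password must not be empty")
    return username, password
```

A missing section or key raises a `configparser` error, which points directly at the part of the file that needs fixing.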
cd scraper
python3 fetch_profile_urls.py
python3 download_profile_pages.py
After this step, you should have the HTML profile pages downloaded in the "profile_html" directory.
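A quick way to confirm the download worked is to count the saved pages and flag any empty files. The helper below is a hypothetical sketch; it assumes the pages are saved with an ".html" extension, which may not match what the download script writes.

```python
from pathlib import Path


def summarize_downloads(folder="profile_html"):
    """Return (number of .html files, names of empty files) for folder."""
    # Assumption: downloaded pages use the ".html" extension.
    pages = sorted(Path(folder).glob("*.html"))
    empty = [p.name for p in pages if p.stat().st_size == 0]
    return len(pages), empty


if __name__ == "__main__":
    count, empty = summarize_downloads()
    print(f"{count} pages downloaded, {len(empty)} empty")
```

The folder path is relative to the current working directory, so adjust it (for example to "../profile_html") if you run this from inside "scraper".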
python3 scrap_profile_data.py
For data cleaning and preparation, I used the "data-preparation.ipynb" Jupyter notebook.
You can find the Tableau workbook here