Skip to content

katzdaniel/Scoutsman

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Scoutsmen

A web scraping project.

Aim

The aim of this project was to scrape data that is essential during the COVID pandemic. To accomplish this we targeted the prices of products essential during COVID (Masks, Sanitizer, etc) and culture jobs that are being lost. We then wanted to provide the data and an interface to access it.

Tech

These are the main tools we used during this project:

  • Requests: Python library for http requests.
  • Lxml: Python library for parsing html.
  • Selenium Webdriver: Python library for scripting a web browser. It is useful for bypassing anti-botting measures.
  • AWS Lambda and S3: Cloud services that we used for running scrapers and storing data. S3 was also used for hosting the static websites.
Links

Mask Price Website

Job Website

Job Data Set

Credits

Creating during CodeLabs 2020

By: Nicholas Contreras, Daniel Katz, and Joseph Ngo

Mentor: Quentin Geddes

Logo

About

A COVID product web scraping project.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published