Web Scraper of Hong Kong Air Quality AQHI and Pollutant data

sammyfung/hkaqhi

hkaqhi

hkaqhi scrapes the past 24 hours of air quality and pollutant concentration data from the Hong Kong Air Quality Health Index (AQHI) website run by the Environmental Protection Department (EPD).

Installation

Clone the project, then set up a Python virtual environment and install the required packages.

$ git clone git@github.com:sammyfung/hkaqhi.git
$ cd hkaqhi
$ python3 -m venv venv
$ source venv/bin/activate  
$ pip install -r requirements.txt   

Quickstart

To crawl the past 24 hours of AQHI readings and append them to an output file in JSON format:

$ scrapy crawl aqhi24 -o aqhi24.json  

To crawl the past 24 hours of detailed pollutant concentration data and append it to an output file in JSON format:

$ scrapy crawl pollutant24 -o pollutant24.json
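
Because `-o` appends, running the same crawl more than once leaves several JSON arrays back to back in one file, which `json.load` rejects. The following is a minimal sketch, not part of hkaqhi, of one way to read such a file in Python. The helper name `load_scrapy_json` is hypothetical, and the sketch makes no assumption about the item fields the spiders emit.

```python
import json
from pathlib import Path


def load_scrapy_json(path):
    """Return every item from a JSON feed file, even after several appended runs.

    Hypothetical helper for illustration; it is not provided by hkaqhi or Scrapy.
    """
    text = Path(path).read_text(encoding="utf-8")
    decoder = json.JSONDecoder()
    items = []
    pos = 0
    while pos < len(text):
        # raw_decode does not accept leading whitespace, so skip it by hand.
        if text[pos].isspace():
            pos += 1
            continue
        # Decode one JSON document starting at pos and get the index just past it.
        value, pos = decoder.raw_decode(text, pos)
        # Each crawl writes one array; a bare object is kept as a single item.
        if isinstance(value, list):
            items.extend(value)
        else:
            items.append(value)
    return items


if __name__ == "__main__":
    feed = Path("aqhi24.json")  # output file name from the Quickstart command
    if feed.exists():
        records = load_scrapy_json(feed)
        print(f"{len(records)} records loaded")
        if records and isinstance(records[0], dict):
            # Show the field names rather than assuming them.
            print("fields:", sorted(records[0].keys()))
```

If the file should hold only the latest crawl, Scrapy 2.1 and later also accept `-O` (capital letter) in place of `-o` to overwrite the output file rather than append to it.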
