Skip to content

Latest commit

 

History

History
executable file
·
30 lines (24 loc) · 1.07 KB

README.md

File metadata and controls

executable file
·
30 lines (24 loc) · 1.07 KB

hkaqhi codecov

hkaqhi

hkaqhi scrapes the past 24 hours of Air Quality and pollutant concentration data from Hong Kong Air Quality Health Index (AQHI) website by Environment Protection Department (EPD).

Installation

Cloning the project and setup a python virtual enviornment to install required package.

$ git clone [email protected]:sammyfung/hkaqhi.git
$ cd hkaqhi
$ python3 -m venv venv
$ source venv/bin/activate  
$ pip install -r requirements.txt   

Quickstart

To crawl past 24 hours of AQHI and append to an output file in json format:

$ scrapy crawl aqhi24 -o aqhi24.json  

To crawl past 24 hours of detailed pollutant concentration data and append to an output file in json format:

$ scrapy crawl pollutant24 -o pollutant24.json