snudm/acl-anthology-structurizer

ACL trend survey

Requirements

  • Python 2.7

  • Install scrapy

      $ pip install scrapy
    

Run

Data crawling

  • Configure the year and journal in crawler/crawler/settings.py. (Crawling proceedings or journals other than ACL has not been tested.)

      $ cd crawler
      $ scrapy crawl acl -o items.csv -t csv
      $ scrapy crawl acl -o items.json -t json
    
  • Be careful when running the crawler twice: the JSON file is appended to rather than overwritten. Delete or rename the existing output file before re-running.
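
One way to avoid the append problem is to remove any stale output file before each crawl. The helper below is only a sketch and is not part of this repository: the function name `remove_stale_output` is hypothetical, and the file names are taken from the commands above.

```python
import os


def remove_stale_output(path):
    """Delete a previous crawl output file if it exists.

    Returns True if a file was removed, False if there was nothing to remove.
    Hypothetical helper, not part of this repository.
    """
    if os.path.exists(path):
        os.remove(path)
        return True
    return False


# Example: call this from the crawler directory before running scrapy, e.g.
#   remove_stale_output("items.json")
#   remove_stale_output("items.csv")
```

Run it from the `crawler` directory before `scrapy crawl acl ...`, so that each crawl starts from an empty output file.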

Calculate frequent authors

$ python count.py
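
The contents of count.py are not shown here, so the following is only a rough sketch of how frequent authors could be counted from the crawl output. It assumes that items.json holds a JSON list of objects and that each object has an `authors` field containing a list of names. Both the field name and the function names are assumptions and may not match the actual script.

```python
import json
from collections import Counter


def load_items(path):
    """Load crawled items from a JSON file (assumed to be a list of dicts)."""
    with open(path) as f:
        return json.load(f)


def top_authors(items, n=10, field="authors"):
    """Return the n most frequent authors as (name, count) pairs.

    `field` is an assumed key; adjust it to match the crawler's item schema.
    """
    counts = Counter()
    for item in items:
        names = item.get(field) or []
        # Tolerate a single name stored as a bare string instead of a list.
        if not isinstance(names, (list, tuple)):
            names = [names]
        for name in names:
            counts[name.strip()] += 1
    return counts.most_common(n)


# Example usage (assumes items.json exists in the current directory):
#   for name, count in top_authors(load_items("items.json")):
#       print("%s\t%d" % (name, count))
```

`Counter.most_common` sorts by count in descending order, so the first entry is the most prolific author in the crawled set.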

Author

License