Skip to content

A simple R script for extracting tabular data from Oryx' excellent post detailing materiel lost by all sides in the [Russian invasion of Ukraine](https://www.oryxspioenkop.com/2022/02/attack-on-europe-documenting-equipment.html).

License

Notifications You must be signed in to change notification settings

leedrake5/scrape_oryx

 
 

Repository files navigation

Scrape Oryx

About

This is a simple R script designed to scrape data from Oryx' excellent post detailing materiel lost by all sides in the Russian invasion of Ukraine and to output it into csv format.

To-Do

  • Explore XPATH's ability to extract data between HTML tags: The Country column is currently blank, making attribution impossible. I'm not super familiar with xpath, so I've yet to work out how it to extract elements under each country header and conditionally assign the country name.
  • Data Quality Assurance using the Totals provided at the start of each list item.

About

A simple R script for extracting tabular data from Oryx' excellent post detailing materiel lost by all sides in the [Russian invasion of Ukraine](https://www.oryxspioenkop.com/2022/02/attack-on-europe-documenting-equipment.html).

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • R 100.0%