This repository has been archived by the owner on Feb 21, 2021. It is now read-only.

Open access data repository for institutional/news media tweet dataset in the time of COVID-19 pandemic


narcisoyu/Institutional-and-news-media-tweet-dataset-for-COVID-19-social-science-research


Institutional-and-news-media-tweet-dataset-for-COVID-19-social-science-research

Open access data repository for institutional/news media tweet dataset in the time of COVID-19 pandemic

A preprint with detailed information is available at: https://arxiv.org/abs/2004.01791


#IMPORTANT INFORMATION

Because Twitter now provides a new academic API that gives access to full historical data, this dataset has not been updated since Feb 20, 2021.

Thank you very much for your interest in this small project.


#UPDATE EVERY THURSDAY

News media and government/international organization tweets across different countries (e.g., US, UK, China, Spain, France, Germany). Feel free to share this repo.

Data were collected using the Twitter REST API.

The first data collection was on March 12, 2020 (updated on my PC every week). In that first run I collected the most recent 3,200 tweets (the official limit) from each target account; since then I have updated the data weekly.

##V1.46 Last update: from Feb 11 to Feb 17

##V1.45 update data from Feb 04 to Feb 10

##V1.44 update data from Jan 28 to Feb 03

  • @GiuseppeconteIT (has resigned) and @socialstyrelsen tweeted 0 messages.
  • I will no longer update eu_leadership starting next week.

##V1.43 update data from Jan 21 to Jan 27

##V1.42 update data from Jan 14 to Jan 20

  • @socialstyrelsen tweeted 0 messages

##V1.41 update data from Jan 7 to Jan 13

  • election_us has been removed from my tracking list

##V1.40 update data from Dec 31 to Jan 6 (2021)

  • @socialstyrelsen tweeted 0 messages

##V1.39 update data from Dec 24 to Dec 30

  • @Itamaraty_EN tweeted 0 messages

##V1.38 update data from Dec 17 to Dec 23

Merry Xmas

##V1.37 update data from Dec 10 to Dec 16

  • Due to their low tweeting frequency, @BrazilGovNews and @French_Gov have been removed from my tracking list.

##V1.36 update data from Dec 3 to Dec 9

  • @BrazilGovNews and @French_Gov tweeted 0 messages

##V1.35 update data from Nov 26 to Dec 2

  • @BrazilGovNews and @French_Gov tweeted 0 messages

##V1.34 update data from Nov 19 to Nov 25

  • @BrazilGovNews and @French_Gov tweeted 0 messages

##V1.33 update data from Nov 12 to Nov 18

  • @BrazilGovNews and @French_Gov tweeted 0 messages

##V1.32 update data from Nov 5 to Nov 11

  • @BrazilGovNews and @French_Gov tweeted 0 messages

##V1.31 update data from Oct 29 to Nov 4

  • @BrazilGovNews and @French_Gov tweeted 0 messages

##V1.30 update data from Oct 22 to Oct 28

  • @BrazilGovNews and @French_Gov tweeted 0 messages

##V1.29 update data from Oct 15 to Oct 21

  • @BrazilGovNews and @French_Gov tweeted 0 messages

##V1.28 update data from Oct 8 to Oct 14

  • @BrazilGovNews and @French_Gov tweeted 0 messages

##V1.27 update data from Oct 1 to Oct 7

  • @BrazilGovNews, @socialstyrelsen and @French_Gov tweeted 0 messages

##V1.26 update data from Sep 24 to Sep 30

  • @BrazilGovNews, @Itamaraty_EN and @French_Gov tweeted 0 messages

##V1.25 update data from Sep 17 to Sep 23

  • @BrazilGovNews and @French_Gov tweeted 0 messages

##V1.24 update data from Sep 10 to Sep 16

  • @BrazilGovNews, @Itamaraty_EN, @SwedishPM and @French_Gov tweeted 0 messages

##V1.23 update data from Sep 3 to Sep 9

  • @BrazilGovNews, @socialstyrelsen and @French_Gov tweeted 0 messages
  • @foreignoffice will be removed starting with the next update

##V1.22 update data from Aug 27 to Sep 2

  • @BrazilGovNews, @Itamaraty_EN, @SwedishPM, @French_Gov and @foreignoffice tweeted 0 messages
  • The Twitter account @foreignoffice appears to have run into a problem, and its tweets are no longer publicly available.

##V1.21 update data from Aug 20 to Aug 26

  • @BrazilGovNews, @socialstyrelsen and @French_Gov tweeted 0 messages

##V1.20 update data from Aug 13 to Aug 19

  • @BrazilGovNews, @Itamaraty_EN, @socialstyrelsen and @French_Gov tweeted 0 messages

##Extra update 1

  • example_doc_classifier.R is the example script (for election_us) that I used to subset all the collected data

##V1.19 update data from Aug 6 to Aug 12

  • @BrazilGovNews, @socialstyrelsen and @French_Gov tweeted 0 messages

##V1.18 update data from Jul 30 to Aug 5

  • @BrazilGovNews and @French_Gov tweeted 0 messages

##V1.17 update data from Jul 23 to Jul 29

  • @BrazilGovNews, @French_Gov and @SwedishPM tweeted 0 messages

##V1.16: update data from Jul 16 to Jul 22

  • @BrazilGovNews, @Itamaraty_EN, @French_Gov and @socialstyrelsen tweeted 0 messages

##V1.15: update data from Jul 9 to Jul 15

  • @BrazilGovNews and @French_Gov tweeted 0 messages

##V1.14: update data from Jul 2 to Jul 8

  • @BrazilGovNews, @Itamaraty_EN, @French_Gov, @socialstyrelsen and @SwedishPM tweeted 0 messages

##V1.13: update data from Jun 25 to Jul 1

  • Newly added: two Italian news media accounts, @LaStampa and @Corriere
  • @BrazilGovNews and @French_Gov tweeted 0 messages

##V1.12: update data from Jun 18 to Jun 24

  • Newly added: SE_tweet_id (Swedish government, PM and news media tweets)
  • Attention: During 0618-0624 @BrazilGovNews tweeted 0 messages
  • Attention: During 0618-0624 @French_Gov tweeted 0 messages

##V1.11: update data from Jun 11 to Jun 17

  • Newly added: TR_tweet_id (Turkish government, president and news media tweets)
  • Attention: During 0611-0617 @BrazilGovNews tweeted 0 messages
  • Attention: During 0611-0617 @Itamaraty_EN tweeted 0 messages
  • Attention: During 0611-0617 @French_Gov tweeted 0 messages

##V1.10: update data from Jun 4 to Jun 10

  • Attention: During 0604-0610 @BrazilGovNews tweeted 0 messages
  • Attention: During 0604-0610 @Itamaraty_EN tweeted 0 messages
  • Attention: During 0604-0610 @French_Gov tweeted 0 messages

##V1.09: update data from May 28 to Jun 3

  • Attention: During 0528-0603 @BrazilGovNews tweeted 0 messages

##V1.08: update data from May 21 to May 27

  • Attention: During 0521-0527 @BrazilGovNews tweeted 0 messages

##V1.07: update data from May 14 to May 20

  • Attention: During 0514-0520 @BrazilGovNews tweeted 0 messages

##V1.06: update data from May 7 to May 13.

  • Attention: During 0507-0513 @BrazilGovNews tweeted 0 messages
  • Attention: During 0507-0513 @French_Gov tweeted 0 messages

##V1.05: update data from April 30 to May 6.

  • Attention: During 0430-0506 @BrazilGovNews tweeted 0 messages

##V1.04: update data from April 23 to April 29.

  • Attention: During 0423-0429 @BrazilGovNews tweeted 0 messages
  • Attention: During 0423-0429 @French_Gov tweeted 0 messages

##V1.03: update data from April 16 to April 22.

  • Newly added: BR_tweets (Brazilian government, president and news media)
  • Attention: During 0416-0422 @French_Gov tweeted 0 messages
  • Attention: During 0416-0422 @BorisJohnson tweeted 0 messages

##V1.02: update data from April 9 to April 15.

  • Newly added: EU_leadership (@BorisJohnson, @EmmanuelMacron, @GiuseppeconteIT, @sanchezcastejon)
  • Newly added: election_us (@BernieSanders, @JoeBiden, @realDonaldTrump, @POTUS)
  • Newly added: national_gov_foreign_office (you can see this as a major update to the previous gov file, which includes 14 European/US/Chinese government/foreign office accounts)
  • Minor change: @globaltimesnews moved from ADDITIONAL_news_tweet_id to CHINA_news_tweet_id.
  • Minor change: @spiegelonline stopped tweeting on 20200108 and was removed from my collection query; its tweet IDs were saved in V1.0.

##V1.01: update data from April 2 to April 8.

##First online: April 2, 2020


IMPORTANT:

Data were crawled by Twitter account user name (the same as the txt file name). Some of the accounts may have gone unmaintained for a long time (for example, @SanidadPublicaEs stopped tweeting in 2014 but was reactivated when COVID-19 became a global crisis).

I did NOT remove the historical data from before the coronavirus outbreak. If you have any questions, please contact me (see email below).
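
Because the historical tweets are kept, you may want to drop IDs that predate the outbreak before hydrating. The following is a minimal sketch, not part of this repo. It assumes the tweet IDs are Snowflake IDs (the scheme Twitter has used since late 2010), which encode the creation time in their upper bits. The function names and the cutoff date are hypothetical; choose a cutoff that fits your own research question.

```python
from datetime import datetime, timedelta, timezone

# Twitter's Snowflake epoch in milliseconds (2010-11-04T01:42:54.657Z).
TWITTER_EPOCH_MS = 1288834974657
UNIX_EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def tweet_id_to_datetime(tweet_id: int) -> datetime:
    """Return the UTC creation time encoded in a Snowflake tweet ID."""
    # The lower 22 bits hold worker and sequence numbers; the rest is a
    # millisecond offset from the Snowflake epoch.
    timestamp_ms = (tweet_id >> 22) + TWITTER_EPOCH_MS
    return UNIX_EPOCH + timedelta(milliseconds=timestamp_ms)


def keep_ids_since(tweet_ids, cutoff: datetime):
    """Keep only the IDs whose encoded timestamp is at or after `cutoff`."""
    return [i for i in tweet_ids if tweet_id_to_datetime(i) >= cutoff]
```

For example, `keep_ids_since(ids, datetime(2020, 1, 1, tzinfo=timezone.utc))` would keep only tweets posted in 2020 or later. IDs issued before the Snowflake scheme do not encode a timestamp, so this approach would not work for very old tweets.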


How to Hydrate

Two recommended tools: Hydrator https://github.com/DocNow/hydrator

or twarc https://github.com/DocNow/twarc

Please follow the instructions provided by each tool.
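
If you want to prepare the ID files yourself before hydrating, the sketch below cleans a list of tweet IDs and splits it into batches. It is an illustration under stated assumptions, not the procedure used to build this dataset: it assumes each txt file holds one numeric tweet ID per line (check the actual files first), and it uses a batch size of 100, which is the per-request limit of Twitter's v1.1 statuses/lookup endpoint. The function names and the file name in the usage note are hypothetical.

```python
from typing import Iterable, Iterator, List


def clean_tweet_ids(lines: Iterable[str]) -> List[str]:
    """Strip whitespace, then drop blank, non-numeric and duplicate IDs.

    The original order of first occurrences is preserved.
    """
    seen = set()
    ids = []
    for line in lines:
        tweet_id = line.strip()
        if tweet_id.isdigit() and tweet_id not in seen:
            seen.add(tweet_id)
            ids.append(tweet_id)
    return ids


def batched(ids: List[str], size: int = 100) -> Iterator[List[str]]:
    """Yield consecutive slices of at most `size` IDs."""
    for start in range(0, len(ids), size):
        yield ids[start:start + size]
```

A typical use would be `with open("SomeAccount.txt") as f: ids = clean_tweet_ids(f)`, followed by a loop over `batched(ids)`. Manual batching is only needed if you call the API directly rather than through the tools above.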


Papers that have mentioned/used this dataset

  • 吉田光男. (2020). COVID-19 流行下におけるソーシャルメディア—日本での状況と研究動向・公開データセット—. 人工知能, 35(5), 644-653.
  • Liang, S., Wong, D. F., & Zhang, Y. (2020, October). 新型冠状病毒肺炎相关的推特主题与情感研究 (Exploring COVID-19-related Twitter Topic Dynamics across Countries). In Proceedings of the 19th Chinese National Conference on Computational Linguistics (pp. 707-718).
  • Shuja, J., Alanazi, E., Alasmary, W., & Alashaikh, A. (2020). Covid-19 open source data sets: A comprehensive survey. medRxiv.
  • Yu, J., Lu, Y., & Muñoz-Justicia, J. (2020). Analyzing Spanish News Frames on Twitter during COVID-19—A Network Study of El País and El Mundo. International Journal of Environmental Research and Public Health, 17(15), 5414.

Contact me

Jingyuan Yu

narcisoyu[at]gmail[dot]com


License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
