Infodemiology

User-generated health data from internet search and social media to inform public health and public policy responses

Internet Search Data

Search Datasets from Google & Bing


Bing Coronavirus search query set

🌎🔗 https://github.com/microsoft/BingCoronavirusQuerySet
🗣 [email protected]
This dataset was curated from the Bing search logs (desktop users only) for the period from Jan 1st, 2020 to April 18th, 2020. Only searches that were issued many times by multiple users were included. The dataset includes queries from all over the world that had an intent related to the Coronavirus or Covid-19. In some cases this intent is explicit in the query itself, e.g. “Coronavirus updates Seattle”; in other cases it is implicit, e.g. “Shelter in place”. The implicit intent of search queries (e.g. “Toilet paper”) was extracted using the random-walks-on-the-click-graph approach outlined in this paper by Microsoft Research. All personal data was removed.
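The explicit/implicit distinction described above can be used to split the queries into two groups. The following is a minimal sketch, not the dataset authors' own tooling. It assumes the data is tab-separated with columns named `Date`, `Query`, `IsImplicitIntent`, `Country`, and `PopularityScore`; those names, and the sample rows, are assumptions to check against the repository README.

```python
import csv
import io

# Hypothetical sample rows. The column names are an assumption about the
# dataset's TSV layout and should be verified against the repository README.
SAMPLE_TSV = (
    "Date\tQuery\tIsImplicitIntent\tCountry\tPopularityScore\n"
    "2020-03-20\tcoronavirus updates seattle\tFalse\tUnited States\t80\n"
    "2020-03-20\tshelter in place\tTrue\tUnited States\t65\n"
    "2020-03-21\ttoilet paper\tTrue\tUnited Kingdom\t40\n"
)


def split_by_intent(tsv_text):
    """Return (explicit, implicit) lists of query strings."""
    explicit, implicit = [], []
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    for row in reader:
        # Treat the flag as a case-insensitive "true"/"false" string.
        is_implicit = row["IsImplicitIntent"].strip().lower() == "true"
        (implicit if is_implicit else explicit).append(row["Query"])
    return explicit, implicit


explicit_queries, implicit_queries = split_by_intent(SAMPLE_TSV)
print("explicit:", explicit_queries)
print("implicit:", implicit_queries)
```

To run this on a real file from the repository, replace `io.StringIO(tsv_text)` with an opened file handle.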


Coronavirus searches by date/lang/country from AnswerThePublic

🌎🔗 https://docs.google.com/spreadsheets/d/1roQeJkDBH2d2UGV85HzwAmxOHwehV2te4X7YzRcWqDI
🗣 [email protected]
Search term variation data on Coronavirus (i.e., what people google when they google “coronavirus”).
Geographical scope: various languages and countries (including Italy, Netherlands, France, Germany, Belgium, US, Mexico, Brazil, Colombia, Norway, Switzerland, UK).
Data has been collected on a near-daily basis since 20/01/2020.

Coronavirus search term variations and volume - various countries

🌎🔗 https://docs.google.com/spreadsheets/d/1elbDJU7WPPv5s2-rS8K9M53Og-gvCm5pV2kN87BLxpk/edit?usp=sharing
🗣 https://twitter.com/samgilb
Long lists of Coronavirus keywords with estimated search volumes for 9 countries, sourced from the search engine marketing tool SEMrush. Covers UK, Italy, India, USA, Australia, France, Germany, Spain, Brazil. As at 14 March 2020.

Country-specific search datasets


Madagascar
Coronavirus search term variation data in French for Madagascar from Answer The Public (as at 6 April 2020):
MG 🔗 https://docs.google.com/spreadsheets/d/1DBcm9Y6ZCOoh61RNeCjROpXTJ76RiFWh_cVABTaDRe8/edit#gid=0

United Kingdom
🇬🇧🔗 https://docs.google.com/spreadsheets/d/1rWdx7ITPyAA5497T18TI1BL5Gxz-Oi7aVAVrE8f50sU/edit?usp=sharing
Search term variation data from Hitwise on Coronavirus (as at 25 Jan 2020), that is, what people google when they google “Coronavirus”.

United Kingdom - Counties, Cities, Towns
🇬🇧🔗 https://docs.google.com/spreadsheets/d/19Sc2M3zP-OwQjdwefTvW54vwsIJ0JpSAYL_cCYD1DTE/edit?usp=sharing
Search term variation data for the UK by County, City & Town, extracted from the Bing Coronavirus Search Query Set. April 2020.

Academic Papers


"The use of google trends to investigate the loss of smell related searches during COVID‐19 outbreak"

🔗 https://onlinelibrary.wiley.com/doi/abs/10.1002/alr.22580
🗣 [email protected]

"Tracking COVID-19 using online search" (Pre-print working paper)

🔗 https://github.com/vlampos/covid-19-online-search/blob/master/covid-19-online-search.pdf
🗣 [email protected]

"Tracking COVID-19 in Europe: Infodemiology Approach"

🔗 https://publichealth.jmir.org/2020/2/e18941/
🗣 [email protected]

"How to Best Predict the Daily Number of New Infections of COVID-19" (Pre-print)

🔗 https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3571252
🗣 [email protected]

"The Second Worldwide Wave of Interest in Coronavirus since the COVID-19 Outbreaks in South Korea, Italy and Iran: A Google Trends Study" (Brain, Behavior, and Immunity)

🔗 https://www.sciencedirect.com/science/article/pii/S0889159120305511
🗣 https://twitter.com/arturstrzelecki

"Online Information Search During COVID-19" (Pre-print)

🔗 https://arxiv.org/pdf/2004.07183v1.pdf
🗣 [email protected]

"In Crisis, We Pray: Religiosity and the COVID-19 Pandemic" (Pre-print)

🔗 https://www.dropbox.com/s/jc8vcx8qqdb84gn/Bentzen_religiosity_covid.pdf?dl=0
🗣 https://twitter.com/JeanetBentzen

"Internet Search Patterns Reveal Clinical Course of Disease Progression for COVID-19 and Predict Pandemic Spread in 32 Countries" (Pre-print)

🔗 https://www.medrxiv.org/content/10.1101/2020.05.01.20087858v1
🗣 [email protected]; [email protected]

"Evidence from internet search data shows information-seeking responses to news of local COVID-19 cases"

🔗 https://www.pnas.org/content/early/2020/05/01/2005335117
🗣 [email protected]; [email protected]

Tools


Google Trends

🔗 https://trends.google.com/trends/
To request Google Trends API access:
🔗 https://docs.google.com/forms/d/e/1FAIpQLSenHdGiGl1YF-7rVDDmmulN8R-ra9MnGLLs7gIIaAX9VHPdPg/viewform
For general guidance on using Google Trends in academic research:
🔗 http://people.ischool.berkeley.edu/~hal/Papers/2015/primer.pdf

Answer The Public

🔗 https://answerthepublic.com/
🗣 https://twitter.com/answerthepublic
Instant, raw search insights on any keyword, compiled from Google autocomplete suggestions, with CSV download.

Visualizations

Anosmia Searches

🔗 USA: https://www.berlinquette.com/anosmia-searches
🔗 Tanzania: https://www.berlinquette.com/anosmia-google-searches-tanzania
🗣 [email protected]
Interactive visualization of Google searches for anosmia / loss of smell in the most populated cities in the US and Tanzania. Built in Google Data Studio, so the data is downloadable as CSV.

Coronasearch Live

🔗 https://www.coronasearch.live/
🗣 https://twitter.com/WarmSpeakers
Currently, people in China use VPNs (Virtual Private Networks) to bypass censorship and Google information about the Coronavirus (COVID-19). This website compiles these Google searches, word for word, in real time. Built in Google Data Studio, so the data is downloadable as CSV.

Should I Suggestions

🔗 https://public.flourish.studio/visualisation/1512938/
🗣 https://twitter.com/ColeyBird
Evolution of the UK public's concerns over time during 2020, as revealed by searches beginning "Should I..."

Blogs & Journalism

"How can Google search help with the COVID-19 response?"

🔗 https://www.bennettinstitute.cam.ac.uk/blog/how-can-google-search-help-covid-19-response/
🗣 https://twitter.com/samgilb
Bennett Institute for Public Policy, University of Cambridge

"Coronavirus Google Searches Could Save Lives"

🔗 https://onezero.medium.com/google-needs-to-share-the-data-from-coronavirus-searches-62e6f60cc363
🗣 [email protected]
OneZero

"Google Searches Can Help Us Find Emerging Covid-19 Outbreaks"

🔗 https://www.nytimes.com/2020/04/05/opinion/coronavirus-google-searches.html
🗣 https://twitter.com/SethS_D
New York Times

"Sniffing out COVID-19 with Google Search Trends."

🔗 https://medium.com/@miahusic/sniffing-out-covid-19-with-google-search-trends-46f80ea83e2f
🗣 https://medium.com/@miahusic
Medium

"Goldman says fewer "Loss of Smell" Google queries suggest better COVID-19 prospects"

🔗 https://www.cnbc.com/2020/04/15/goldman-says-fewer-loss-of-smell-google-searches-is-a-positive-sign-for-pandemic.html
🗣 https://twitter.com/tomwfranck
CNBC

"Search Trends and The Coronavirus Outbreak: Causation or Correlation"

🔗 https://www.semrush.com/blog/search-trends-coronavirus/
🗣 https://twitter.com/SEO
SEMrush

"Your Google Searches Could Shape the Response to the Next Pandemic"

🔗 https://onezero.medium.com/your-google-searches-could-shape-the-response-to-the-next-pandemic-ade9148a5e58
🗣 https://twitter.com/chrisbaraniuk
OneZero

"Smells like trouble? Google searches as a sign of COVID-19"

🔗 https://mtega.com/2020/05/smells-like-trouble-google-searches-as-a-sign-of-covid-19/
🗣 https://twitter.com/mtega


"Can Google Ads data predict the next coronavirus outbreak?"

🔗 https://mashable.com/article/google-ads-data-coronavirus/?europe=true
🗣 https://twitter.com/MattBinder
Mashable

Social Media Data

Social Media Datasets


Covid-19 Twitter chatter dataset for scientific use

🌎🔗 http://www.panacealab.org/covid19/
Dataset of tweets acquired from the Twitter Stream related to COVID-19. Includes all languages, with English, Spanish, and French most prevalent.

COVID-19-TweetIDs

🌎🔗 https://github.com/echen102/COVID-19-TweetIDs
The repository contains an ongoing collection of tweet IDs associated with COVID-19.
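Because the repository shares tweet IDs rather than full tweets, anyone using it needs a separate step to retrieve the tweet content (often called hydration). The sketch below covers only the preparatory part: grouping IDs into fixed-size batches before they are passed to whatever lookup tool is used. The batch size of 100 and the sample IDs are assumptions for illustration; check the limit of the lookup tool or API you actually use.

```python
from typing import Iterable, Iterator, List


def batch_ids(lines: Iterable[str], size: int = 100) -> Iterator[List[str]]:
    """Yield consecutive lists of at most `size` non-empty tweet IDs."""
    if size < 1:
        raise ValueError("size must be at least 1")
    batch: List[str] = []
    for line in lines:
        tweet_id = line.strip()
        if not tweet_id:
            continue  # skip blank lines in the ID file
        batch.append(tweet_id)
        if len(batch) == size:
            yield batch
            batch = []
    if batch:
        yield batch  # final, possibly shorter, batch


# Hypothetical IDs standing in for the lines of an ID file.
sample_lines = [
    "1000000000000000001\n",
    "1000000000000000002\n",
    "\n",
    "1000000000000000003\n",
]
batches = list(batch_ids(sample_lines, size=2))
print(batches)
```

Passing an open file handle instead of `sample_lines` streams the IDs without loading the whole file into memory.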

Twitter COVID-19 Stream

🌎🔗 https://developer.twitter.com/en/docs/labs/covid19-stream/overview
A streaming endpoint into Twitter Developer Labs.
Returns tweets based on Twitter's internal COVID-19 Tweet annotation and a set of defined parameters that Twitter believes deliver a comprehensive view of the conversation around this topic.

COVID-19 Mentions DB Project

🔗 https://covid19-mentions-db.com/
Our goal is to collect all mentions of COVID-19 from Twitter, Facebook, and Instagram, starting from December 1, 2019, in one database and to provide access to this information for all researchers, journalists, and OSINT specialists.