Readme for COVID-2019.ru

To cite this repository in publications use:

Kouprianov, A. (2020--2023). COVID.2019.ru. Coronavirus epidemics in Russia: data and scripts.
URL https://github.com/alexei-kouprianov/COVID.2019.ru

A BibTeX entry for LaTeX users is:

@Manual{,
    title = {COVID.2019.ru. Coronavirus epidemics in Russia: data and scripts},
    author = {Kouprianov, Alexei},
    year = {2020--2023},
    note = {data, R, and perl code},
    url = {https://github.com/alexei-kouprianov/COVID.2019.ru},
}

This repo was created to keep records of the COVID-2019 epidemics in Russia. The dataset is based on the official reports of confimed cases. This means that the data lag behind the spread of the virus. See also a magnificent project by Johns Hopkins Univ.

Collection of World resources:

covid19-r at GitHub

Other resources on Russia:

MediaZona at GitHub

Dmitrii Kobak on excess mortality:

Excess mortality during the COVID-19 pandemic

All images originally published in this repository or resulting from the scripts but not routinely uploaded to GitHub are licensed under cc-by-4.0

Fitting and prognostic scripts ceased to work properly towards the end of April 2020, now they are fixed and the images are kept being updated.

Data extraction procedures

While working on this project I had to change data gathering procedures several times. During the early weeks, until 2020-03-24, I relied mostly on the media publishing updates from time to time. Then, from 2020-03-24/25 through 2020-04-07 I shifted to the website of RosPotrebNadzor, which published daily reports with a breakdown by regions. Then, from 2020-04-08 on, стопкоронавирус.рф became the main source of information (also, some information on recovered and deceased from it has been used retrospectively to fill in the gaps). RosPotrebNadzor was rather inconsistent in data formats mostly relying on HTML representation of simple lists. стопкоронавирус.рф changed their reporting format twice, most notably on 2020-04-29. Since 2020-04-29 the data have been published as a valid JSON chunk embedded in the code of the webpage.

As for now, the procedure of data extraction and express-analysis works as follows:

Go to the COVID.2019.ru/scripts/ folder and run from the command line:

perl stopcoronavirus.extractor.20200429.pl
Rscript covid.2019.converter.r 
perl covid.2019.disaggregator.pl

Then, go to R and run scripts stored in:

covid.2019.ru.libraries.r
covid.2019.ru.main.20201104.r
covid.2019.ru.plots.20201104.r

Please, read carefully the comments in covid.2019.ru.main.20201104.r. A vitally important part of the script is commented. It should not be implemented carelessly, so, please, uncomment and execute it mindfully.

An outdated procedure (worked for me from 2020-06-04 through 2020-10-13 until I've run out of memory):

perl stopcoronavirus.extractor.20200429.pl
Rscript covid.2019.converter.r 
perl covid.2019.disaggregator.pl
Rscript covid.2019.ru.main.r

An outdated procedure (for scripts published before 2020-06-04):

Run scripts/stopcoronavirus.extractor.20200429.pl (results in downloads/stopcoronavirus.storage.cumulative.20200429.txt, downloads/stopcoronavirus.storage.moment.20200429.json and downloads/stopcoronavirus.timestamp.moment.20200429.txt
Run covid.2019.converter.r (results in downloads/increment.txt and downloads/increment.0.txt)
Pick downloads/increment.txt and feed it into an electronic table (to adjust for the English language region codes).
Copy the resulting table to data/momentary.txt
Run scripts/covid.2019.disaggregator.pl (results in data/momentary.da.txt).

After that one can start running analytic and plotting R scripts. All of them are listed in:

scripts/covid.2019.ru.main.r

Disaggregated data for Cheliabinsk

The disaggregated data for Cheliabinsk can be downloaded with wget (command-line string is given below), then processed with cheliabinsk.extractor.pl and cheliabinsk.r.

wget -S 'https://коронавирус74.рф/districts/' --no-check-certificate -O cheliabinsk.raw.txt

Illustrations

The visualizations derived from nation-wide data are as follows. For regional graphs see a special subfolder. An example of regional graphs (St. Petersburg, Russia) is given below in this readme file.

Regional passport

St. Petersburg is taken as an example, but all regions are available.

Name		Name	Last commit message	Last commit date
Latest commit History 594 Commits
Rmd		Rmd
data		data
downloads		downloads
misc		misc
plots		plots
scripts		scripts
.gitignore		.gitignore
README.md		README.md
readme.extended.ru.txt		readme.extended.ru.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Readme for COVID-2019.ru

Data extraction procedures

Disaggregated data for Cheliabinsk

Illustrations

Regional passport

About

Uh oh!

Releases

Packages

Uh oh!

Languages

alexei-kouprianov/COVID.2019.ru

Folders and files

Latest commit

History

Repository files navigation

Readme for COVID-2019.ru

Data extraction procedures

Disaggregated data for Cheliabinsk

Illustrations

Regional passport

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages