・Data collection and processing are a major hurdle to quantitative analysis and quantitative discussion of COVID-19 infections in Japan.
・Reason: Formats (file formats such as PDF and Excel, as well as data structures) are not uniform.
・Reason: Data with the same name may have different definitions.
Example: The number of severely ill patients (based on national standards), the number of severely ill patients (based on the old Tokyo standards and the new Tokyo standards), and the number of severely ill patients (based on the Osaka Prefecture standards)
・Reason: The institutions that release the data, and the timing of the releases, vary.
Example: Sources to consult when collecting vaccine data
・Vaccines (general vaccinations) -> Digital Agency
・Vaccines (healthcare workers, data to April 9, 2021) -> Ministry of Health, Labour and Welfare
・Vaccines (healthcare workers, data from April 12, 2021) -> Prime Minister’s Office
・Consequently, we automatically collect data related to COVID-19 infections and provide it free of charge as centralized prefectural (plus national) panel data.
・We provide both daily data and weekly data processed from the daily data.
・The collected data covers a wide range, including population, the state of infections, the healthcare delivery system, the presence or absence of behavioral restrictions, vaccines, flows of people, weather, etc.
・Until now, no dataset has been released that incorporates this much information by prefecture.
・This dataset can be used freely for data analysis and for model analysis such as causal inference.
・The data is updated automatically on the server each day (some data is updated weekly).
・Using this dataset greatly reduces the troublesome data collection and processing work required for analysis.
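As an illustration of how weekly data can be derived from daily prefectural data, here is a minimal Python sketch. It is not the dataset's actual processing pipeline. The row layout (date, prefecture, value), the choice to sum values, and the Monday-to-Sunday week definition are all assumptions made for this example, and the numbers are made up.

```python
from collections import defaultdict
from datetime import date, timedelta


def week_ending(d: date) -> date:
    """Return the Sunday that closes the Monday-to-Sunday week containing d."""
    # date.weekday() is 0 for Monday and 6 for Sunday.
    return d + timedelta(days=6 - d.weekday())


def daily_to_weekly(rows):
    """Sum daily values into weekly totals per prefecture.

    rows: iterable of (date, prefecture, value) tuples.
          This layout is hypothetical, not the dataset's real schema.
    Returns a dict keyed by (prefecture, week_ending_date).
    """
    totals = defaultdict(int)
    for d, prefecture, value in rows:
        totals[(prefecture, week_ending(d))] += value
    return dict(totals)


# Toy daily panel: two prefectures, 14 days each (illustrative values only).
start = date(2021, 4, 5)  # a Monday
daily = []
for i in range(14):
    day = start + timedelta(days=i)
    daily.append((day, "Tokyo", i + 1))
    daily.append((day, "Osaka", 10))

weekly = daily_to_weekly(daily)
for (prefecture, end), total in sorted(weekly.items()):
    print(prefecture, end.isoformat(), total)
```

Summing suits flow variables such as new cases. Stock variables such as the number of inpatients on a given day would more plausibly call for a weekly average or an end-of-week value, so the aggregation rule should be chosen per variable.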
・We have taken all possible measures with regard to the accuracy of the information contained in this dataset, but we bear no responsibility for any actions taken by users using the information contained in this dataset.
・In addition, we bear no responsibility for any damage users incur from using this dataset or for any damage users cause to third parties.
・The information contained in this dataset is subject to change or deletion without notice.
・This dataset was developed by Taisuke Nakata and Wataru Okamoto of the University of Tokyo as individual researchers. The University of Tokyo bears no responsibility whatsoever with regard to this dataset.
Data visualization: COVID-19 standard dashboard
Model analysis: Forecast tool for the number of inpatients and severely ill patients