The Centers of Disease Control and Prevention is hosting collaborative forecasting projects related to predicting the coronavirus disease spread, and anticipate the mortality and number of hospitalizations caused by the disease across the country. This critically timed project comprises a handful of teams of leading data scientists, epidemiologists, statisticians, and high-performance computing researchers from national laboratories, public universities, public health institutions, and some private sector agents. Our team is using deep learning models to forecast specific targets at the national, regional, state, and local levels. In addition to CDC data, we are incorporating many other real-time datasets such as syndromic surveillance data. We combine these datasets with domain knowledge using end-to-end deep learning models to predict targets on a weekly basis. The CDC synthesizes our weekly and monthly predictions with other models to help determine policy and other planning decisions to help communities prepare for and fight the disease. We have extensive experience with disease surveillance - AdityaLab has been leading a team at the CDC FluSight challenge since 2018 (our model EpiDeep had the best performance in the HHS1 region) and have published multiple research papers on incidence prediction at major venues (see here).
We show two important changes in trend that our model DeepCOVID predicted with several weeks of anticipation: second peak in US national incidence mortality (left) and uptrend in CA incidence mortality (right); both of the pictures above use the JHU dataset as ground truth. Also see our latest results on mortality prediction on the CDC website and FiveThirtyEight. Also see our website for DeepCOVID.
Pre-print on DeepCOVID model (to appear in IAAI-21) [PDF], appendix [PDF] and other resources [LINK].
Pre-print on forecasting COVID-contamined flu (to appear in AAAI-21) [PDF]
[CODE].
Our team DeepOubreak (led by Georgia Tech, collaborators from UIowa and VT) expanded our published research and participated in several COVID-19 data science challenges. Our work (led by PhD student Alex Rodriguez) was awarded:
Our project is partially supported through (NSF CAREER, NSF CISE Expeditions, NSF IIS, NSF COVID RAPID, ORNL, GT and GTRI Rapid Seed Funds).
(to whom correspondence should be addressed)