Statistically consistent way to handle E(known outcomes) < deaths #154

adamkucharski · 2024-07-12T06:09:00Z

The current implementation in CFR is based on calculating E(known outcomes) and comparing it to total deaths. However, in extreme examples, such as small outbreaks with a very high CFR (like Ebola in Yambuku in 1976), there can occasionally be situations where E(known outcomes) < deaths, and hence the binomial likelihood calculation is not valid. In this situation the code currently returns NA to make the problem clear to the user.
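To make the failure mode concrete, here is a minimal Python sketch (not the package's actual code). It assumes the expectation has been rounded to an integer number of trials, uses `None` as a stand-in for NA, and the function name `binomial_log_lik` is hypothetical.

```python
from math import comb, log


def binomial_log_lik(deaths, known_outcomes, cfr):
    """Binomial log-likelihood of `deaths` among `known_outcomes` trials.

    `known_outcomes` stands in for a rounded E(known outcomes) (an assumption
    of this sketch); `cfr` must lie strictly between 0 and 1.
    Returns None (a stand-in for NA) when deaths exceed the number of trials,
    because the binomial probability is then undefined.
    """
    if deaths > known_outcomes:
        return None
    return (
        log(comb(known_outcomes, deaths))
        + deaths * log(cfr)
        + (known_outcomes - deaths) * log(1 - cfr)
    )


# Illustrative numbers only: 8 deaths but only 6 expected known outcomes.
print(binomial_log_lik(deaths=8, known_outcomes=6, cfr=0.9))  # None
```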

In the longer term, a more statistically consistent approach would be to integrate over the possible numbers of known outcomes, rather than just using the expectation. This would allow a calculation even when E(known outcomes) < deaths: the sum would run over the plausible values (known outcomes ≥ deaths), and impossible values (known outcomes < deaths) would be omitted automatically because they have zero probability. Something like the following:
$E(\text{CFR}) = \sum_i P(i \text{ known outcomes so far} \mid \text{cases, deaths}) \, E(\text{CFR} \mid i \text{ known outcomes so far})$
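One way this sum could be evaluated is sketched below in Python. It is an illustration under stated assumptions, not a proposed implementation: each case reported on day t is assumed to have an independent probability `p_known[t]` of having a known outcome by now (so the number of known outcomes is a sum of binomials), deaths are assumed binomial given the known outcomes, and the CFR is given a uniform prior. Under that prior, P(deaths | i known outcomes) = 1/(i + 1) for i ≥ deaths and E(CFR | i known outcomes) = (deaths + 1)/(i + 2). The function names are hypothetical.

```python
from math import comb


def known_outcome_pmf(cases, p_known):
    """PMF of the total number of known outcomes, indexed by count.

    Built by convolving one Binomial(cases[t], p_known[t]) per day.
    """
    pmf = [1.0]
    for n, p in zip(cases, p_known):
        day = [comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(n + 1)]
        new = [0.0] * (len(pmf) + n)
        for i, a in enumerate(pmf):
            for k, b in enumerate(day):
                new[i + k] += a * b
        pmf = new
    return pmf


def expected_cfr(cases, p_known, deaths):
    """E(CFR) summed over plausible numbers of known outcomes.

    Assumes a uniform prior on the CFR (an assumption of this sketch).
    Counts below `deaths` get zero weight, so they drop out automatically.
    """
    prior = known_outcome_pmf(cases, p_known)
    weights = [
        0.0 if i < deaths else prior[i] / (i + 1) for i in range(len(prior))
    ]
    total = sum(weights)
    if total == 0:
        raise ValueError("deaths exceed every possible number of known outcomes")
    return sum(w * (deaths + 1) / (i + 2) for i, w in enumerate(weights)) / total


# Illustrative numbers only: E(known outcomes) = 10 * 0.3 = 3 < 5 deaths,
# yet the sum over i >= 5 still gives a well-defined estimate.
print(expected_cfr(cases=[10], p_known=[0.3], deaths=5))
```

With a different prior on the CFR, both the weights and the conditional expectation would change, but the structure of the sum stays the same.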

This was referenced Jul 12, 2024

Remove normal approximation #153

Merged

Including uncertainty in baseline CFR for underascertainment estimation #157

Open

adamkucharski mentioned this issue Oct 4, 2024

Expected outcomes lower than number of deaths #165

Open
