
Cyclomatic complexity of simulate_chains() #179

Closed
jamesmbaazam opened this issue Jan 23, 2024 · 5 comments · Fixed by #197
Labels: discussion, enhancement, help wanted, question

Comments

@jamesmbaazam (Member)

In #171, the new simulate_chains() function tripped the cyclocomp_linter with a cyclomatic complexity of 21 against the expected 15.

This issue is to discuss ways to reduce the function's complexity. More broadly, it raises the general question of whether that level of complexity is always achievable for simulation functions as they often require various levels of control flows and complex logic.
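For context, cyclomatic complexity roughly counts the number of independent paths through a function: each `if`, loop, and `&&`/`||` operator adds to the count. A minimal, purely illustrative sketch (this is not the actual `simulate_chains()` code) of how the count accumulates, measured with the `cyclocomp` package that `cyclocomp_linter` builds on:

```r
# Hypothetical function, NOT from the package: each branch point
# below contributes to the cyclomatic complexity score.
library(cyclocomp)

f <- function(n, susc_pop) {
  if (n < 0) stop("`n` must be non-negative")  # branch
  out <- numeric(n)
  for (i in seq_len(n)) {                      # loop
    if (i %% 2 == 0 && susc_pop > 0) {         # branch + short-circuit operator
      out[i] <- i
    }
  }
  out
}

cyclocomp(f)  # returns an integer > 1; the lint fires when this exceeds 15
```

A function with 21 such paths, as reported for `simulate_chains()`, therefore has six more branch points than the linter's default tolerance.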

@jamesmbaazam added the enhancement, help wanted, question, and discussion labels on Jan 23, 2024
@sbfnk (Contributor) commented Jan 24, 2024

it raises the general question of whether that level of complexity is always achievable for simulation functions as they often require various levels of control flows and complex logic.

I think that's a good point. We struggled with this one before and have ended up more or less where we started. That said, there are things we could consider, e.g.:

  • turn the if clause at l. 219:221 into a stopifnot() call
  • move the susceptible adjustment at l. 230:257 into a function, susceptible_adjust_next_gen(next_gen, susc_pop) or the like
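The two suggestions above might look roughly like the following. The function name and arguments come from the comment; the bodies are illustrative assumptions, not the actual package code:

```r
# Sketch only: the real validation and adjustment logic lives in
# simulate_chains() at the line ranges cited above.

# Suggestion 1: a multi-line `if (...) stop(...)` clause collapses into a
# single stopifnot() call, removing one branch from the caller.
stopifnot(
  "`susc_pop` must be a positive number" =
    is.numeric(susc_pop) && susc_pop > 0
)

# Suggestion 2: the susceptible adjustment moves into its own helper, so
# its internal branches count against the helper, not simulate_chains().
susceptible_adjust_next_gen <- function(next_gen, susc_pop) {
  # Illustrative body: cap offspring at the remaining susceptible pool.
  pmin(next_gen, susc_pop)
}
```

The key effect is that `cyclocomp_linter` scores each function separately, so extracting a self-contained block of control flow into a helper shifts its branches out of the flagged function.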

@jamesmbaazam (Member, Author)

Thanks, Seb, I've implemented these ideas in #171 and reduced it by 1 unit. I'll keep this issue open to consolidate more ideas.

@sbfnk (Contributor) commented Jan 26, 2024

Thanks, Seb, I've implemented these ideas in #171 and reduced it by 1 unit.

Well that was worth it then 🫠

Probably not a bad idea to collect more ideas here, but also: is there something magical about the number 15 for cyclomatic complexity? I worry about falling subject to Goodhart's law, or, more specifically, about making the code worse just to hit an arbitrary target.

Tagging @Bisaloo as it's potentially a question of broader interest.

@jamesmbaazam (Member, Author)

Totally agree, and that's why I opened the issue. I probably should have set it up as a discussion on the Epiverse-TRACE org instead. I do see the point of reducing unnecessary branching in smaller functions, but I don't see how that's always possible in larger simulation functions.

@Bisaloo (Member) commented Jan 29, 2024

Yes, I agree we don't want to fall into the trap described by Goodhart's law.

The metrics used in the project should not be treated as targets but as a temperature check. I see them as the same tool as "normal ranges" in health checkups: if a value is outside the range, it's good that a physician takes a deeper look, but it doesn't necessarily mean something is wrong.

The same applies here. 15 is a value that I and others have empirically found to correlate well with whether a function is easily understandable. If the cyclomatic complexity exceeds 15, it is worth taking the time to consider whether the function can be refactored to make it easier to understand. But that may simply not be possible in some cases, and once a couple of pairs of eyes have looked at potential improvements, we can make an exception.

The same goes for code coverage and the other metrics we use in the project, BTW. As a first approach, we try to stick to what usually works well (full code coverage and cyclomatic complexity < 15). But we know there are cases where these metrics don't capture what we are actually trying to achieve, and we should never artificially chase a perceived target value.
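When an exception is agreed, the threshold itself is configurable in lintr. Assuming the project uses a standard `.lintr` configuration file (an assumption; the actual setup may differ), a sketch of raising the limit looks like:

```r
# .lintr (DCF format read by {lintr}); cyclocomp_linter() defaults
# to complexity_limit = 15L, and the limit can be raised explicitly
# once reviewers have agreed the extra branching is justified.
linters: linters_with_defaults(
    cyclocomp_linter(complexity_limit = 20L)
  )
```

This keeps the exception visible and versioned rather than silently suppressing the lint.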

If this wasn't clear until now, suggestions on how to make it clearer, either in the documentation (blueprints) or in how the checks are presented, are welcome!
