Add statistic correction option #5061

spxiwh · 2025-02-25T09:25:57Z

This patch adds a statistic_correction "statistic keyword" in stat.py. This option will add a constant factor to all returned statistic values.

The immediate problem this patch sets out to fix is this plot (from https://inspirehep.net/literature/2053424)

This shows the desired behaviour using the state of the art statistic as of a few years ago. In particular HL doubles lie above other things.

When adding the new KDE term we see this relative behaviour change and H/L singles lie above the doubles line, causing singles to be weighted more than HL doubles when combining. Using this option we can downrank singles relative to doubles (and add appropriate factors when considering 3-ifo behaviour).

However, the patch is broader than that, it could be used (for e.g.) to try to make 0 correspond to a 50/50 chance of being real ... Or any other thing you could dream up where we just want to scale the statistic.

I've tested that this produces the desired effect.

pycbc/events/stat.py

tdent · 2025-02-25T12:19:45Z

How does this deal with different event types? Is that already in the stat parsing machinery, and if so can you remind me where?

tdent

See suggested comment changes & the question about event types.

spxiwh · 2025-02-25T12:26:05Z

Different event types are handled in the config file. SO one specifies something like:

[sngls-h1&sngls-l1]
statistic-keywords = correction_factor:-3.7
minimum-stat = -4

[sngls-v1]
statistic-keywords = correction_factor:-7.4
minimum-stat = -7

[coinc-h1l1]
statistic-keywords = correction_factor:-1

[coinc-l1v1]
statistic-keywords = correction_factor:-3.7

[coinc-h1v1]
statistic-keywords = correction_factor:-3.7


[coinc-h1l1v1]
statistic-keywords = correction_factor:0

(old naming for this argument, but same idea).

tdent · 2025-02-25T12:39:25Z

Right, that is what I suspected & seems the easiest way of doing this (modulo the case where we would want a different number for the same event type in different coinc times, but that seems not to be needed quite yet).

Co-authored-by: Thomas Dent <thomas.dent@usc.es>

spxiwh · 2025-02-25T12:46:27Z

So to me the only remaining concern is around benchmark_lograte and if this adds redundancy. Is there anything I should do about this, or are folks happy?

GarethCabournDavies · 2025-02-25T12:49:53Z

So to me the only remaining concern is around benchmark_lograte and if this adds redundancy. Is there anything I should do about this, or are folks happy?

I am happy with the redundancy - all the benchmark_lograte means is that the numbers are a bit more natural this way

pycbc/events/stat.py

Co-authored-by: Thomas Dent <thomas.dent@usc.es>

tdent · 2025-02-25T13:03:56Z

If both the benchmark lograte and this term are used in exactly the same ranking statistics, we ought to be able to get rid of one of them by defining a constant, eg BENCHMARK_LOGRATE = -14.6 at the module level to replace the class attribute in ln_noise_rate -= self.benchmark_lograte.

This leaves the output identical but takes out one redundant option which could confuse any users who tried to understand all the optional kwargs.
How do people feel about adding that 'trivial' change here?

tdent · 2025-02-25T21:26:18Z

I guess alternatively, in the spirit of keeping the production branch changes as simple as possible, we could cherry pick this minimal change to the 2.3 branch and then clean up the benchmark redundancy on master with a separate PR.

tdent

Approve, if we agree to address the redundancy of options that just shift the statistic by a constant at least on master.

* Merge part of 4997 to avoid failure * Bump to v2310 * Compressed waveforms bank workflow (#4969) * Add compressed waveforms to bank workflow * Allolw plotting script to use any bank conversion parameter * Some fixes to allow the joined bank to be plotted * Use inference's parameter labels: they are available and mostly good * Add mismatch to plotting, make some tweaks * some tidying * thinko * Try to make the CI workflow run * Fix do-not-compress default * Use different examples in compress bank workflow * Proper name for the github workflow * Thinko * python shebang in compression workflow script * minor edits * move to readily-available waveform * TRy IMRPhenomD instead * revert change to workflow.core * Warn for KeyError in get_decompressed_waveform * Fix issue with if get_decompressed_waveform raised a ValueError * Combined plotifar (#5034) * added plot script * cleanups * remove now unused bits * Generalize fit plotting * rename script * Added page_farstat in summary (#5052) * Fix release naming * Stat correction patch (#5061) * Limit number of stage output jobs * Reorganize FAR/stat plots on summary page (#5061) * Get the dq files into a nice layout (#5064) * Sphinx version CI fix (#5060) * Update pycbc_page_fars_vs_stat (#5067) --------- Co-authored-by: Gareth S Cabourn Davies <gareth.cabourndavies@ligo.org> Co-authored-by: Thomas Dent <thomas.dent@usc.es> Co-authored-by: Rahul Dhurkunde <rahul.dhurkunde@ldas-pcdev6.ligo.caltech.edu>

* Add statistic correction option * Update pycbc/events/stat.py Co-authored-by: Thomas Dent <thomas.dent@usc.es> * Update pycbc/events/stat.py Co-authored-by: Thomas Dent <thomas.dent@usc.es> * Update pycbc/events/stat.py Co-authored-by: Thomas Dent <thomas.dent@usc.es> * Update pycbc/events/stat.py Co-authored-by: Thomas Dent <thomas.dent@usc.es> * Update pycbc/events/stat.py Co-authored-by: Thomas Dent <thomas.dent@usc.es> --------- Co-authored-by: Thomas Dent <thomas.dent@usc.es>

spxiwh requested a review from tdent February 25, 2025 09:25

Add statistic correction option

922a9af

spxiwh force-pushed the pr_stat_correction branch from f60b9bb to 922a9af Compare February 25, 2025 10:37

tdent reviewed Feb 25, 2025

View reviewed changes

pycbc/events/stat.py Outdated Show resolved Hide resolved

tdent reviewed Feb 25, 2025

View reviewed changes

pycbc/events/stat.py Outdated Show resolved Hide resolved

tdent reviewed Feb 25, 2025

View reviewed changes

pycbc/events/stat.py Outdated Show resolved Hide resolved

tdent reviewed Feb 25, 2025

View reviewed changes

pycbc/events/stat.py Outdated Show resolved Hide resolved

tdent reviewed Feb 25, 2025

View reviewed changes

pycbc/events/stat.py Outdated Show resolved Hide resolved

tdent requested changes Feb 25, 2025

View reviewed changes

spxiwh and others added 4 commits February 25, 2025 12:45

Update pycbc/events/stat.py

2aa8fe9

Co-authored-by: Thomas Dent <thomas.dent@usc.es>

Update pycbc/events/stat.py

2a0ab86

Co-authored-by: Thomas Dent <thomas.dent@usc.es>

Update pycbc/events/stat.py

d13e187

Co-authored-by: Thomas Dent <thomas.dent@usc.es>

Update pycbc/events/stat.py

830fe57

Co-authored-by: Thomas Dent <thomas.dent@usc.es>

tdent reviewed Feb 25, 2025

View reviewed changes

pycbc/events/stat.py Outdated Show resolved Hide resolved

Update pycbc/events/stat.py

35116a1

Co-authored-by: Thomas Dent <thomas.dent@usc.es>

GarethCabournDavies mentioned this pull request Feb 25, 2025

Changes for v2.3.10 #5044

Merged

tdent added offline search v23_release_branch PRs applied to the v2.3.X release branch or to be cherry-picked if merging to master labels Feb 25, 2025

tdent self-requested a review February 25, 2025 21:24

tdent approved these changes Feb 26, 2025

View reviewed changes

spxiwh merged commit 0a16d83 into gwastro:master Feb 27, 2025
31 checks passed

spxiwh deleted the pr_stat_correction branch February 27, 2025 12:52

spxiwh added a commit to spxiwh/pycbc that referenced this pull request Feb 27, 2025

Stat correction patch (gwastro#5061)

e64e4ec

spxiwh added a commit to spxiwh/pycbc that referenced this pull request Feb 27, 2025

Reorganize FAR/stat plots on summary page (gwastro#5061)

123801c

spxiwh added a commit to spxiwh/pycbc that referenced this pull request Feb 27, 2025

Stat correction patch (gwastro#5061)

ed5b345

spxiwh added a commit to spxiwh/pycbc that referenced this pull request Feb 27, 2025

Reorganize FAR/stat plots on summary page (gwastro#5061)

668e760

GarethCabournDavies removed the v23_release_branch PRs applied to the v2.3.X release branch or to be cherry-picked if merging to master label Mar 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add statistic correction option #5061

Add statistic correction option #5061

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Add statistic correction option #5061

Add statistic correction option #5061

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!