Add full cut optimization as introduced in pyirf 0.13 by LukasBeiske · Pull Request #2789 · cta-observatory/ctapipe

LukasBeiske · 2025-06-26T18:03:53Z

This changes the PointSourceSensitivityOptimizer to use the full cut optimization introduced in pyirf 0.13. The previous EventDisplay-like optimization can now be used via the PointSourceSensitivityGhOptimizer.

I did a quick comparison of the three optimizers using prod6 files (multiplicity >= 2 for gh opt and percentile cuts, as the HillasReconstructor is used):

I am not sure why the EventDisplay-like optimization results in a better sensitivity at high energies.

Fixes #2771

maxnoe · 2025-06-26T18:30:26Z

How fine did you make the scanning of the cuts? In principle, the full cut opt should always be at least as good as the one that is restricted, if it is allowed to find the same cuts.

LukasBeiske · 2025-06-26T19:02:02Z

> How fine did you make the scanning of the cuts? In principle, the full cut opt should always be at least as good as the one that is restricted, if it is allowed to find the same cuts.

I kept everything at the default values, as we have them in here right now. So, if I'm not mistaken, the only difference would be that the EventDisplay-like optimization has a theta cut with 68% efficiency, while the full optimization only tries 60% and 70%.
But I doubt that this makes such a difference. I'll test that and take a closer look again tomorrow.

Also, I think there is no check for a minimum number of events per bin in the full cut optimization, while the 68% theta cut in the EventDisplay-like optimization requires a minimum of 10 events per bin. However, this only seems to play a role for the two lowest energy bins, if at all.
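To illustrate the minimum-events check mentioned above, here is a minimal sketch in plain Python of a per-bin percentile cut that falls back to a fill value when a bin holds fewer than `min_events` events. The helper names, the toy data, and the `None` fill value are assumptions made for illustration; this is not the pyirf or ctapipe implementation.

```python
import math


def percentile(values, q):
    """Linear-interpolation percentile (q in percent) of a non-empty list."""
    data = sorted(values)
    pos = (len(data) - 1) * q / 100.0
    lo = math.floor(pos)
    hi = math.ceil(pos)
    return data[lo] + (data[hi] - data[lo]) * (pos - lo)


def percentile_cut_per_bin(theta, energy, bin_edges, q=68, min_events=10, fill_value=None):
    """Return one theta cut per energy bin.

    Bins with fewer than `min_events` events get `fill_value` instead of
    a percentile computed from too few events.
    """
    cuts = []
    for low, high in zip(bin_edges[:-1], bin_edges[1:]):
        in_bin = [t for t, e in zip(theta, energy) if low <= e < high]
        if len(in_bin) < min_events:
            cuts.append(fill_value)
        else:
            cuts.append(percentile(in_bin, q))
    return cuts


# Toy data: 20 events in the first energy bin, only 3 in the second.
theta = [0.01 * i for i in range(1, 21)] + [0.5, 0.6, 0.7]
energy = [1.5] * 20 + [15.0] * 3

cuts = percentile_cut_per_bin(theta, energy, bin_edges=[1.0, 10.0, 100.0])
# The second bin has too few events, so it receives the fill value.
```

Without such a guard, a sparsely populated bin would still get a cut, but one driven by a handful of events.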

LukasBeiske · 2025-06-27T16:44:52Z

There was still an error with the application of the cuts in the irf tool. It should be correct now and this improved the sensitivity situation a lot, but there are still some bins where the EventDisplay-like optimization outperforms the full optimization, even though the former uses a theta cut with 70% efficiency for this plot, which also gets tested in the full optimization.

ctao-dpps-sonarqube · 2025-06-27T16:58:39Z

Analysis Details

1 Issue

0 Bugs
0 Vulnerabilities
1 Code Smell

Coverage and Duplications

89.10% Coverage (94.30% Estimated after merge)
0.00% Duplicated Code (0.70% Estimated after merge)

Project ID: cta-observatory_ctapipe_AY52EYhuvuGcMFidNyUs

maxnoe · 2025-06-27T17:18:02Z

If I remember correctly, these percentiles cannot be compared directly, as the eventdisplay-like optimization computes the percentile on the events surviving the initial cut, whereas the full optimization computes it on all events.
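A toy illustration of why the two percentiles are not directly comparable, as a sketch under stated assumptions: the events, the gammaness threshold of 0.5, and the helper `containment_cut` are all made up for this example and are not taken from pyirf. A "70% efficiency" theta cut computed on the events surviving an initial gh cut keeps a different fraction of the full sample than a 70% cut computed on all events.

```python
def containment_cut(thetas, efficiency_percent):
    """Smallest theta that keeps at least `efficiency_percent` of the given events."""
    data = sorted(thetas)
    # Integer ceiling of efficiency_percent * n / 100, avoiding float rounding.
    n_keep = (efficiency_percent * len(data) + 99) // 100
    return data[n_keep - 1]


# Toy events as (gammaness, theta): here, higher gammaness goes with smaller theta.
events = [(i / 99, 0.30 - 0.25 * i / 99) for i in range(100)]

all_thetas = [theta for _, theta in events]
surviving_thetas = [theta for g, theta in events if g >= 0.5]

# Percentile on all events (full-optimization style in this toy picture).
cut_all = containment_cut(all_thetas, 70)

# Percentile on the events surviving the initial gh cut.
cut_surviving = containment_cut(surviving_thetas, 70)

# Number of events out of all 100 that pass the gh cut and the survivor-based theta cut.
n_selected = sum(1 for g, theta in events if g >= 0.5 and theta <= cut_surviving)
```

In this toy sample the nominal 70% cut on the survivors corresponds to 35 of the 100 original events, so the two "70%" labels describe different selections.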

LukasBeiske · 2025-06-27T17:42:02Z

> If I remember correctly, these percentiles cannot be compared directly, as the eventdisplay-like optimization computes the percentile on the events surviving the initial cut, whereas the full optimization computes it on all events.

Ah, good point, I forgot about that. I'll run the full optimization again with a finer grid. I guess this underperformance in some bins will disappear then.

LukasBeiske · 2025-06-30T15:15:34Z

Running both optimizations with finer grids:

PointSourceSensitivityOptimizer:
  gh_cut_efficiency_step=0.02
  theta_cut_efficiency_step=0.02
  
PointSourceSensitivityGhOptimizer:
  gh_cut_efficiency_step=0.02

brings the performance of the full optimization closer, but some bins are still worse than with the EventDisplay-like optimization.

kosack

> gh_cut_efficiency_step=0.02
> theta_cut_efficiency_step=0.02

If a smaller step is important to get a good sensitivity, please also update the ctapipe-quickstart configurations to provide the best values for users, and also the default values in the tool itself.

LukasBeiske · 2025-07-07T13:01:45Z

> gh_cut_efficiency_step=0.02
> theta_cut_efficiency_step=0.02
>
> If a smaller step is important to get a good sensitivity, please also update the ctapipe-quickstart configurations to provide the best values for users, and also the default values in the tool itself.

I just noticed this (similar for the full optimization):

The config is exactly the same (default values) apart from the step size. This doesn't make sense. I'm re-running everything now and, if this behavior is still there, I'll convert this PR to draft until I figure out what's going on here.

maxnoe · 2025-10-24T12:53:06Z

> There was still an error with the application of the cuts in the irf tool.

Is this an error introduced here? Or does it affect main? If it affects also main, could you open a PR just with the bugfix please?

LukasBeiske · 2025-10-30T14:19:27Z

> There was still an error with the application of the cuts in the irf tool.
>
> Is this an error introduced here? Or does it affect main? If it affects also main, could you open a PR just with the bugfix please?

This does not affect main, it was related to the application of the multiplicity cut introduced here.

Hckjs · 2025-11-24T19:37:58Z

> gh_cut_efficiency_step=0.02
> theta_cut_efficiency_step=0.02
>
> If a smaller step is important to get a good sensitivity, please also update the ctapipe-quickstart configurations to provide the best values for users, and also the default values in the tool itself.
>
> I just noticed this (similar for the full optimization):
>
> The config is exactly the same (default values) apart from the step size. This doesn't make sense. I'm re-running everything now and, if this behavior is still there, I'll convert this PR to draft until I figure out what's going on here.

Did you use the same dataset here to calculate the sensitivities as for the cut optimization?

Hckjs · 2025-12-01T11:28:26Z

If I understand correctly, the gh cut optimization:

1. first calculates (initial) theta cuts based on initial gh cuts
2. optimizes the gh cuts based on these initial theta cuts
3. calculates the optimized theta cuts based on the optimized gh cuts

When the relative sensitivity is minimized using the initial theta cuts, it doesn't mean that it is also minimized for the "optimized" theta cuts based on the optimized gh cuts, right? So the discrepancy is expected by construction...

The full cut optimization should not have that problem since it's doing a full grid search:

maxnoe · 2025-12-01T12:01:36Z

@Hckjs This is correct yes. One could probably get around this by optimizing again at least once or until it converges. But I think we should just use the global optimization scheme instead.
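The difference between the stepwise scheme and a global grid search can be sketched with a toy objective. Everything here is hypothetical: the grid of efficiencies, the made-up `toy_sensitivity` function (lower is better), and the simplification that each step minimizes the objective with the other cut held fixed. It is meant only as an analogy, not as the pyirf algorithm.

```python
from itertools import product

# Hypothetical grid of cut efficiencies in percent.
GRID = range(10, 100, 10)


def toy_sensitivity(gh, theta):
    """Made-up relative sensitivity (lower is better) in which the two cuts are coupled."""
    dg = (gh - 60) / 100
    dt = (theta - 70) / 100
    return dg**2 + dt**2 + 1.5 * dg * dt


def stepwise(initial_theta):
    """Optimize gh for a fixed initial theta, then theta for the chosen gh."""
    gh = min(GRID, key=lambda g: toy_sensitivity(g, initial_theta))
    theta = min(GRID, key=lambda t: toy_sensitivity(gh, t))
    return gh, theta


def full_grid():
    """Evaluate every (gh, theta) combination and return the best pair."""
    return min(product(GRID, GRID), key=lambda cuts: toy_sensitivity(*cuts))


stepwise_cuts = stepwise(initial_theta=80)
joint_cuts = full_grid()
```

Because the joint search evaluates every pair that the stepwise scheme could reach, its result on the same data can never be worse, while the stepwise result depends on the starting theta cut.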

maxnoe · 2025-12-01T12:04:29Z

> I just noticed this (similar for the full optimization):

I didn't realize that plot was for the old optimization. As @Hckjs points out, for the gh opt it doesn't hold true that finer steps should always result in lower sensitivity.

For the full optimization, it only holds if

* the coarse steps are part of the finer steps (otherwise the best step could be a step in the coarser sample not contained in the finer sample)
* the sensitivity is computed on the same dataset used for the cut optimization; otherwise statistical differences, not actually better cuts, could dominate the difference in sensitivity (i.e. "overtraining", the reason why we should use a separate dataset in the first place).
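The first condition can be checked mechanically. The following is a minimal sketch with assumed grid bounds of 10% to 90% and a made-up objective; the function names are hypothetical and not part of ctapipe or pyirf. Working in integer percent avoids floating-point surprises when comparing grid points.

```python
def efficiency_grid(step_percent, low=10, high=90):
    """Cut efficiencies in percent from `low` to `high` (hypothetical bounds)."""
    return set(range(low, high + 1, step_percent))


def best(grid):
    """Best value of a made-up objective (lower is better) over a grid."""
    return min(abs(efficiency - 60) for efficiency in grid)


coarse = efficiency_grid(10)
fine_nested = efficiency_grid(2)      # contains every coarse grid point
fine_not_nested = efficiency_grid(3)  # misses coarse points such as 20 and 60

is_nested = coarse <= fine_nested
is_not_nested = not (coarse <= fine_not_nested)
```

With the nested grid, the finer search can only match or improve on the coarse optimum on the same data. With the non-nested grid, the coarse optimum at 60 is not available, so the finer search ends up with a worse value in this toy case.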

ctao-sonarqube · 2025-12-03T15:42:22Z

Quality Gate passed

Issues
0 New issues
0 Fixed issues
0 Accepted issues

Measures
0 Security Hotspots
89.2% Coverage on New Code
0.0% Duplication on New Code

LukasBeiske · 2025-12-03T16:06:25Z

> I just noticed this (similar for the full optimization):
>
> I didn't realize that plot was for the old optimization. As @Hckjs points out, for the gh opt it doesn't hold true that finer steps should always result in lower sensitivity.
>
> For the full optimization, it only holds if
>
> * the coarse steps are part of the finer steps (otherwise the best step could be a step in the coarser sample not contained in the finer sample)
> * the sensitivity is computed on the same dataset used for the cut optimization; otherwise statistical differences, not actually better cuts, could dominate the difference in sensitivity (i.e. "overtraining", the reason why we should use a separate dataset in the first place).

Thanks @Hckjs, I missed that. However, doing it again for the full optimization using the same dataset for cut optimization and sensitivity calculation, the same problem is visible:

I'll look into this again. Maybe there is something I'm still missing in the grid search within pyirf.

kosack · 2025-12-17T14:52:13Z

> I'll look into this again. Maybe there is something I'm still missing in the grid search within pyirf.

Looking at only the final sensitivity makes it a bit hard to debug, since so many factors affect it. Those fluctuations could be due to low statistics if one of the cuts is too tight. It might be good to compare the cut efficiencies, background rates, PSF, and effective areas separately.

LukasBeiske · 2026-01-15T15:02:48Z

> Looking at only the final sensitivity makes it a bit hard to debug, since so many factors affect it. Those fluctuations could be due to low statistics if one of the cuts is too tight. It might be good to compare the cut efficiencies, background rates, PSF, and effective areas separately.

I did some more plots (see below) and checked the code again, but I did not find anything new.
However, since Jonas made the plot above (where the finer grid search is always as good as or better than the coarser one) based on files he re-processed from dl1, my current suspicion is that something changed between ctapipe 0.23.1 (with which the dl2 files I am using were processed) and now that fixes this problem.
I will re-process the same files Jonas used and check whether this is actually the case.

maxnoe

Two minor comments on docstrings, otherwise looks good, thanks a lot!

maxnoe · 2026-06-12T11:37:26Z

I resolved those myself

ctao-sonarqube · 2026-06-12T11:50:22Z

Quality Gate passed

Issues
0 New issues
0 Fixed issues
0 Accepted issues

Measures
0 Security Hotspots
88.7% Coverage on New Code
0.0% Duplication on New Code

maxnoe reviewed Jun 27, 2025

Comment thread src/ctapipe/irf/optimize.py Outdated

LukasBeiske mentioned this pull request Jun 30, 2025

Generalise table preprocessing #2791

Merged

kosack requested changes Jul 7, 2025

LukasBeiske marked this pull request as draft July 7, 2025 17:46

maxnoe reviewed Oct 24, 2025

Comment thread src/ctapipe/irf/optimize.py Outdated

maxnoe reviewed Oct 24, 2025

Comment thread src/ctapipe/irf/optimize.py Outdated

maxnoe reviewed Oct 24, 2025

Comment thread src/ctapipe/irf/optimize.py Outdated

LukasBeiske force-pushed the add_full_cut_opt branch from 7dcc8e6 to 1e98b11 Compare December 3, 2025 13:42

LukasBeiske mentioned this pull request Jan 20, 2026

Refactor DL2EventLoader to use FeatureGenerator #2919

Draft

7 tasks

LukasBeiske force-pushed the add_full_cut_opt branch from cad944f to 126cdc1 Compare January 29, 2026 11:40

kosack previously approved these changes Jun 5, 2026

LukasBeiske and others added 15 commits June 12, 2026 00:58

Start adding full cut opt from pyirf 0.13.0; remove unnecessay 'class…

79448df

…es' attributes

Update tests

db5b249

Fix multiplicity computation; remove multiplicity precut

fe23ded

Add changelog

2311a7a

Index by extname when reading an OptimizationResult

ce0e5ad

Fix application of multiplicity cut in irf tool

6e5ee95

Remove rebase artifacts

503e5bd

Fix test after rebase

41d01f4

Do not shadow builtin

8d19b62

Enable allow_none for AstroQuantity with explicit physical_type and a…

e660fab

…llow implicit definition of physical_type via default_value

Address comments

754be2f

Update resource configs and docstring

3b8e7e5

None -> null in optimize_cuts.yaml

60782bb

Co-authored-by: Jonas Hackfeld <53918415+Hckjs@users.noreply.github.com>

None -> null in optimize_cuts.yaml

6829da7

Co-authored-by: Jonas Hackfeld <53918415+Hckjs@users.noreply.github.com>

Address comments

2fe1e5f

LukasBeiske dismissed kosack’s stale review via 2fe1e5f June 11, 2026 23:06

LukasBeiske force-pushed the add_full_cut_opt branch from e44f95a to 2fe1e5f Compare June 11, 2026 23:06

maxnoe reviewed Jun 12, 2026

Comment thread src/ctapipe/irf/optimize.py Outdated

maxnoe reviewed Jun 12, 2026

Comment thread src/ctapipe/irf/optimize.py Outdated

maxnoe previously approved these changes Jun 12, 2026

Improve docstrings

aacf578

maxnoe dismissed their stale review via aacf578 June 12, 2026 11:34

maxnoe approved these changes Jun 12, 2026

maxnoe added this to the v0.31.0 milestone Jun 12, 2026

kosack approved these changes Jun 12, 2026

kosack merged commit 5cb6bd1 into main Jun 12, 2026
13 checks passed

maxnoe deleted the add_full_cut_opt branch June 23, 2026 12:54
