add true position and E to hits dataset #127

YifanC · 2024-05-24T19:14:17Z

Added true drift position and true energy reconstructed hits.
Adapted to use configuration instead hardcoded numbers for hit reconstruction

krwood

Most of these changes look good to me, except for the namesake of this PR (adding truth information to the hits dataset), but it would be good to see some validation plots before merging this in.

krwood · 2024-05-28T22:43:25Z

src/proto_nd_flow/reco/charge/calib_prompt_hits.py

@@ -161,6 +161,33 @@ def run(self, source_name, source_slice, cache):

        has_mc_truth = packet_seg_bt is not None

+        if has_mc_truth:
+            self.calib_hits_dtype = np.dtype(self.calib_hits_dtype.descr + [('x_true_seg_t', 'f8'), ('E_true_recomb_elife', 'f8')])


What's the rational for this? I don't think we should be mixing truth and reco information in the same dataset. This is what the back tracking datasets are for, no? We should also try and keep the "reco" datasets consistent for mc and data..

Hits with true t0 position is required for MLreco label making. Storing all contributions with the new commit (also corrected a unit issue). Given our setup, the output recombination (close to 0.7), lifetime (close to 1), E values (~0.5) all make sense to me. The differences between x and x_true_seg_t, E and E_true_recomb_elife also fits the scale and is consistent with the making.

Truth information should go into the backtracking and truth datasets, not the reco datasets.

Can you make more concrete suggestions how to structure this? The current proposal makes sense to me in a way that these are extension of what calib_prompt_hits have. Especially E_true_recomb_elife is half true half readout. Putting in mc_truth requires backtracking, not straightforward matching. It doesn't affect the processing of the dataset and the name of the variable is clear about that it uses truth information.

add true position and E to hits dataset

459a409

YifanC requested review from diaza, krwood and cuddandr May 24, 2024 19:14

math.log

d13d96b

krwood reviewed May 28, 2024

View reviewed changes

Yifan Chen and others added 4 commits May 30, 2024 01:37

store all the true positions

757f337

make sure hit merger can use calib_hits_dtype

909fbe6

manually resolve merging issues in calib_prompt_hits.py

d525b31

resolve merging issue in calib_prompt_hits.py [2]

06e566d

YifanC mentioned this pull request Jun 4, 2024

mc_truth/interaction + true_pos in prompt_hit + backtrack #128

Merged

YifanC closed this Jun 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add true position and E to hits dataset #127

add true position and E to hits dataset #127

YifanC commented May 24, 2024

krwood left a comment

krwood May 28, 2024

YifanC May 30, 2024

krwood May 30, 2024

YifanC May 30, 2024

add true position and E to hits dataset #127

add true position and E to hits dataset #127

Conversation

YifanC commented May 24, 2024

krwood left a comment

Choose a reason for hiding this comment

krwood May 28, 2024

Choose a reason for hiding this comment

YifanC May 30, 2024

Choose a reason for hiding this comment

krwood May 30, 2024

Choose a reason for hiding this comment

YifanC May 30, 2024

Choose a reason for hiding this comment