v4 to v5 Port #48

Merged: 149 commits merged into develop on Oct 15, 2024
Conversation

aeoranday (Member)

This simply ports this repository from v4 to v5.

Most of the dependencies for this port were data-format related, and those have already been ported.

Testing was done by viewing the t*_dump.py output for various files from both v4 and v5 and confirming the results are sensible.

This also features a slight change to docstrings and an enhancement that was missed in v4.

The usage of process_tpstream is consistent and was previously addressed when the naming change happened in triggeralgs for v5.

alessandrothea and others added 30 commits January 26, 2024 09:04
Removed Fragment Separation From TAData & TPData Classes
@aeoranday aeoranday added the maintenance Addresses a user request or a change in another part of the system label Sep 11, 2024
@aeoranday aeoranday self-assigned this Sep 11, 2024
@MRiganSUSX (Contributor)

Hi Alex,

Thanks for the work on this. Here is my first round of review:

- new tag for v5
- fix build warnings
- link to process_tpstream documentation (same as for the others)
- note on what channel map is for PD2HD
- the default config for process_tpstream has invalid names for the algos...

process_tpstream:
"
 TS gap: 62499456 999.991296 ms
		Average TA Time Process (0 TAs): 0 ns.
		Average TC Time Process (0 TCs): 0 ns.
"

For process_tpstream --latencies, please add a header for the columns.

For ta_dump, could the list of commands at the bottom be ordered either alphabetically or in the order they were introduced in the text above?

Seen an error with ta_dump in v4:
"
(dbt) [mrigan@np04-srv-016 trgtools_old]$ ta_dump.py /nfs/rscratch/mrigan/np04hd_raw_run029349_0000_dataflow0_datawriter_0_20240926T114848.hdf5 
Saving outputs to ./ta_29349-0000_figures_0000.*
Number of TAs: 3
Writing descriptive statistics to ta_29349-0000_figures_0000.txt.
Traceback (most recent call last):
  File "/nfs/home/mrigan/dune-daq/fddaq-v4.4.8-a9/install/trgtools/bin/ta_dump.py", line 551, in <module>
    main()
  File "/nfs/home/mrigan/dune-daq/fddaq-v4.4.8-a9/install/trgtools/bin/ta_dump.py", line 488, in main
    plot_data = np.array([datum.value for datum in data.ta_data[ta_key]], dtype=int)
  File "/nfs/home/mrigan/dune-daq/fddaq-v4.4.8-a9/install/trgtools/bin/ta_dump.py", line 488, in <listcomp>
    plot_data = np.array([datum.value for datum in data.ta_data[ta_key]], dtype=int)
AttributeError: 'numpy.uint8' object has no attribute 'value'
"
This was not the case for the same file in v5, which makes it impossible to compare 'apples to apples'.
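Not the actual ta_dump.py fix, just a minimal defensive sketch of the kind of guard that would tolerate both overlay fields exposing .value and plain numpy scalars (the failure mode above); the helper name is made up:

    import numpy as np

    def to_int_array(values) -> np.ndarray:
        # Hypothetical guard: fall back to the raw scalar when an entry
        # has no .value attribute (e.g. it is already a numpy.uint8).
        return np.array([getattr(v, "value", v) for v in values], dtype=int)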

(tx_dump) Nit-picking, but if you have time:
  -- detector IDs are integers, so the plot ticks should be too (see the sketch below)
  -- the number of TAs per TC does not need to go below 0 (same for other counting plots)
  -- 'TC Relative Time Spans' plot: the red bars need to be on top of the black line + screenshot
  -- (tp_dump) 'TP Time Over Threshold vs Channel': slightly higher transparency on the dots
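On the integer-ticks nit, a minimal matplotlib sketch of one way to force integer tick locations (not necessarily how PDFPlotter implements its option):

    import matplotlib.pyplot as plt
    from matplotlib.ticker import MaxNLocator

    fig, ax = plt.subplots()
    ax.hist([2, 3, 3, 4, 7], bins=6)  # e.g. integer detector IDs
    ax.xaxis.set_major_locator(MaxNLocator(integer=True))  # integer tick positions only
    fig.savefig("detector_ids.pdf")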

TPs with ADC peak > ADC integral: np04hd_raw_run029194_0002_dataflow2_datawriter_0_20240916T072613.hdf5 + screenshot

convert_tplatencies:
"
(dbt) [mrigan@np04-srv-016 trgtools_old]$ convert_tplatencies.py -r /nfs/rscratch/mrigan/np04hd_raw_run029349_0000_dataflow0_datawriter_0_20240926T114848.hdf5 -l ta_timings_processed.csv 
Loading TAs from hdf5
Loading TPs from csv
Traceback (most recent call last):
  File "/nfs/home/mrigan/dune-daq/fddaq-v4.4.8-a9/install/trgtools/bin/convert_tplatencies.py", line 109, in <module>
    main()
  File "/nfs/home/mrigan/dune-daq/fddaq-v4.4.8-a9/install/trgtools/bin/convert_tplatencies.py", line 92, in main
    latencies_whole = tp_data_latencies.iloc[first_tp_idx + 1:tp_making_ta_idx + 1,2].sum()
TypeError: unsupported operand type(s) for +: 'NoneType' and 'int'
"
both v4 and v5...
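A minimal sketch (not the actual convert_tplatencies.py code) of the kind of guard that would avoid the crash above when the first-TP index was never found:

    import pandas as pd

    def summed_latency(tp_data_latencies: pd.DataFrame, first_tp_idx, tp_making_ta_idx) -> float:
        # Hypothetical guard: first_tp_idx can be None when no matching
        # first TP was found, so skip the sum instead of crashing.
        if first_tp_idx is None or tp_making_ta_idx is None:
            return 0.0
        return tp_data_latencies.iloc[first_tp_idx + 1:tp_making_ta_idx + 1, 2].sum()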

In py-trgtools.md:
  -- Plotting -> Example: the example script is broken (it doesn't load tp_data, and when plotting it doesn't access the data correctly)

Also included a link to the channel maps table.
Added and using an option to use integer x-ticks in PDFPlotter histograms.
Previously invisible.
Log minor ticks were not being colored correctly.
Users should define which maker/plugin in a config JSON.
The last TP is sometimes missing, so skip it. Removed a pandas deprecation warning.
This is more consistent with PEP 484.
Indexing the reader class now returns the indexed content of the primary data.
Follows the flow change that happened in `trigger`.
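On the "indexing the reader class" commit above, a hypothetical sketch of the pattern (illustrative class and field names, not the actual trgtools reader):

    import numpy as np

    class FragmentReader:
        def __init__(self) -> None:
            # Structured array standing in for the primary (TP/TA/TC) data.
            self._data = np.zeros(0, dtype=[("adc_peak", np.uint32), ("channel", np.uint32)])

        def __getitem__(self, key):
            # Indexing the reader returns the indexed content of the primary data.
            return self._data[key]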
@ArturSztuc (Contributor) left a comment:

LGTM.
One suggestion made, but it's not related to the v4->v5 move, so it's fine to ignore.

Comment on lines +248 to +259

   char* payload = static_cast<char*>(malloc(payload_size));

   size_t offset(0);
   for ( const auto& ta : ta_buffer ) {
-    triggeralgs::write_overlay(ta, payload+offset);
+    triggeralgs::write_overlay(ta, static_cast<void*>(payload+offset));
     offset += triggeralgs::get_overlay_nbytes(ta);
   }

-  daqdataformats::Fragment ta_frag(payload, payload_size);
+  daqdataformats::Fragment ta_frag(static_cast<void*>(payload), payload_size);

-  free(payload);
+  free(static_cast<void*>(payload));

Both here and in EmulationUnit, I would totally avoid using malloc and free. This is low-level C that should only be used if it really cannot be avoided. In this case we could simply do:

  std::vector<char> payload(payload_size);

  size_t offset(0);
  for ( const auto& ta : ta_buffer ) {
    triggeralgs::write_overlay(ta, static_cast<void*>(payload.data()+offset));
    offset += triggeralgs::get_overlay_nbytes(ta);
  }
  daqdataformats::Fragment ta_frag(static_cast<void*>(payload.data()), payload_size);

And no free would be needed, since the vector would go out of scope anyway (but you could clear it if you must).

BUT this is not related to the v4-to-v5 move, so I'm fine if that's done at a later date in a separate PR.

@@ -80,18 +80,19 @@ EmulationUnit<T, U, V>::emulate_vector(const std::vector<T>& inputs) {
   if (payload_size == 0)
     return nullptr;

-  void* payload = malloc(payload_size);
+  // Awkward type conversion to avoid compiler complaints on void* arithmetic.
+  char* payload = static_cast<char*>(malloc(payload_size));

Ditto re. malloc & free; a vector could be used here instead.

@@ -212,6 +212,7 @@ int main(int argc, char const *argv[])
   if (latencies) {
     std::filesystem::path output_path(output_file_path);
     ta_emulator->set_timing_file((output_path.parent_path() / ("ta_timings_" + output_path.stem().string() + ".csv")).string());
+    ta_emulator->write_csv_header("TP Time Start,TP ADC Integral,Time Diffs,Is Last TP In TA");

I like this, although I wish it had been done in a separate PR, not in a simple "v4 to v5" port. I'm now having to review the PR commit by commit :(

@aeoranday (Member, Author):

I understand it's out of scope, but it was part of the requested changes. I tried to keep the commits well separated by what they changed.

@@ -7,6 +7,7 @@
 import trgdataformats

 import numpy as np
+from numpy.typing import NDArray

Had no idea numpy has its own typing module. Nice.
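For reference, a minimal sketch of annotating an array-returning helper with numpy.typing (illustrative names, not code from this PR):

    import numpy as np
    from numpy.typing import NDArray

    def channel_ids(raw: bytes) -> NDArray[np.uint16]:
        # Decode raw bytes into an array of channel IDs.
        return np.frombuffer(raw, dtype=np.uint16)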

Only removes it from the build. The cxx is kept for reference.
@MRiganSUSX (Contributor) left a comment:

Thanks @aeoranday for the improvements, it's already better.

Here are more comments after another round of review:

  1. Running process_tpstream over the same TP stream hdf5 file with the same configuration (barring the change in algo names, but both prescale):

    V4:
		Average TA Time Process (19702 TAs): 7410 ns.
		Average TC Time Process (197 TCs): 7758 ns.
    -- Finished TSL 118490:0

    V5:
		Average TA Time Process (19695 TAs): 18111 ns.
		Average TC Time Process (197 TCs): 21811 ns.
    -- Finished TSL 118490:0

    => Different number of TAs, and much, much slower?
    (The output file is also very slightly smaller - guessing that's the 'missing' TAs?)

  2. Running process_tpstream with --latencies:
    -- (same issue as above with the missing TAs)
    -- the 'Time Diffs' parameters differ for the same TA(s):

    V4:
    108017674586074187,16506,7949,1   <- ??
    108017674565021675,26052,147,0

    V5:
    108017674586074187,16506,8115,0
    108017674565021675,26052,953,0
  3. convert_tplatencies:
    What exactly is the point of convert_tplatencies? There do not seem to be any scripts available that use its output. And, if what it produces is the desired result, why don't we get it straight from process_tpstream with --latencies? Is that intermediate output used for anything other than as the input to this converter?

  4. tp_dump:
    Still getting weird TPs (screenshot attached):
    [Screenshot from 2024-10-14 13-26-26]

  5. tc_dump:

  • When used on the output file made with process_tpstream (the input to that processing step is the same in both versions):
    even though the # of TCs made IS the same, the TCs themselves differ, as the TC stats are different:
    V4:
time_candidate
Reference Statistics:
        Total # TCs = 10,
        Mean = 2.13,
        Std = 1.37,
        Min = 0.0,
        Max = 4.220520496368408.
Anomalies:
        # of >3 Sigma TCs = 0,
        # of >2 Sigma TCs = 0.


time_end
Reference Statistics:
        Total # TCs = 10,
        Mean = 2.13,
        Std = 1.37,
        Min = 0.0,
        Max = 4.220515489578247.
Anomalies:
        # of >3 Sigma TCs = 0,
        # of >2 Sigma TCs = 0.


time_start
Reference Statistics:
        Total # TCs = 10,
        Mean = 2.13,
        Std = 1.37,
        Min = 0.0,
        Max = 4.220520496368408.
Anomalies:
        # of >3 Sigma TCs = 0,
        # of >2 Sigma TCs = 0.

V5:

time_candidate
Reference Statistics:
        Total # TCs = 10,
        Mean = 2.02,
        Std = 1.46,
        Min = 0.0,
        Max = 4.214995384216309.
Anomalies:
        # of >3 Sigma TCs = 0,
        # of >2 Sigma TCs = 0.


time_end
Reference Statistics:
        Total # TCs = 10,
        Mean = 2.02,
        Std = 1.46,
        Min = 0.0,
        Max = 4.214994430541992.
Anomalies:
        # of >3 Sigma TCs = 0,
        # of >2 Sigma TCs = 0.


time_start
Reference Statistics:
        Total # TCs = 10,
        Mean = 2.02,
        Std = 1.46,
        Min = 0.0,
        Max = 4.214995384216309.
Anomalies:
        # of >3 Sigma TCs = 0,
        # of >2 Sigma TCs = 0.

(using this dump on the same raw hdf5 (without the process_ step), the output is identical).

  6. py-trgtools:
    -- The very last command block (Plotting -> Example) is missing the command to load the data:
    tpr.read_all_fragments() (or something similar that takes a subset)
    Without that:
    >>> print(tpr['adc_peak'])
    []
    (See the sketch after this list.)

-- The output example.pdf seems to be a broken/corrupted (empty) file (tried 3 different inputs).
-- There is an error message when exiting the interactive Python session:

Exception ignored in: <function PDFPlotter.__del__ at 0x7fb183882680>
Traceback (most recent call last):
  File "/nfs/home/mrigan/dune-daq/v5.2-241010/install/trgtools/lib64/python/trgtools/plot/PDFPlotter.py", line 91, in __del__
  File "/nfs/home/mrigan/dune-daq/v5.2-241010/.venv/lib/python3.10/site-packages/matplotlib/backends/backend_pdf.py", line 2721, in close
  File "/nfs/home/mrigan/dune-daq/v5.2-241010/.venv/lib/python3.10/site-packages/matplotlib/backends/backend_pdf.py", line 828, in finalize
  File "/nfs/home/mrigan/dune-daq/v5.2-241010/.venv/lib/python3.10/site-packages/matplotlib/backends/backend_pdf.py", line 974, in writeFonts
  File "/nfs/home/mrigan/dune-daq/v5.2-241010/.venv/lib/python3.10/site-packages/matplotlib/backends/backend_pdf.py", line 1451, in embedTTF
  File "/nfs/home/mrigan/dune-daq/v5.2-241010/.venv/lib/python3.10/site-packages/matplotlib/backends/backend_pdf.py", line 1182, in embedTTFType3
ImportError: sys.meta_path is None, Python is likely shutting down
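
As referenced in the py-trgtools point above, a hedged sketch of what the corrected Plotting example could look like, assuming the TPReader class name and the read_all_fragments() method mentioned in the review (both names and the input file are assumptions here):

    import trgtools  # assumption: the Python module exposing the TP reader

    tpr = trgtools.TPReader("np04hd_raw_run029349_0000.hdf5")  # hypothetical input file
    tpr.read_all_fragments()   # load the TP data first; without this tpr['adc_peak'] is empty
    print(tpr["adc_peak"])     # indexing returns the primary TP data column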

I understand that a lot (if not all) of these are not caused by your port. I'm just listing my findings here as I went through testing.
Please open issues for the points that you are not fixing in this PR.

@aeoranday merged commit b2816b8 into develop on Oct 15, 2024
1 check passed