xsec from fortran and cpp differ in gq_ttq tmad tests (wrong order of couplings) #748
Comments
I started looking into this. What I am doing is to create, via our new plugin interface, the directories for fortran and c++ code with
and
respectively. My first observation is that there are different numbers of P1 directories produced, i.e.
and
Is this related to the […]? Going a bit further, I then produce the cross sections for fortran/P1_gq_ttxq and cpp/P1_gu_ttxu, cpp/P1_gux_ttxux, where I end up with numbers, e.g.
and
Can I sum up the two cpp ones to get the fortran one? They don't … there is a factor 2 between them. Before going further I wanted to see if I can reproduce the same numbers in fortran with the two P1 directories generated. Can this generation of the two P1 directories be forced?
Hi @roiser, thanks for looking into this. Apologies, I should have posted here the details of where I see the errors. There is a ready-made script where you get everything automatically. For instance with current upstream/master:
This is one of the tests I run every time I do an MR. The log is always here. Specifically: the discrepancy is not a factor two, so I am not sure your findings explain it. Also because I do not think you are comparing the right numbers, see below. About the number of directories produced in pp_ttj and gg_ttj, this is an interesting issue but I think it is a known one (and unrelated to the gq_ttq cross section mismatch). There was one setting which was the default in fortran-without-second-exporter, which however was giving issues in cudacpp (something about mirroring? something about q and q~ better being in separate P1s? ... I will try to find the issue number). As a result, Olivier suggested to disable this setting for cudacpp, and this is (I think) why you see different numbers of P1 directories. Anyway, when I say that you are not comparing the right numbers: above you are comparing the fortran xsec with one setting for the P1s against the cpp xsec with a different setting for the P1s. So a factor two may come from that, and might even be normal (instead of one P1 with an xsec of 2X, you have two P1s each with an xsec of X, the sum being 2X anyway?). What I am comparing in the tmad test is instead the cpp with its P1 setting against the fortran with the same P1 setting as the cpp. In other words: I am generating the cudacpp code and then using the fortran that we find inside that same cpp code directory. I have not checked that this is consistent with the fortran in the optimized fortran setting, but I assume that Olivier had done this right and there was no need to check. I will try to find the issue for this doubling of directories now.
(By the way: I think that the difference is essentially using fbridge mode = 0 or 1... actually tmad probably uses two different executables, madevent_fortran and madevent_cpp, but I guess that you could also use madevent_cpp in both cases with the options fbridge=0 and fbridge=1. If you use the option fbridge=-1, this will print out event-by-event comparisons of fortran and cpp MEs, which might help.)
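As an aside, here is a minimal sketch of how such an event-by-event dump could be post-processed offline; the file names and the one-ME-per-line format are assumptions for illustration, not the actual fbridge=-1 output format:

```python
# Compare two hypothetical event-by-event ME dumps, assuming one
# floating-point ME per line and the same event order in both files.
def compare_me_dumps(fortran_file, cpp_file, rel_tol=1e-10):
    with open(fortran_file) as ff, open(cpp_file) as cf:
        for ievt, (line_f, line_c) in enumerate(zip(ff, cf)):
            me_f, me_c = float(line_f), float(line_c)
            rel = abs(me_c / me_f - 1.0) if me_f != 0.0 else float("inf")
            if rel > rel_tol:
                print(f"event {ievt}: fortran={me_f:.15e} cpp={me_c:.15e} rel={rel:.2e}")

compare_me_dumps("mes_fortran.txt", "mes_cpp.txt")  # placeholder file names
```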
I think (@oliviermattelaer should confirm) that what @roiser has just described above is related to "mirror processes" and to "nprocesses>1". This was discussed in issue #272 and MR #396. See in particular the figure in issue #272.
So, to bring back the focus on gq_ttq: the issue that we should solve is the following:
Thanks Andrea, I'm trying to debug this on a finer-grained level. Actually, when comparing the two, "fortran" and "cpp", there are differences in the cross sections for most of them (but not all); I wonder if this is a valid way of tracking down the bug. I presume we need @oliviermattelaer for this. E.g. I run
and receive for fortran
and for C++
Note also that the madX test runs only one of the P1s:
Thanks Stefan. I think that what you are testing at another level is also useful - but for the purpose of debugging this issue #748, I would just focus on the way this bug was identified, no need to do it differently. IMHO you are overcomplicating it if you go your way, but then it's your choice ;-) Note also: cross sections depend in a very subtle way on which events are used in the sampling, and if you use different settings you most likely end up with different events. This means you must run the tests at a statistical level, not bit by bit. I would stick to the madX script...
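As an illustration of what a comparison "at a statistical level" could look like, here is a minimal sketch; the numbers and the 3-sigma criterion are placeholders, not what the madX script actually does:

```python
import math

# Two cross sections with their Monte Carlo uncertainties (pb), e.g. from
# two runs with different P1/sampling settings. Values are placeholders.
xsec_a, err_a = 0.27110, 0.00012
xsec_b, err_b = 0.27132, 0.00015

# Compatible if the difference is within a few combined standard deviations.
nsigma = abs(xsec_a - xsec_b) / math.sqrt(err_a**2 + err_b**2)
print(f"difference = {nsigma:.2f} sigma -> {'compatible' if nsigma < 3 else 'NOT compatible'}")
```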
PS
I started producing the cross sections for processes which produce only a single subprocess, and they show differences for those with gluons and quarks in the initial state.
Thanks again, I think it's much better to produce those simpler processes. I can reproduce the error now using the mg5 plugin interface. Just to make sure, I also ran a few other processes which look OK, always comparing
Quick update on the status of things: as seen above, g u > t t~ u does produce wrong results. After a chat with Olivier we concluded that the problem should be on the cudacpp side (alphaS?). In order to prove this I want to compare the "standalone" versions of fortran and cudacpp. The problem now is that "output standalone" produces a compilation error (see below): the vector.inc file is not produced. I need to check the code-generating code.
A fix for the above issue of the missing vector.inc has been proposed in mg5amcnlo/mg5amcnlo#65
I confirm that proc_gu_ttxu_for…
@roiser @valassi @oliviermattelaer For native MGaMC I get:
whereas for cudacpp I get
which differ by roughly, but not exactly, a factor 2. So there definitely seems to be an issue in the amplitude evaluation, but as said, this is only a single phase space point that I've checked.
Can you check the value of the couplings?
olivier
On 23 Aug 2023, at 14:37, Zenny Wettersten ***@***.***> wrote:
@roiser @valassi @oliviermattelaer
I just did a quick standalone check of cudacpp vs upstream for the same phase space point
For native MGaMC I get:
Phase space point:
-----------------------------------------------------------------------------
n E px py pz m
1 0.7500000E+03 0.0000000E+00 0.0000000E+00 0.7500000E+03 0.0000000E+00
2 0.7500000E+03 0.0000000E+00 0.0000000E+00 -0.7500000E+03 0.0000000E+00
3 0.1354789E+03 0.1344191E+03 -0.1686881E+02 -0.1209253E+01 0.1364261E+00
4 0.6441284E+03 0.2809898E+03 -0.3353039E+03 0.4727762E+03 0.3109229E+00
5 0.7203927E+03 -0.4154090E+03 0.3521728E+03 -0.4715670E+03 0.2978233E+00
-----------------------------------------------------------------------------
Matrix element = 4.4143826846910579E-003 GeV^ -2
-----------------------------------------------------------------------------
whereas for cudacpp I get
--------------------------------------------------------------------------------
Momenta:
1 7.500000e+02 0.000000e+00 0.000000e+00 7.500000e+02
2 7.500000e+02 0.000000e+00 0.000000e+00 -7.500000e+02
3 1.354789e+02 1.344191e+02 -1.686881e+01 -1.209253e+00
4 6.441284e+02 2.809898e+02 -3.353039e+02 4.727762e+02
5 7.203927e+02 -4.154090e+02 3.521728e+02 -4.715670e+02
--------------------------------------------------------------------------------
Matrix element = 0.00203141 GeV^-2
--------------------------------------------------------------------------------
which differ by roughly but not exactly a factor 2. So, there definitely seems to be an issue in the amplitude evaluation, but as said this is only a single phase space point I've checked.
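For reference, the ratio of the two quoted matrix elements can be checked directly from the numbers printed in the logs above:

```python
me_fortran = 4.4143826846910579e-03  # native MG5aMC, GeV^-2
me_cudacpp = 0.00203141              # cudacpp, GeV^-2 (printed with fewer digits)
print(me_fortran / me_cudacpp)       # ~2.17, i.e. close to but not exactly a factor 2
```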
Thanks @zeniheisser !! I did the same this morning and saw the same. Another thing worries me a bit: I thought we were able to produce exactly the same values for Fortran and cudacpp for other processes. E.g. I tried "g g > t t~" and the values are close but not the same.
@roiser can you do this? I could take a quick look at it tomorrow otherwise, but I'm not exactly drowning in free time at the moment hehe
@roiser, I think you'll have agreement to the level of momentum precision though, no? That's what I have seen when doing tests --- there is a slight difference, but roughly on the order of the precision of the momenta I've explicitly set for the processes.
I checked now, the G value is the same in fortran and cudacpp |
The G value is the same, but when running the debugger I see that in fortran and cudacpp the couplings GC_10 and GC_11 are swapped. E.g. I find in fortran
while in cudacpp the values are for
and
I now swapped the two COUPs entries in the arguments of the ixxx/FFV functions and this yields the same matrix element value!!! Now we need to find out why, only in this case, the couplings had been swapped ...
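To illustrate why a swap of two couplings changes the matrix element by a process-dependent amount (rather than, say, an exact factor of 2), here is a toy numerical example; it is not the actual cudacpp FFV code, and all values are made up:

```python
# Toy illustration only (NOT the real ixxx/FFV amplitudes): if two partial
# amplitudes are weighted by two different couplings, passing the couplings
# in the wrong order changes the squared amplitude by an arbitrary amount.
gc_10 = 1.2177 + 0.0j        # placeholder value, not the SM GC_10
gc_11 = 0.0 - 1.2177j        # placeholder value, not the SM GC_11
amp_a = 0.7 + 0.2j           # placeholder partial amplitude
amp_b = -0.3 + 0.5j          # placeholder partial amplitude

me_right = abs(gc_10 * amp_a + gc_11 * amp_b) ** 2
me_wrong = abs(gc_11 * amp_a + gc_10 * amp_b) ** 2
print(me_right, me_wrong, me_right / me_wrong)  # ratio is neither 1 nor a simple factor
```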
Good job!! That's a huge finding (and so easy to miss). Thanks so much.
…qttq xsec is now correct! fixes high-priority issue madgraph5#748
These three tests now succeed (they used to fail)
./tmad/teeMadX.sh +10x -gqttq -makeclean
./tmad/teeMadX.sh +10x -gqttq -makeclean -fltonly
./tmad/teeMadX.sh +10x -gqttq -makeclean -mixonly
NB: eemumu code generation remains to be fixed after PR madgraph5#757
…) and gqttq (xsec differs again madgraph5#748)
STARTED AT Sat Oct 28 01:08:49 PM CEST 2023
ENDED AT Sat Oct 28 01:30:55 PM CEST 2023
Status=0
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_eemumu_mad/log_eemumu_mad_d_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_eemumu_mad/log_eemumu_mad_f_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_eemumu_mad/log_eemumu_mad_m_inl0_hrd0.txt
0 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttggg_mad/log_ggttggg_mad_d_inl0_hrd0.txt
0 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttggg_mad/log_ggttggg_mad_f_inl0_hrd0.txt
0 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttggg_mad/log_ggttggg_mad_m_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttgg_mad/log_ggttgg_mad_d_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttgg_mad/log_ggttgg_mad_f_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttgg_mad/log_ggttgg_mad_m_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttg_mad/log_ggttg_mad_d_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttg_mad/log_ggttg_mad_f_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttg_mad/log_ggttg_mad_m_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggtt_mad/log_ggtt_mad_d_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggtt_mad/log_ggtt_mad_f_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggtt_mad/log_ggtt_mad_m_inl0_hrd0.txt
0 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_gqttq_mad/log_gqttq_mad_d_inl0_hrd0.txt
0 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_gqttq_mad/log_gqttq_mad_f_inl0_hrd0.txt
0 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_gqttq_mad/log_gqttq_mad_m_inl0_hrd0.txt
---
This is a summary of the changes with respect to my previous logs using the August code base
Functionality:
- ggttggg: madevent crashes
- gqttq: xsec differs again
Performance (Fortran overhead):
-- all very similar
Performance (MEs):
-- eemumu: fortran 20% faster, cuda slightly slower, simd a factor 2 to 3 slower
-- ggtt: fortran and simd 20% faster, cuda similar
-- ggttg: fortran 10% faster, simd and cuda similar
-- ggttgg: all very similar
+ERROR! ' ./madevent_fortran < /tmp/avalassi/input_ggttggg_x1_fortran > /tmp/avalassi/output_ggttggg_x1_fortran' failed
+d R # 5 > -0.0 -0.0 -0.0 0.4 0.4
+d R # 6 > -0.0 -0.0 -0.0 -0.0 0.4
+s min # 3> 0.0119716.0 29929.0 29929.0 0.0
+s min # 4> 0.0 0.0 29929.0 29929.0 0.0
+s min # 5> 0.0 0.0 0.0 0.0 0.0
+s min # 6> 0.0 0.0 0.0 0.0 0.0
+xqcutij # 3> 0.0 0.0 0.0 0.0 0.0
+xqcutij # 4> 0.0 0.0 0.0 0.0 0.0
+xqcutij # 5> 0.0 0.0 0.0 0.0 0.0
+xqcutij # 6> 0.0 0.0 0.0 0.0 0.0
+ERROR! xsec from fortran (0.26050333309703716) and cpp (1.2757941949814184) differ by more than 2E-14 (3.8974198518457603)
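The final error line can be reproduced arithmetically from the two xsec values printed in the log (whether tmad computes the relative difference exactly this way internally is an assumption here):

```python
xsec_fortran = 0.26050333309703716
xsec_cpp     = 1.2757941949814184

rel_diff = abs(xsec_cpp / xsec_fortran - 1.0)
print(rel_diff)          # 3.8974198518457603, matching the number in the error message
print(rel_diff > 2e-14)  # True -> the tmad test reports a mismatch
```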
Hi @valassi, thanks, good catch. I quickly checked the cpp vs fortran and indeed the couplings are swapped back into the wrong order for e.g. gu>ttxu. From quickly looking at #761 I realise that the ordering of the couplings follows the correct order in …
Ok, I couldn't wait ;-), there is #782. The trick was to swap the two containers when filtering out the running couplings from wanted_couplings; this preserves the order of the filtered keys. NB: this is WIP, I tested a few processes but will do more checks.
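A minimal sketch of the idea in generic Python (not the actual exporter code; the variable names and values are placeholders): filtering by iterating over the ordered container preserves the original coupling order, whereas iterating over wanted_couplings would impose its (possibly different) order.

```python
# Placeholder data, for illustration only: the ordered couplings of the model
# and the subset flagged as running couplings.
ordered_couplings = ['GC_10', 'GC_11', 'GC_12']
wanted_couplings = {'GC_11', 'GC_10'}  # a set (or a differently ordered list)

# Order-preserving filter: loop over the ordered container and test membership
# in the other one, so GC_10 still comes before GC_11 in the result.
running = [c for c in ordered_couplings if c in wanted_couplings]
print(running)  # ['GC_10', 'GC_11']

# Looping over wanted_couplings instead would impose *its* (arbitrary) order,
# which is how GC_10 and GC_11 could end up swapped in the generated code.
```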
…t23av This is meant to fix gqttq madgraph5#748
Thanks Stefan! I checked gqttq and it looks good; I merged PR #782
I only checked gu>ttxu and dy+2j; let me re-open this one to remind me to do more checks, so we can be sure it's really fixed.
Hi Stefan, did you manage to run your tests here? Is this confirmed fixed? Thanks
Thanks, yes this is all fine, closing it.
This is a follow-up to #630, which had originally been filed about gq_ttq but eventually focused on gg_uu: xsec from fortran and cpp differ in gq_ttq tmad tests.
In #630, I had also found out that the xsec from fortran and cpp also differed in the gg_uu tmad tests, but with Zenny's help this was identified as a problem in getGoodHel and fixed. This however fixed gg_uu, but not gq_ttq, where the xsecs are still different. For clarity, I move gq_ttq here, while I will soon close #630 about gg_uu.