Skip to content

Commit

Permalink
[tmad] rerun 102 tput tests on itscrd90 - all ok (after merging madgr…
Browse files Browse the repository at this point in the history
…aph5#860 and madgraph5#850 for Ccoeff madgraph5#825)

STARTED  AT Mon Jun  3 05:51:20 PM CEST 2024
./tput/teeThroughputX.sh -mix -hrd -makej -eemumu -ggtt -ggttg -ggttgg -gqttq -ggttggg -makeclean
ENDED(1) AT Mon Jun  3 06:12:34 PM CEST 2024 [Status=0]
./tput/teeThroughputX.sh -flt -hrd -makej -eemumu -ggtt -ggttgg -inlonly -makeclean
ENDED(2) AT Mon Jun  3 06:20:51 PM CEST 2024 [Status=0]
./tput/teeThroughputX.sh -makej -eemumu -ggtt -ggttg -gqttq -ggttgg -ggttggg -flt -bridge -makeclean
ENDED(3) AT Mon Jun  3 06:29:05 PM CEST 2024 [Status=0]
./tput/teeThroughputX.sh -eemumu -ggtt -ggttgg -flt -rmbhst
ENDED(4) AT Mon Jun  3 06:31:55 PM CEST 2024 [Status=0]
./tput/teeThroughputX.sh -eemumu -ggtt -ggttgg -flt -curhst
ENDED(5) AT Mon Jun  3 06:34:42 PM CEST 2024 [Status=0]
./tput/teeThroughputX.sh -eemumu -ggtt -ggttgg -flt -common
ENDED(6) AT Mon Jun  3 06:37:37 PM CEST 2024 [Status=0]
./tput/teeThroughputX.sh -mix -hrd -makej -susyggtt -susyggt1t1 -smeftggtttt -heftggbb -makeclean
ENDED(7) AT Mon Jun  3 06:47:12 PM CEST 2024 [Status=0]

No errors found in logs
  • Loading branch information
valassi committed Jun 3, 2024
1 parent f9fceae commit f174f4f
Show file tree
Hide file tree
Showing 102 changed files with 5,969 additions and 5,973 deletions.
96 changes: 48 additions & 48 deletions epochX/cudacpp/tput/logs_eemumu_mad/log_eemumu_mad_d_inl0_hrd0.txt
Original file line number Diff line number Diff line change
Expand Up @@ -40,7 +40,7 @@ make[1]: Entering directory '/data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp
make[1]: Nothing to be done for 'all'.
make[1]: Leaving directory '/data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_epem_mupmum'

DATE: 2024-06-02_20:48:44
DATE: 2024-06-03_17:55:26

On itscrd90.cern.ch [CPU: Intel(R) Xeon(R) Silver 4216 CPU] [GPU: 1x Tesla V100S-PCIE-32GB]:
=========================================================================
Expand All @@ -49,15 +49,15 @@ INFO: The following Floating Point Exceptions will cause SIGFPE program aborts:
Process = SIGMA_SM_EPEM_MUPMUM_CUDA [nvcc 12.0.140 (gcc 11.3.1)] [inlineHel=0] [hardcodePARAM=0]
Workflow summary = CUD:DBL+THX:CURDEV+RMBDEV+MESDEV/none+NAVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
EvtsPerSec[Rmb+ME] (23) = ( 7.447081e+07 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 4.931434e+08 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 7.173062e+08 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 6.381319e+07 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 4.697367e+08 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 7.143178e+08 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 0.676046 sec
TOTAL : 0.737241 sec
INFO: No Floating Point Exceptions have been reported
2,567,759,777 cycles # 2.819 GHz
3,947,530,526 instructions # 1.54 insn per cycle
0.969595478 seconds time elapsed
2,630,475,030 cycles # 2.802 GHz
4,094,804,374 instructions # 1.56 insn per cycle
1.031371639 seconds time elapsed
runNcu /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_epem_mupmum/build.cuda_d_inl0_hrd0/check_cuda.exe -p 2048 256 1
==PROF== Profiling "sigmaKin": launch__registers_per_thread 166
==PROF== Profiling "sigmaKin": sm__sass_average_branch_targets_threads_uniform.pct 100%
Expand All @@ -81,16 +81,16 @@ Workflow summary = CPP:DBL+CXS:CURHST+RMBHST+MESHST/none+NAVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
Internal loops fptype_sv = SCALAR ('none': ~vector[1], no SIMD)
OMP threads / `nproc --all` = 1 / 4
EvtsPerSec[Rmb+ME] (23) = ( 1.052568e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 1.236916e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 1.236916e+06 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 1.012540e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 1.179773e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 1.179773e+06 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 6.401690 sec
TOTAL : 6.634856 sec
INFO: No Floating Point Exceptions have been reported
18,320,184,384 cycles # 2.860 GHz
43,970,344,438 instructions # 2.40 insn per cycle
6.407522814 seconds time elapsed
=Symbols in CPPProcess_cpp.o= (~sse4: 433) (avx2: 0) (512y: 0) (512z: 0)
19,043,816,100 cycles # 2.871 GHz
46,110,907,096 instructions # 2.42 insn per cycle
6.640203777 seconds time elapsed
=Symbols in CPPProcess_cpp.o= (~sse4: 476) (avx2: 0) (512y: 0) (512z: 0)
-------------------------------------------------------------------------
runExe /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_epem_mupmum/build.none_d_inl0_hrd0/runTest_cpp.exe
[ PASSED ] 3 tests.
Expand All @@ -109,16 +109,16 @@ Workflow summary = CPP:DBL+CXS:CURHST+RMBHST+MESHST/sse4+CXVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
Internal loops fptype_sv = VECTOR[2] ('sse4': SSE4.2, 128bit) [cxtype_ref=YES]
OMP threads / `nproc --all` = 1 / 4
EvtsPerSec[Rmb+ME] (23) = ( 1.556597e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 2.031950e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 2.031950e+06 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 1.536456e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 2.001382e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 2.001382e+06 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 4.466648 sec
TOTAL : 4.519211 sec
INFO: No Floating Point Exceptions have been reported
12,746,464,526 cycles # 2.851 GHz
30,998,051,748 instructions # 2.43 insn per cycle
4.472203598 seconds time elapsed
=Symbols in CPPProcess_cpp.o= (~sse4: 1644) (avx2: 0) (512y: 0) (512z: 0)
12,941,312,867 cycles # 2.861 GHz
31,615,854,685 instructions # 2.44 insn per cycle
4.524499594 seconds time elapsed
=Symbols in CPPProcess_cpp.o= (~sse4: 1676) (avx2: 0) (512y: 0) (512z: 0)
-------------------------------------------------------------------------
runExe /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_epem_mupmum/build.sse4_d_inl0_hrd0/runTest_cpp.exe
[ PASSED ] 3 tests.
Expand All @@ -137,16 +137,16 @@ Workflow summary = CPP:DBL+CXS:CURHST+RMBHST+MESHST/avx2+CXVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
Internal loops fptype_sv = VECTOR[4] ('avx2': AVX2, 256bit) [cxtype_ref=YES]
OMP threads / `nproc --all` = 1 / 4
EvtsPerSec[Rmb+ME] (23) = ( 1.919243e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 2.664717e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 2.664717e+06 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 1.916088e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 2.660561e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 2.660561e+06 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 3.699016 sec
TOTAL : 3.706215 sec
INFO: No Floating Point Exceptions have been reported
10,057,139,705 cycles # 2.715 GHz
19,364,699,903 instructions # 1.93 insn per cycle
3.704443201 seconds time elapsed
=Symbols in CPPProcess_cpp.o= (~sse4: 0) (avx2: 1966) (512y: 0) (512z: 0)
10,084,827,662 cycles # 2.718 GHz
19,615,618,896 instructions # 1.95 insn per cycle
3.711412090 seconds time elapsed
=Symbols in CPPProcess_cpp.o= (~sse4: 0) (avx2: 1972) (512y: 0) (512z: 0)
-------------------------------------------------------------------------
runExe /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_epem_mupmum/build.avx2_d_inl0_hrd0/runTest_cpp.exe
[ PASSED ] 3 tests.
Expand All @@ -165,16 +165,16 @@ Workflow summary = CPP:DBL+CXS:CURHST+RMBHST+MESHST/512y+CXVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
Internal loops fptype_sv = VECTOR[4] ('512y': AVX512, 256bit) [cxtype_ref=YES]
OMP threads / `nproc --all` = 1 / 4
EvtsPerSec[Rmb+ME] (23) = ( 1.993729e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 2.809982e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 2.809982e+06 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 1.942239e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 2.710200e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 2.710200e+06 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 3.577215 sec
TOTAL : 3.659343 sec
INFO: No Floating Point Exceptions have been reported
9,735,076,070 cycles # 2.718 GHz
18,976,322,211 instructions # 1.95 insn per cycle
3.583082575 seconds time elapsed
=Symbols in CPPProcess_cpp.o= (~sse4: 0) (avx2: 1690) (512y: 181) (512z: 0)
9,850,079,029 cycles # 2.688 GHz
19,274,334,982 instructions # 1.96 insn per cycle
3.664605897 seconds time elapsed
=Symbols in CPPProcess_cpp.o= (~sse4: 0) (avx2: 1713) (512y: 178) (512z: 0)
-------------------------------------------------------------------------
runExe /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_epem_mupmum/build.512y_d_inl0_hrd0/runTest_cpp.exe
[ PASSED ] 3 tests.
Expand All @@ -193,16 +193,16 @@ Workflow summary = CPP:DBL+CXS:CURHST+RMBHST+MESHST/512z+CXVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
Internal loops fptype_sv = VECTOR[8] ('512z': AVX512, 512bit) [cxtype_ref=YES]
OMP threads / `nproc --all` = 1 / 4
EvtsPerSec[Rmb+ME] (23) = ( 1.660221e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 2.180702e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 2.180702e+06 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 1.674322e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 2.201468e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 2.201468e+06 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 4.214341 sec
TOTAL : 4.178974 sec
INFO: No Floating Point Exceptions have been reported
8,602,295,276 cycles # 2.039 GHz
15,727,245,583 instructions # 1.83 insn per cycle
4.219911758 seconds time elapsed
=Symbols in CPPProcess_cpp.o= (~sse4: 0) (avx2: 901) (512y: 154) (512z: 1258)
8,571,642,787 cycles # 2.049 GHz
15,729,577,049 instructions # 1.84 insn per cycle
4.184518706 seconds time elapsed
=Symbols in CPPProcess_cpp.o= (~sse4: 0) (avx2: 900) (512y: 156) (512z: 1257)
-------------------------------------------------------------------------
runExe /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_epem_mupmum/build.512z_d_inl0_hrd0/runTest_cpp.exe
[ PASSED ] 3 tests.
Expand Down
Loading

0 comments on commit f174f4f

Please sign in to comment.