forked from devitocodes/devito
b2b PR #16
Closed
275 commits
6db1d9e
cleaned up makefile
AntonLydike fee500e
fixed interop.c
AntonLydike a0909d1
MLIR dockerfile.
PapyChacal 47b7740
readme: update
georgebisbas 9a3944e
scripts: Update timers
georgebisbas ea8bb95
Update makefile
georgebisbas 8ac06b4
compare.py fix.
PapyChacal 0f38a10
Correct stencil.cast bounds, add clang opts to the Makefile, use clan…
PapyChacal 51cf7fb
2d5pt use to from CLI
PapyChacal 4dfe31f
Bump xDSL to 0.11
PapyChacal 95af73f
Switch requirements syntax.
PapyChacal 168a313
Merge pull request #18 from xdslproject/emilien/xdsl-0.11
AntonLydike c645d55
move interop to proper i64
AntonLydike 43105f2
Switching requirements back.
PapyChacal d41f47a
Use new memref pass after xDSL bump.
PapyChacal 377ca7c
Use clang to compile interop.c. (gcc caused linkage issues on Archer2)
PapyChacal 132fcef
Add a simple script to benchmark different grid sizes.
PapyChacal 84b77ab
Add file output.
PapyChacal a952a6a
Add plotting and SVG output.
PapyChacal faa624e
Increase size more progressively.
PapyChacal f09bd1b
Default max value
PapyChacal 9f96ab2
Cleanup, factorize plotting to depend only on the CSV.
PapyChacal 4c09dd0
Split benchmark and plotting scripts.
PapyChacal 9a96018
Write benchmark data on-the-go (don't lose past runs if interrupted)
PapyChacal ead1632
Return 0 from main if finishing.
PapyChacal cd743ea
Makefile polish + allow skipping output saving & comparison.
PapyChacal dc7c84d
No dump in bench script.
PapyChacal 3febbd0
Display failed bench cmd.
PapyChacal b20abe5
Optional arg to plot.py to start with nth line.
PapyChacal 5a81e05
Slight refactor of the Makefile.
PapyChacal a347150
Add OpenMP pipeline.
PapyChacal 22283c0
Handle OpenMP in runs.
PapyChacal 64e6db3
Add threads bench and plot scripts.
PapyChacal 498a77a
Add CFLAGS to kernel.
PapyChacal 4533ad5
Add cmd's output when failing.
PapyChacal f98e3f2
Refactor 2d5pt.py
PapyChacal a7b360a
More refactor, output "unrounded" times.
PapyChacal d365c8b
Add benchmarks.py.
PapyChacal bc23130
Remove 2d5pt.py and 3d_diff.py in favor of benchmarks.py
PapyChacal 6ee753b
Compile main from benchmarks.py
PapyChacal 14d8b1e
Compile interop in benchmarks.py and use CFLAGS.
PapyChacal 6d7193c
Compile and run kernel from benchmarks.py
PapyChacal 04dcc81
Rename benchmarks.py to fast_benchmarks.py (import reasons).
PapyChacal beb764d
Update plotting scripts to handle lists (by just averaging now).
PapyChacal 8f50d8f
Dump inputs in benchmarks..
PapyChacal aa13ec9
Add points argument to grid_scale.py
PapyChacal 9636e11
Now thread_scale benchmarks EITHER devito or xdsl.
PapyChacal 1c473a6
Don't import thread_scale from plotting script.
PapyChacal ddbf002
Update fast_benchmarks.py 2d5pt initialization to be non-squared resi…
PapyChacal 062923c
Use ellipsis indexing.
PapyChacal 62289d6
bench: Add 2D elastic example
georgebisbas 7a5b8c8
update to work with latest xdsl main
AntonLydike 9a27115
update to work with mpi-sprint-2
AntonLydike b472b38
put the mpi not omp on the correct thingy
AntonLydike f006510
put the mpi not omp on the correct thingy (for real now)
AntonLydike 174d89d
mpi compilation now at a stage where it doesn't segfault
AntonLydike 6a5daff
improvements in preparation for new stencil lowering
AntonLydike 8216511
update for new stencil loops and bounds in types
AntonLydike 83e36ed
fix main generation
AntonLydike d9dca5c
remove a bunch of prints and fix pipeline
AntonLydike f950f65
Better comparison.
PapyChacal 54f0fdc
add scatter/gather to generated main
AntonLydike 6582329
add mpi run mode to fast_benchmarks.py
AntonLydike 7c5e843
fix rank at which data is gathered to 0
AntonLydike 229d0fb
fix missing dump call in non-mpi version of generated main
AntonLydike 1c2f110
(again) fix missing dump call in non-mpi version of generated main
AntonLydike b0e2918
a bunch of debugging helpers but no actual bugs fixed
AntonLydike 09ecc40
remove unused MPI imports
AntonLydike 1b8f21e
add visualization notebook
AntonLydike cd16c16
Add GPU lowering to Devito and a flag in fast_benchmarks.
PapyChacal 4bccb41
Commit the lowering too :)
PapyChacal c717eb4
Small updates.
PapyChacal fcdd224
factor compile arguments in fast_benchmarks for easy benchmarking scr…
PapyChacal 4cbd20e
Update benchmarking scripts.
PapyChacal a477751
add more mpi debugging things to interop.c
AntonLydike c888640
fix missing memref arg handling in interop.c
AntonLydike 5ab37d6
main pipeline; move gpu-to-llvm down, to play nice with MPI (mhh)
PapyChacal 0c357cb
Benchmark scripts CLI args rerefactor.
PapyChacal 70b1999
add asserts for subprocess calls
AntonLydike 73afeb0
fix file name for per-rank dumps
AntonLydike afc1406
init non t0 memrefs to 0, assert exit codes in fast_benchmarks.py
AntonLydike 1cd51f9
fix dumping twice when running mpi or gpu
AntonLydike 42f9fcf
LD_LIBRARY_PATH to -L shenanigans.
PapyChacal 0edd561
Update Makefile and Readme for MPI.
PapyChacal e872df7
Switch from -lmpi to using mpic..
PapyChacal f9b4650
docs: Small pass and check
georgebisbas 4f78ca4
Update to xDSL main.
PapyChacal 1a2064a
Update viz.py
AntonLydike 0abca29
Add archer2 flag to fast_benchmarks to run with srun instead of mpirun.
PapyChacal 52c6a18
Don't initialize GPU memory from host.
PapyChacal 2eb000e
Add proper devito settings to grid_scale.py
PapyChacal 350e8b3
Oops, import, too
PapyChacal 98a6e68
Output grid_scale csv to gpu file.
PapyChacal 133dca5
Try unmangling as part of compare.py
PapyChacal f9e5ea0
Add mpi flag to compare.py and make data PRECIOUS.
PapyChacal 88ca7f8
New GPU pipeline.
PapyChacal bd0071c
Update viz.py
AntonLydike 593a10e
compare.py: fix mpi calculations
AntonLydike 57d6758
fast_benchmarks.py: switch to 2d-grid for mpi
AntonLydike 553e6f3
compare.py: fix mpi node count
AntonLydike 2ac7690
towards jit compiling in devito
AntonLydike f866643
add 2d5pt.py as example
AntonLydike 46ba8e1
add mlir types interop file
AntonLydike c86d77f
make it less insane by running only 4 ranks
AntonLydike d2e34bb
add proper args handling?
AntonLydike 3f5b105
remove steam-run
AntonLydike b300e38
fix dict error
AntonLydike 38f21d9
fix name of generated function
AntonLydike 241dfd4
fix not passing args to construct_cfunction_args
AntonLydike 7951627
fix params used in setup_memref_args
AntonLydike c7de52a
fix missing numpy import
AntonLydike b0321cd
add _C_ctype to Memref type
AntonLydike 90a92c5
fix _C_ctype to Memref type
AntonLydike dde80ad
make memrefs passed explicit
AntonLydike af5898a
fix how func args are typed?
AntonLydike 7e91563
make memrefs passed more explicit
AntonLydike c232b4b
fixed bug in memref to func args
AntonLydike d8f4bad
fixed sequencing error in 2d5pt.py
AntonLydike ef53151
add jit backdoor and print dialect support
AntonLydike 89f7473
remove memref struct from func signature
AntonLydike 967e4ee
make it ready for mpi
AntonLydike 5a9b6bd
make mpi for two ranks work
AntonLydike 7133d22
add pydebug.sh
AntonLydike 3fb6f65
remove debug prints from dmp-to-mpi lowering
AntonLydike a3b84e7
make apply_kernel timed
AntonLydike 3cee8d2
Add expand-strided-metadata to CPU (necessary for MPI, actually) pipe…
PapyChacal 78e05da
make xdsl passes also register timers
AntonLydike 5762b9e
explicitly flush stdout in 2d5pt.py
AntonLydike b51c7ed
fix 2d5pt init
AntonLydike 50d2a2b
update 2d5pt.py
AntonLydike 50bafb1
Reminder for pinning
georgebisbas 96b287c
Add some perf logging prints
georgebisbas 55a2d79
Update with perf logging
georgebisbas 9054c36
update pipeline, remove unused code
AntonLydike 2de0f09
major cleanup of fast_benchmarks, renamed to run_benchmark
AntonLydike 731049e
refactoring of benchmarking tooling
AntonLydike 6c25f83
remove dumping references
AntonLydike fe6d3f9
make printing nicer
AntonLydike 8cedcb6
fix environ key errors
AntonLydike 4f657bc
debug printing
AntonLydike cae562d
capture stderr for compiler errors
AntonLydike 117fde3
capture stderr for compiler errors
AntonLydike 359413b
fix encoding of compiler stderr
AntonLydike 2a77522
move interop compiler to clang
AntonLydike 6df855c
add missing flags for GPU and OMP
AntonLydike 7f8ea7d
typo
AntonLydike 78f42bf
fix openmp pipeline
AntonLydike 0836517
add ability to compare to run_benchmark.py
AntonLydike c6bcee0
added --random-init flag to run_benchmark.py
AntonLydike e499019
fix --random-init flag to run_benchmark.py
AntonLydike 52b9dd7
remove print of ssa in op.apply
AntonLydike 0a89547
fix slow random init
AntonLydike e9f5acf
sanity check for domain
AntonLydike 76718c7
make random init better, fix some crashes
AntonLydike e1628c6
use mpicc when targeting mpi
AntonLydike 770ed94
typo
AntonLydike 14f8840
force mpicc to use our clang
AntonLydike 3132d37
make runner script correctly parse output
AntonLydike 8bedb0e
print command run by run_benchmark
AntonLydike 74c6682
print stderr
AntonLydike ec3723b
srun - fix -n option
AntonLydike 4bed90b
srun fix options some more
AntonLydike 3a2d757
fix python args
AntonLydike b799e47
make correctness data cleaner
AntonLydike f2dd7e5
add report print
AntonLydike aad9481
add new auto benchmarking script
AntonLydike 5369789
run_benchmark.py respect --repeat option
AntonLydike 91a39b9
fix auto_benchmark to work with OMP_PLACES=cores
AntonLydike 191594f
add core pinning to runner
AntonLydike df87b52
smaller runs to make benchmarking turnaround quicker
AntonLydike 2930639
remove stray assert
AntonLydike 71b76ca
add qos option
AntonLydike ed7efcc
add qos option
AntonLydike 0f0cbde
Add nd wave with 3d, no damping field
georgebisbas 52249ca
reduce time limit and add qos options
AntonLydike 38c5bd3
add mpi thread scale
AntonLydike 027785c
fix env in mpi thread scale
AntonLydike 4b64038
fix output of thread scaling
AntonLydike 0962da7
Add .slurm file
georgebisbas d34485c
compiler: Update with cray compiler and Devito main compiler file
georgebisbas 0a432b3
Add heat examples, drop intel redundancy
georgebisbas 1c1a205
Add nd_wave
georgebisbas 9db6592
fix devito cray
AntonLydike 6288ef2
georges changes to benchmarking
AntonLydike a7d3e7b
Merge branch 'anton/devito-run-with-apply' into stencil_mad_dash
AntonLydike 760abfa
nd_nwave time fix.
PapyChacal 2b6039f
Sync to xdsl/main.
PapyChacal 446b397
fix issue apply_kernel not found
AntonLydike 687e857
mpi: Add custom topology from devito codebase
georgebisbas 9af33a5
mpi: Add tests for Custom topology
georgebisbas 2d33ad2
devito/mpi/distributed.py
georgebisbas aa7e25a
Clean benchmark
georgebisbas d4d0ed3
Cleanup benchmark
georgebisbas 416f1ec
bench: cleanup
georgebisbas 51dbc2d
bench: cleanup 3d
georgebisbas fe1650c
bench: more cleanup, drop redundant
georgebisbas 479d478
bench: more cleanup
georgebisbas 1142684
add 'set -eo pipefail' to compiler pipeline to catch errors early
AntonLydike 8e5e7ae
Use /bin/bash for set -eo pipefail.
PapyChacal 335b5f5
add todo.
PapyChacal e9c76a7
Make xDSL flow use a temp .mlir file just like the usual temp .c file.
PapyChacal 5df0f76
operator: Add fixed for xdsloperator compilation - apply_kernel
georgebisbas 6fc0ff7
Lower subviews.
PapyChacal b346498
Link to MLIR runner utils.
PapyChacal bd48ffa
c_runner_utils rather.
PapyChacal d9c4239
Merge branch 'add_custom_topology' into bench_edits
PapyChacal c389f96
Reverse stencil.apply inputs and try to name accordingly.
PapyChacal c5f9552
fix data copy, buffer play
georgebisbas d8b5dba
wave2d.py
georgebisbas a67394e
Comment out pdb.
PapyChacal 6de1971
Fix initial buffer order.
PapyChacal afca66d
add canonicalize-dmp pass to dmp pipeline
AntonLydike f2604c6
Merge pull request #22 from xdslproject/emilien/try-fix-wave
georgebisbas 8087f95
Add tiling.
PapyChacal 63ab369
Add proper quoting.
PapyChacal e15ce97
Add dimensionality-1 tiling dimensions logic.
PapyChacal bacf1af
mpi: Init effort for serial modelling on wave operator
georgebisbas 7f3b37e
mpi: wip
georgebisbas e54cd7b
mpi: wip
georgebisbas 625e976
wave3d: cleanup
georgebisbas b73489a
wave2d: cleanup
georgebisbas e4db7f9
mpi-mfe: Add
georgebisbas ea7fe19
hacky fix for row major dmp.grid
AntonLydike d45c113
bench: Conditional execution heat2d
georgebisbas ae3d586
bench: Conditional execution heat3d
georgebisbas 9a973cc
bench: Generalize benchmarking scripts
georgebisbas 4b34fc5
bench: Generalize wave3d
georgebisbas 7d7e639
wave: TryAdd example with no Operator
georgebisbas 26abd2e
add datatest
georgebisbas fd312d5
setup: Add necessary data
georgebisbas 8db89d7
bench: Load dt to XDSL
georgebisbas bcd7a8d
setup: Save extent
georgebisbas 8c8858a
bench: Add so to saved data
georgebisbas 72fb27d
bench: Add wave3d setup
georgebisbas 722e998
bench: compress saved data
georgebisbas aa73e33
bench: compress properly u.data[:]
georgebisbas 163db36
Merge pull request #23 from xdslproject/emilien/stencil-tiling
georgebisbas ad55988
bench: More cleanup and tiling merge
georgebisbas 6cfe569
bench: Hide pyvista req
georgebisbas 80c0d31
Merge pull request #24 from xdslproject/bench_edits-2
georgebisbas dde653e
Insert necessary boilerplate. stencil lowerings doesn't handle it.
PapyChacal 0d73b6e
Add more sensible and resilient tile sizes.
PapyChacal 9082d9f
Try with a arguments-minimizing pipeline.
PapyChacal 84b5522
Improve args-minimization pipeline (by still folding all compile-time…
PapyChacal c694e02
Remove superfluous GPU passes.
PapyChacal cca08a4
Add direct .so backdoor capability, and XDSL_SKIP_CLEAN env variable …
PapyChacal f057a23
Use DeVito's par-tile.
PapyChacal baa38fb
Use the boilerplate flag to not always copy to GPU.
PapyChacal fa0276f
Merge pull request #26 from xdslproject/emilien/gpu-again
georgebisbas e203d59
Merge pull request #21 from xdslproject/bench_edits
AntonLydike
cool