Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

perf: adds AVX512 implementations of vector.Sum, vector.InnerProduct + assembly refactor #547

Open
wants to merge 49 commits into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
49 commits
Select commit Hold shift + click to select a range
d7aa178
feat: asm Vector sum slower, no avx
gbotrel Sep 21, 2024
6d2d8a8
checkpoint
gbotrel Sep 21, 2024
0bb00a0
checkpoint
gbotrel Sep 21, 2024
ce4ade2
checkpoint
gbotrel Sep 21, 2024
847a6df
feat: add vec.Sum AVX512
gbotrel Sep 22, 2024
4796eb3
test: make odd bound for better test case
gbotrel Sep 22, 2024
dfcb110
build: make linter happy
gbotrel Sep 22, 2024
f45aeb9
fix: update bound for vec sum to match parameter choices
gbotrel Sep 22, 2024
75120a0
perf: loop 8 by 8, cosmetics
gbotrel Sep 23, 2024
dc15a6a
style: cosmetics
gbotrel Sep 23, 2024
a66f547
test: better sum test
gbotrel Sep 23, 2024
5b2b11d
test: more test
gbotrel Sep 23, 2024
fcfaa05
refactor: move common assembly routine in subfolder (#545)
gbotrel Sep 24, 2024
b27149d
doc: add reference for reduction algorithm
gbotrel Sep 24, 2024
159a7b7
feat: use latest bavard for avx512 instructions
gbotrel Sep 24, 2024
61268fe
feat: added purego InnerProduct
gbotrel Sep 24, 2024
71d68aa
checkpoint wip
gbotrel Sep 25, 2024
8d21a8e
checkpoint
gbotrel Sep 25, 2024
8a906c6
refactor: checkpoint
gbotrel Sep 26, 2024
a2e333c
test: better tests for vec ops
gbotrel Sep 26, 2024
78184a8
checkpoint
gbotrel Sep 27, 2024
0a939b1
test: add more tests for vector ops
gbotrel Sep 28, 2024
85e509d
feat: update bavard and use better syntax in asm
gbotrel Sep 28, 2024
5c202ac
test: make benchmarks on varying sizes
gbotrel Sep 28, 2024
97cdc21
test: bench on larger vector
gbotrel Sep 28, 2024
48cca4c
test: bench on larger vector
gbotrel Sep 28, 2024
370e77b
checkpoint
gbotrel Sep 30, 2024
452fe92
checkpoint
gbotrel Sep 30, 2024
8ce8ecb
checkpoint
gbotrel Sep 30, 2024
e104c20
checkpoint
gbotrel Sep 30, 2024
e7a1eb0
checkpoint
gbotrel Sep 30, 2024
b868789
checkpoint
gbotrel Sep 30, 2024
8862854
checkpoint
gbotrel Oct 1, 2024
5d212b6
checkpoint
gbotrel Oct 1, 2024
6f063e8
checkpoint
gbotrel Oct 1, 2024
8978198
checkpoint
gbotrel Oct 1, 2024
89c5035
checkpoint
gbotrel Oct 1, 2024
7336398
checkpoint
gbotrel Oct 2, 2024
c56802b
checkpoint
gbotrel Oct 2, 2024
d8be4fe
checkpoint
gbotrel Oct 2, 2024
d7436cd
checkpoint
gbotrel Oct 2, 2024
f5065fd
refactor: use defines for mul
gbotrel Oct 3, 2024
2b201da
feat: make use of defines in assembly
gbotrel Oct 3, 2024
76b236e
checkpoint
gbotrel Oct 3, 2024
ccdec18
style: code cleaning
gbotrel Oct 3, 2024
24c1aa6
perf: prefetches in vec ops
gbotrel Oct 3, 2024
3b926b1
perf: minor adjustements
gbotrel Oct 3, 2024
d1601ba
style: costmetics
gbotrel Oct 3, 2024
138484c
feat: handle case where len(vec)==0
gbotrel Oct 3, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
File renamed without changes.
28 changes: 2 additions & 26 deletions ecc/bls12-377/fp/element.go

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Loading