3.7
The 3.7 release contains another round of performance optimizations, including significant improvements to the command line front-end (faster PNG loader) and the arm64 build of the codec (faster NEON implementation).
Change log
General:
- Feature: The command line tool PNG loader has been switched to use the Wuffs library, which is significantly faster than the current implementation.
- Feature: Support for non-invariant builds returns. Opt-in to slightly faster, but not bit-exact, builds by setting
-DNO_INVARIANCE=ON
for the CMake configuration. This improves performance by around 2%. - Optimization: Changed SIMD
select()
so that it matches the default NEON behavior (bitwise select), rather than the default x86-64 behavior (lane select on MSB). Specializationselect_msb()
added for the one case we want to select on a sign-bit, where NEON needs a different implementation. This provides a significant (>25%) performance uplift on NEON implementations.
Binary release sha256 checksums
f69c2acbb3b07386cc95001c253cddfa567e71b9618682856f0ff600955cc2ba astcenc-3.7-linux-x64.zip
41c691613e15d844bac56e97a042cc8aea7bc7e8e76fc767ffe875a4d8c5e995 astcenc-3.7-macos-aarch64.zip
5608f4c0b3e1d56a30070cdf61aff42050576833ac7e2719031cec461aa4d102 astcenc-3.7-macos-x64.zip
ecb0e1a5dcbfbaca8a38630e427638380b9d337c266660b39738260e1df5244a astcenc-3.7-windows-x64.zip