Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add NGC release tests to 24.10 release branch #1049

Closed
wants to merge 11 commits into from

Commits on Sep 5, 2024

  1. remove deprecated XLA flag (#1010)

    1. `xla_gpu_enable_triton_gemm` is still needed. 
    2. Removed some other deprecated XLA flags:
    `xla_gpu_enable_triton_softmax_fusion`
    3. Also removed some XLA flags that are now turned on by default.
    `xla_enable_async_all_gather` etc.
    kocchop authored Sep 5, 2024
    Configuration menu
    Copy the full SHA
    ecacd5b View commit details
    Browse the repository at this point in the history

Commits on Sep 6, 2024

  1. fix tensorboard events dir path (#1032)

    Fixed the tensorboard dir path after a recent change in MaxText
    software:
    AI-Hypercomputer/maxtext#863
    kocchop authored Sep 6, 2024
    Configuration menu
    Copy the full SHA
    44b4dfe View commit details
    Browse the repository at this point in the history
  2. Makes jaxlib wheel dirs readable for non-root users (#1023)

    Example as of 8-28-2024
    
    ```
    $ docker run --entrypoint='' --rm -it ghcr.io/nvidia/jax:pax-2024-08-28 ls -lah /opt/jaxlibs total 20K
    drwxr-xr-x 1 root root 4.0K Aug 28 09:43 .
    drwxr-xr-x 1 root root 4.0K Aug 28 10:04 ..
    drwx------ 1 root root 4.0K Aug 28 09:43 jax_gpu_pjrt
    drwx------ 1 root root 4.0K Aug 28 09:43 jax_gpu_plugin
    drwx------ 1 root root 4.0K Aug 28 09:43 jaxlib
    ```
    
    Signed-off-by: Terry Kong <[email protected]>
    terrykong authored Sep 6, 2024
    Configuration menu
    Copy the full SHA
    f808df5 View commit details
    Browse the repository at this point in the history

Commits on Sep 9, 2024

  1. Configuration menu
    Copy the full SHA
    f116054 View commit details
    Browse the repository at this point in the history

Commits on Sep 18, 2024

  1. Configuration menu
    Copy the full SHA
    a8c5273 View commit details
    Browse the repository at this point in the history
  2. Add an option to test-pax.sh to enable XLA cuDNN flash attention (#1045)

    Provide an option to run XLA cuDNN flash attention as an alternative to
    TE cuDNN flash attention.
    Cjkkkk authored Sep 18, 2024
    Configuration menu
    Copy the full SHA
    056a3b0 View commit details
    Browse the repository at this point in the history

Commits on Sep 25, 2024

  1. Configuration menu
    Copy the full SHA
    57919e0 View commit details
    Browse the repository at this point in the history
  2. Bump clang to 18 (#1060)

    Forced by this change in JAX build system:
    jax-ml/jax#23787
    DwarKapex authored Sep 25, 2024
    Configuration menu
    Copy the full SHA
    3a2e8c8 View commit details
    Browse the repository at this point in the history
  3. Add CI argument for user-defined CUDA base image (#1013)

    Co-authored-by: Olli Lupton <[email protected]>
    yhtang and olupton authored Sep 25, 2024
    Configuration menu
    Copy the full SHA
    ccededf View commit details
    Browse the repository at this point in the history
  4. Add NVCR token for tests

    DwarKapex committed Sep 25, 2024
    Configuration menu
    Copy the full SHA
    0a09276 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    4eeff71 View commit details
    Browse the repository at this point in the history