[CUDAX] Add modernized simpleP2P sample #2696

Merged Nov 11, 2024 (31 commits)

Commits on Nov 4, 2024

  1. copy pasted sample

    pciolkosz committed Nov 4, 2024
    eae1b06
  2. WIP

    pciolkosz committed Nov 4, 2024
    5d7ad98
  3. First draft

    pciolkosz committed Nov 4, 2024
    2a3060f
  4. Fixes

    pciolkosz committed Nov 4, 2024
    795439b
  5. More fixes

    pciolkosz committed Nov 4, 2024
    850f5ec
  6. More fixes

    pciolkosz committed Nov 4, 2024
    bbc7d2e
  7. More fixes

    pciolkosz committed Nov 4, 2024
    9ec90b6
  8. 29d48b7
  9. 4760c7d
  10. Needs launch fix

    pciolkosz committed Nov 4, 2024
    836c512
  11. 824a568
  12. Missing include

    pciolkosz committed Nov 4, 2024
    088dad2
  13. 3a813a3
  14. Adjust copy count

    pciolkosz committed Nov 4, 2024
    8ffc7f7
  15. 5174101
  16. 9d47842
  17. Update copyright date

    Co-authored-by: Eric Niebler
    pciolkosz and ericniebler authored Nov 4, 2024
    a5f1e4a

Commits on Nov 5, 2024

  1. fa570ad
  2. 197ed75
  3. 3650705
  4. f20190d
  5. Some new MSVC errors

    pciolkosz committed Nov 5, 2024
    119b05c
  6. explicit cast

    pciolkosz committed Nov 5, 2024
    02c94b9

Commits on Nov 6, 2024

  1. baaff9b
  2. Add cuda::minimum and cuda::maximum (NVIDIA#2681)

    * Add cuda::minimum and cuda::maximum
    Jacobfaib authored and pciolkosz committed Nov 6, 2024
    ed5cf1f
  3. Various fixes to cub::DeviceTransform (NVIDIA#2709)

    * Workaround non-copyable iterators
    * Use a named constant for SMEM
    * Cast to raw reference 2
    * Fix passing non-copy-assignable iterators to transform_kernel via kernel_arg
    bernhardmgruber authored and pciolkosz committed Nov 6, 2024
    49c431c
  4. Make thrust::transform use cub::DeviceTransform (NVIDIA#2389)

    * Add transform benchmark requiring a stable address
    * Make thrust::transform use cub::DeviceTransform
    * Introduces address stability detection and opt-in in libcu++
    * Mark lambdas in Thrust BabelStream benchmark address oblivious
    * Optimize prefetch cub::DeviceTransform for small problems
    
    Fixes: NVIDIA#2263
    bernhardmgruber authored and pciolkosz committed Nov 6, 2024
    1f05372
  5. Ensure that we only use the inline variable trait when it is actually available (NVIDIA#2712)
    
    * Ensure that we only use the inline variable trait when it is actually available
    
    * Use the right define for internal traits
    miscco authored and pciolkosz committed Nov 6, 2024
    13961d0
  6. [CUDAX] Rename memory resource and memory pool from async to device (NVIDIA#2710)
    
    * Rename the type
    
    * Update tests
    
    * Rename async memory pool
    
    * Rename the tests
    
    * Change name in the docs
    
    * Generalise the memory_pool_properties name
    
    * Fix docs
    
    ---------
    
    Co-authored-by: Michael Schellenberger Costa
    pciolkosz and miscco committed Nov 6, 2024
    b0384b6
  7. 603f2aa
  8. 76cd405