Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improvements for transpose, and more. #1624

Merged

Commits on Aug 29, 2023

  1. optimize Transpose() Operation by forcing use of our multi-threaded s…

    …olution instead of Eigen::'s.
    
    Added a commandline switch to enable Eigen::Transpose in case this would prove faster on some architectures or after Eigen:: made progresses.
    Insured (sort of) that in multithreaded mode, TPOOL_MIN_ELTS is also more or less the number of elements that each thread will process, so that GDL
    may use less threads than the machine can provide (some GDL running machines have 64 or more cores). 
    Obviously it is not worth starting 128 threads if 10 would already do the job in time.
    GillesDuvert committed Aug 29, 2023
    Configuration menu
    Copy the full SHA
    88653d0 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    7bc7ede View commit details
    Browse the repository at this point in the history
  3. added switch "--smart-tpool" that enable a mode where the number of t…

    …hreads used insure each thread will process more or less TPOOL_MIN_ELTS, not a diminutve number given the number of available threads, that can be large, 128 or more.
    GillesDuvert committed Aug 29, 2023
    Configuration menu
    Copy the full SHA
    8e9a01a View commit details
    Browse the repository at this point in the history
  4. added switch "--smart-tpool" that enable a mode where the number of t…

    …hreads used insure each thread will process more or less TPOOL_MIN_ELTS, not a diminutve number given the number of available threads, that can be large, 128 or more.
    GillesDuvert committed Aug 29, 2023
    Configuration menu
    Copy the full SHA
    1105b1f View commit details
    Browse the repository at this point in the history
  5. modified some flags of parallelize() to force (in the 'smart tpoll' m…

    …ode) use of the max available number of threads, or other variant.
    GillesDuvert committed Aug 29, 2023
    Configuration menu
    Copy the full SHA
    783b3fa View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    0757621 View commit details
    Browse the repository at this point in the history