Skip to content
Change the repository type filter

All

    Repositories list

    • cutlass

      Public
      CUDA Templates for Linear Algebra Subroutines
      C++
      948100Updated Oct 21, 2024Oct 21, 2024
    • Ongoing Research Project for Mixture of Expert models
      Python
      0000Updated Oct 2, 2024Oct 2, 2024
    • Ongoing research training transformer language models at scale, including: BERT & GPT-2
      Python
      Other
      2.3k000Updated Sep 26, 2024Sep 26, 2024
    • nanoGPT

      Public
      The simplest, fastest repository for training/finetuning medium-sized GPTs.
      Python
      MIT License
      5.8k200Updated Sep 21, 2024Sep 21, 2024
    • BigCodeBench: Benchmarking Code Generation Towards AGI
      Python
      Apache License 2.0
      22000Updated Sep 16, 2024Sep 16, 2024
    • Optimized primitives for collective multi-GPU communication
      C++
      Other
      807030Updated Aug 2, 2024Aug 2, 2024
    • Python
      Apache License 2.0
      44000Updated Jul 17, 2024Jul 17, 2024
    • Hatrix

      Public
      C++
      13121Updated Jul 5, 2024Jul 5, 2024
    • nbd

      Public
      N-Body generator for Hatrix
      C++
      1000Updated Jun 17, 2024Jun 17, 2024
    • hpsc-2024

      Public
      Shell
      381300Updated Jun 12, 2024Jun 12, 2024
    • FRANK

      Public
      C++
      BSD 3-Clause "New" or "Revised" License
      22110Updated May 9, 2024May 9, 2024
    • Python
      MIT License
      0100Updated Apr 30, 2024Apr 30, 2024
    • grok-1

      Public
      Grok open release
      Python
      Apache License 2.0
      8.3k000Updated Mar 17, 2024Mar 17, 2024
    • toast-gpt

      Public
      Python
      MIT License
      1000Updated Mar 8, 2024Mar 8, 2024
    • toast-vit

      Public
      Python
      MIT License
      0000Updated Feb 14, 2024Feb 14, 2024
    • Zero Bubble Pipeline Parallelism
      Python
      Other
      2.3k000Updated Feb 13, 2024Feb 13, 2024
    • main: microsoft/Meagtron-DeepSpeed, cpu: 富岳上で動かすstableブランチ
      Python
      Other
      1520Updated Feb 2, 2024Feb 2, 2024
    • 2023 ABCI Llama-2 継続学習プロジェクト
      Python
      Other
      31300Updated Jan 22, 2024Jan 22, 2024
    • Python
      Apache License 2.0
      173000Updated Dec 15, 2023Dec 15, 2023
    • An adaptable federated learning framework with a central server, supporting diverse datasets, models, and optimizers. Facilitates collaborative, yet private, data training with customizable aggregation algorithms.
      Python
      MIT License
      0000Updated Nov 16, 2023Nov 16, 2023
    • m2

      Public
      Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"
      Assembly
      45000Updated Nov 2, 2023Nov 2, 2023
    • gpt-neox

      Public
      An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
      Python
      Apache License 2.0
      1k000Updated Sep 25, 2023Sep 25, 2023
    • Best practice for training LLaMA models in Megatron-LM
      Python
      Other
      2.3k000Updated Sep 4, 2023Sep 4, 2023
    • Ongoing research training transformer language models at scale, including: BERT & GPT-2
      Python
      Other
      2.3k000Updated Aug 30, 2023Aug 30, 2023
    • elses

      Public
      Fortran
      0000Updated Aug 3, 2023Aug 3, 2023
    • A framework for few-shot evaluation of autoregressive language models.
      Python
      MIT License
      1.8k000Updated Jul 31, 2023Jul 31, 2023
    • STRUMPACK

      Public
      Structured Matrix Package (LBNL)
      C++
      Other
      41000Updated Jul 25, 2023Jul 25, 2023
    • C++
      1000Updated Jul 6, 2023Jul 6, 2023
    • Pixel-level Contrastive Learning of Driving Videos with Optical Flow, CVPR 2023 Workshop
      Python
      MIT License
      0400Updated Jun 27, 2023Jun 27, 2023
    • ひなどりクラスタの使い方 (for public)
      0000Updated Jun 10, 2023Jun 10, 2023