Skip to content
Change the repository type filter

All

    Repositories list

    • CSS
      MIT License
      2040Updated Oct 18, 2024Oct 18, 2024
    • AISES

      Public
      CSS
      1000Updated Oct 16, 2024Oct 16, 2024
    • HPC cluster code and configurations for running on OCI
      Python
      Universal Permissive License v1.0
      04742Updated Oct 10, 2024Oct 10, 2024
    • Measuring correlations between safety benchmarks and general AI capabilities benchmarks.
      Python
      MIT License
      0200Updated Oct 2, 2024Oct 2, 2024
    • HTML
      MIT License
      0300Updated Sep 20, 2024Sep 20, 2024
    • Forecasting.
      TypeScript
      72710Updated Sep 11, 2024Sep 11, 2024
    • HarmBench

      Public
      HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
      Jupyter Notebook
      MIT License
      51306194Updated Aug 16, 2024Aug 16, 2024
    • This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.
      Python
      MIT License
      267800Updated May 19, 2024May 19, 2024
    • wmdp

      Public
      WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining general capabilities.
      Jupyter Notebook
      MIT License
      217751Updated Apr 27, 2024Apr 27, 2024
    • HTML
      MIT License
      0000Updated Mar 28, 2024Mar 28, 2024
    • JavaScript
      MIT License
      0100Updated Mar 6, 2024Mar 6, 2024
    • Prometheus exporter for performance metrics from Slurm.
      Go
      GNU General Public License v3.0
      142251Updated Nov 1, 2023Nov 1, 2023
    • Jupyter Notebook
      0300Updated Oct 30, 2023Oct 30, 2023
    • reading

      Public
      1100Updated Oct 26, 2023Oct 26, 2023
    • Cost-effectiveness models, tools, and results for various AI safety field-building programs.
      Python
      MIT License
      4502Updated Aug 15, 2023Aug 15, 2023
    • Website for the Trojan Detection Challenge NeurIPS 2022 competition
      JavaScript
      MIT License
      0000Updated Jul 28, 2023Jul 28, 2023
    • GoSlurmMailer - drop in replacement for default slurm MailProg. Delivers slurm job messages to various destinations.
      Go
      6000Updated Jun 21, 2023Jun 21, 2023
    • 206200Updated May 31, 2023May 31, 2023