Stars
Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance forā¦
Code for "Efficient Data Processing in Spark" Course
threatspec - continuous threat modeling, through code
A simple threat modeling tool to help humans to reduce time-to-value when threat modeling
LLM Adversarial Robustness Toolkit, a toolkit for evaluating LLM robustness through adversarial testing.
Python Data Science Handbook: full text in Jupyter Notebooks
š§µ CLI tool for directly patching container images!
Run safety benchmarks against AI models and view detailed reports showing how well they performed.
Responsible AI Toolbox is a suite of tools providing model and data exploration and assessment user interfaces and libraries that enable a better understanding of AI systems. These interfaces and lā¦
GitHub action for pruning old GHCR container image versions.
A tool for exploring each layer in a docker image
Community-driven, simple, yet powerful framework for fast, cost-effective distributed Compute over Data.
Efficient Retrieval Augmentation and Generation Framework
Detecting InsecureĀ Code with LLMs
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparenā¦
A dummy's guide to setting up (and using) HPC clusters on Ubuntu 22.04LTS using Slurm and Munge. Created by the Quant Club @ UIowa.
FUSE-based file system backed by Amazon S3
A kubernetes based framework for hassle free handling of datasets
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.
Safety checks Python dependencies for known security vulnerabilities and suggests the proper remediations for vulnerabilities detected.
IntelĀ® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note XPU is already supported in stock DeepSpeed (upstream).