Starred repositories
🤗 smolagents: a barebones library for agents that think in python code.
Faster Whisper transcription with CTranslate2
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Robust Speech Recognition via Large-Scale Weak Supervision
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Get your documents ready for gen AI
A natural language interface for computers
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Supercharge Your PyTorch Image Models: Bag of Tricks to 8x Faster Inference with ONNX Runtime & Optimizations
[ICLR 2024 Oral] Supervised Pre-Trained 3D Models for Medical Image Analysis (9,262 CT volumes + 25 annotated classes)
BOA is a segmentation tool of CT scans by the SHIP-AI group (https://ship-ai.ikim.nrw/). Combining the TotalSegmentator and the Body Composition Analysis, this tool is capable of analyzing medical …
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
A grid sampler for larger-than-memory N-dimensional images
Processing Library and Analysis Toolkit for Medical Imaging in Python
SAM-Med3D: An Efficient General-purpose Promptable Segmentation Model for 3D Volumetric Medical Image
Official implementation of SAM-Med2D
The official code for "SegVol: Universal and Interactive Volumetric Medical Image Segmentation".
[NeurIPS 2023] AbdomenAtlas 1.0 (5,195 CT volumes + 9 annotated classes)
Tool for robust segmentation of >100 important anatomical structures in CT and MR images
MOOSE (Multi-organ objective segmentation) a data-centric AI solution that generates multilabel organ segmentations to facilitate systemic TB whole-person research.The pipeline is based on nn-UNet …
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
PACS and DICOM framework to identify, collect, and analyze data sets.
An implementation of 1D, 2D, and 3D positional encoding in Pytorch and TensorFlow
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.