- Toronto, Canada
-
01:13
(UTC -04:00) - https://www.linkedin.com/in/smarunich/
- @smarunich
Lists (2)
Sort Name ascending (A-Z)
Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
Route LLM requests to the best model for the task at hand.
Deploy Tetrate Service Bridge Demo on Azure Kubernetes Service (AKS), Google Kubernetes Engine (GKE) and/or Elastic Kubernetes Service (EKS) using Terraform
This repository contains the Hugging Face Agents Course.
Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI services.
Built for demanding AI workflows, this gateway offers low-latency, provider-agnostic access, ensuring your AI applications run smoothly and quickly.
🦄云原生、超高性能 AI&API网关,LLM API 管理、分发系统、开放平台,支持所有AI API,不限于OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型,统一 API 请求和返回,API申请与审批,调用统计、负载均衡、多模型灾备。一键部署,开箱…
This repository contains examples for customers to get started using the Amazon Bedrock Service. This contains examples for all available foundational models
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
🥧 HTTPie CLI — modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & more.
Hunt down social media accounts by username across social networks
Bring projects, wikis, and teams together with AI. AppFlowy is the AI collaborative workspace where you achieve more without losing control of your data. The leading open source Notion alternative.
An open-source RAG-based tool for chatting with your documents.
InstructLab Core package. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
📄 A curated list of awesome .cursorrules files
🔥 Web-application firewalls (WAFs) from security standpoint.
Small CLI to generate TSB authz policies from service topology data.
A feature-rich portal to chat with GPT-4, Claude, Gemini, Mistral, & OpenAI Assistant APIs via a lightweight Node.js web app; supports customizable multimodality for voice, images, & files.
modular service framework to move and transform network packets
trust-manager is an operator for distributing trust bundles across a Kubernetes cluster.
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI