Monil Kantwala (mkantwala)

Pinned

DeepSeek-R1-TrainingSuite (Public)

Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe distillation, modular reward systems, and efficient LoRA fi…

Python, 12 stars, 3 forks

CoNavic (Public)

CoNavic is a browser extension that brings the power of ChatGPT and browser automation to your fingertips. Instantly access AI assistance, manage tabs, and organize bookmarks using natural language…

JavaScript, 6 stars, 1 fork