CatherineZhou

2 followers · 9 following

Lists (1)

NLP

8 repositories

Stars

williamFalcon / DeepRLHacks

Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)

1,112 121 Updated Oct 13, 2017

google-gemini / gemini-fullstack-langgraph-quickstart

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 15,325 2,510 Updated Jun 18, 2025

AFAC2024 / AFAC2024-Advanced-Fintech-AI-Competition

AFAC2024 Financial Intelligence Innovation Competition

Python 45 7 Updated Nov 27, 2024

zhayujie / chatgpt-on-wechat

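A chatbot built on large language models that supports integration with WeChat Official Accounts, WeCom apps, Feishu, DingTalk, and more. It can use ChatGPT/Claude/DeepSeek/ERNIE Bot/iFlytek Spark/Tongyi Qianwen/Gemini/GLM-4/Kimi/LinkAI, can process text, voice, and images, can access the operating system and the internet, and supports customized enterprise customer-service assistants built on a private knowledge base.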

Python 38,027 9,322 Updated Jun 29, 2025

AppFlowy-IO / AppFlowy

Bring projects, wikis, and teams together with AI. AppFlowy is the AI collaborative workspace where you achieve more without losing control of your data. The leading open source Notion alternative.

Dart 64,293 4,417 Updated Jul 3, 2025

Fosowl / agenticSeek

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and codes for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…

Python 19,863 1,955 Updated Jul 5, 2025

Alibaba-NLP / WebAgent

🌐 WebAgent for Information Seeking built by Tongyi Lab: WebWalker & WebDancer & WebSailor https://arxiv.org/pdf/2507.02592

Python 1,589 118 Updated Jul 7, 2025

SkyworkAI / Skywork-OR1

Unleashing the Power of Reinforcement Learning for Math and Code Reasoners

Python 645 40 Updated Jun 6, 2025

bytedance / deer-flow

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 14,848 1,811 Updated Jul 7, 2025

ZhangXInFD / SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 577 53 Updated Jun 9, 2024

NVIDIA / BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,058 134 Updated Sep 5, 2024

SkyworkAI / Skywork

Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…

Python 1,397 126 Updated Mar 7, 2025

QwenLM / Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,727 123 Updated Jul 5, 2024

gradio-app / fastrtc

The Python library for real-time communication

JavaScript 4,105 370 Updated Jul 7, 2025

Ninot1Quyi / Qwen2.5-Omni-multimodal-chat

A real-time voice conversation system based on Qwen2.5-Omni that uses the online API service, supporting real-time voice interaction, dynamic voice activity detection, and streaming audio processing.

Python 59 9 Updated May 11, 2025

KoljaB / RealtimeVoiceChat

Have a natural, spoken conversation with AI!

Python 2,707 260 Updated Jun 17, 2025

pipecat-ai / pipecat

Open Source framework for voice and multimodal conversational AI

Python 6,740 1,011 Updated Jul 7, 2025

Alibaba-NLP / ZeroSearch

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python 1,040 98 Updated Jul 2, 2025

RUC-NLPIR / WebThinker

🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Python 1,128 120 Updated Jun 20, 2025

Fantasy-AMAP / fantasy-talking

[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Python 1,417 111 Updated Jul 7, 2025

Lancelot39 / Causal-Copilot

Python 88 18 Updated Jun 27, 2025

HumanAIGC-Engineering / OpenAvatarChat

Python 1,489 220 Updated Jul 6, 2025

wxbool / video-srt-windows

An open-source Windows GUI tool that recognizes speech in videos and automatically generates SRT subtitle files.

Go 4,954 617 Updated Mar 10, 2023

buxuku / SmartSub

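妙幕 (SmartSub) is a cross-platform client tool that batch-generates subtitle files for video or audio and can translate the subtitles, with support for Baidu, Volcengine, openai, ollama, deepseek, and other translation providers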

TypeScript 2,617 177 Updated Jul 2, 2025

WEIFENG2333 / VideoCaptioner

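🎬 卡卡字幕助手 (Kaka Subtitle Assistant) | VideoCaptioner - An LLM-based intelligent subtitle assistant that handles the full workflow of video subtitle generation, sentence segmentation, correction, and subtitle translation. A powerful tool for easy and efficient video subtitling.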

Python 8,097 650 Updated Jul 4, 2025

TencentARC / LLaMA-Pro

[ACL 2024] Progressive LLaMA with Block Expansion.

Python 505 40 Updated May 20, 2024

openai / openai-assistants-quickstart

OpenAI Assistants API quickstart with Next.js.

TypeScript 1,901 557 Updated Mar 7, 2025

krillinai / KlicStudio

A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It can generate content optimized for platforms like YouTube, T…

Go 8,017 632 Updated Jul 6, 2025

multimodal-art-projection / COIG-P

Python 38 1 Updated Jul 3, 2025

mendableai / firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript 42,491 4,006 Updated Jul 7, 2025
