tkianai

🖐️

tkianai tkianai

🖐️

Struggle & Survive, while keeping Hopeful!

31 followers · 79 following

别怕失败，大不了从头来过
Beijing
https://tkianai.com

Achievements

Stars

IDEA-Research / Rex-Thinker

Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning

Python 63 2 Updated Jun 30, 2025

google-gemini / gemini-cli

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 48,652 4,040 Updated Jul 2, 2025

HXMap / HRMapNet

[ECCV 2024] This is the official implementation of HRMapNet, maintaining and utilizing a low-cost global rasterized map to enhance online vectorized map perception.

Python 93 11 Updated Sep 25, 2024

OpenDriveLab / LaneSegNet

[ICLR 2024] Map Learning with Lane Segment for Autonomous Driving

Python 317 38 Updated Jul 19, 2024

OpenDriveLab / UniVLA

[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions

Python 506 23 Updated Jun 20, 2025

km1994 / AwesomeMultiModel

【AIGC 实战入门笔记 —— AIGC 摩天大楼】分享大语言模型（LLMs），大模型高效微调（SFT）,检索增强生成（RAG），智能体（Agent），PPT自动生成, 角色扮演，文生图（Stable Diffusion），图像文字识别（OCR），语音识别（ASR），语音合成（TTS），人像分割（SA），多模态（VLM），Ai 换脸(Face Swapping), 文生视频(VD)，图生…

15 2 Updated Apr 26, 2025

coderonion / awesome-llm-and-aigc

🚀🚀🚀A collection of some awesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applic…

714 62 Updated May 3, 2025

MCG-NJU / MOTIP

[CVPR 2025] Multiple Object Tracking as ID Prediction

Python 292 19 Updated Jun 21, 2025

facebookresearch / vggt

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 9,288 891 Updated Jun 30, 2025

mega-sam / mega-sam

Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"

Python 909 41 Updated Jun 13, 2025

xiaolul2 / MGMap

[CVPR2024] The code for "MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction"

Python 112 5 Updated Apr 13, 2024

pnnnnnnn / Uni-PrevPredMap

Uni-PrevPredMap: Extending PrevPredMap to a Unified Framework of Prior-Informed Modeling for Online Vectorized HD Map Construction

Python 8 1 Updated May 5, 2025

Peterande / D-FINE

D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]

Python 2,510 230 Updated Jul 1, 2025

UCSC-VLAA / OpenVision

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Python 272 17 Updated May 15, 2025

xaoyaoo / PyWxDump

获取微信信息；读取数据库，本地查看聊天记录并导出为csv、html等格式用于AI训练，自动回复等。支持多账户信息获取，支持所有微信版本。

Python 8,901 1,380 Updated Apr 29, 2025

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,278 49 Updated Jun 14, 2025

xming521 / WeClone

🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …

Python 14,561 1,123 Updated Jul 2, 2025