-
National University of Singapore
- Singapore
-
09:42
(UTC +08:00)
Highlights
- Pro
Stars
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
ME5413-Group9-2025 / ME5413_Final_Project
Forked from NUS-Advanced-Robotics-Centre/ME5413_Final_ProjectNUS ME5413 Autonomous Mobile Robotics Final Project
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Chess playing robot based on LSS 4DoF Robotic Arm & Raspberry Pi
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
ROS package which uses the Navigation Stack to autonomously explore an unknown environment with help of GMAPPING and constructs a map of the explored environment. Finally, a path planning algorithm…
A ROS package that implements a multi-robot RRT-based map exploration algorithm. It also has the image-based frontier detection that uses image processing to extract frontier points.
ROS Node and Costmap 2D plugin layer for frontier exploration
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
Code of the paper "NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning" (TPAMI 2025)
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算法面试"、"大模型应用基础"
MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning
The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
[ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
FurnitureBench: Real-World Furniture Assembly Benchmark (RSS 2023)
Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’
Provides ROS integration for Cartographer.
moojink / openvla-oft
Forked from openvla/openvlaFine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
Cartographer is a system that provides real-time simultaneous localization and mapping (SLAM) in 2D and 3D across multiple platforms and sensor configurations.
A high-throughput and memory-efficient inference and serving engine for LLMs
OpenMMLab's next-generation platform for general 3D object detection.
Papers, code and datasets about deep learning for 3D Object Detection.
Containing all the launch, config, rviz and other non-code files