Stars
RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI
Official implementation of our NeurIPS 2025 paper: "FlyLoRA: Boosting Task Decoupling and Parameter Efficiency via Implicit Rank-Wise Mixture-of-Experts."
上海交通大学 LaTeX 论文模板 | Shanghai Jiao Tong University LaTeX Thesis Template
Survey: https://arxiv.org/pdf/2507.20198
NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks
This repository contains the PyTorch implementation of the CVPR'2024 paper (Highlight), IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection.
Solve Visual Understanding with Reinforced VLMs
《Pytorch实用教程》(第二版)无论是零基础入门,还是CV、NLP、LLM项目应用,或是进阶工程化部署落地,在这里都有。相信在本书的帮助下,读者将能够轻松掌握 PyTorch 的使用,成为一名优秀的深度学习工程师。
The official Implementation of "VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection" [CVPR 2024]
OpenMMLab Detection Toolbox and Benchmark
Official codebase for PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors
[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
A lecture note for understanding deep learning
Sample ROS2 publisher application that transforms and publishes the Kitti dataset into the ROS2 messages.
“让爷康康”是一款手机 AI 应用程序,可以监测不良坐姿并进行语音提示

