publications | 詹锟

2026

Worldrft: Latent world model planning with reinforcement fine-tuning for autonomous driving

2026

Proceedings of the AAAI Conference on Artificial Intelligence 40 (14), 11649 …

Unifying language-action understanding and generation for autonomous driving

2026

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …

StreetForward: Perceiving Dynamic Street with Feedforward Causal Attention

2026

arXiv preprint arXiv:2603.19552

Streamingclaw technical report

2026

arXiv preprint arXiv:2603.22120

SparseWorld-TC: Trajectory-Conditioned Sparse Occupancy World Model

2026

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …

Rlgf: Reinforcement learning with geometric feedback for autonomous driving video generation

2026

Advances in Neural Information Processing Systems 38, 128659-128684

ReflectDrive-2: Reinforcement-Learning-Aligned Self-Editing for Discrete Diffusion Driving

2026

arXiv preprint arXiv:2605.04647

Planagent: A multi-modal large language agent for closed-loop vehicle motion planning

2026

IEEE Transactions on Cognitive and Developmental Systems

Metis: A Generalizable and Efficient World-Action Model for Autonomous Driving and Urban Navigation

2026

arXiv preprint arXiv:2606.15869

Method and apparatus for generating trajectory, electronic device, storage medium

2026

US Patent App. 19/037,583

M2A: Synergizing Mathematical and Agentic Reasoning in Large Language Models

2026

arXiv preprint arXiv:2605.09879

LiAuto-GeoX: Efficient Grounded Driving Transformer

2026

arXiv preprint arXiv:2606.05774

Hardware Co-Design Scaling Laws via Roofline Modelling for On-Device LLMs

2026

arXiv preprint arXiv:2602.10377

FAAR: Format-Aware Adaptive Rounding for NVFP4

2026

arXiv preprint arXiv:2603.22370

Evolving from Tool User to Creator via Training-Free Experience Reuse in Multimodal Reasoning

2026

arXiv preprint arXiv:2602.01983

Evaluating the Search Agent in a Parallel World

2026

arXiv preprint arXiv:2603.04751

DriveLiDAR4D: Sequential and Controllable LiDAR Scene Generation for Autonomous Driving

2026

Proceedings of the AAAI Conference on Artificial Intelligence 40 (4), 2525-2533

DriveCombo: Benchmarking Compositional Traffic Rule Reasoning in Autonomous Driving

2026

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …

Correctad: A self-correcting agentic system to improve end-to-end planning in autonomous driving

2026

Proceedings of the AAAI Conference on Artificial Intelligence 40 (10), 7755-7763

Closed Loop Dynamic Driving Data Mixture for Real-Synthetic Co-Training

2026

arXiv preprint arXiv:2605.21372

Ad-r1: Closed-loop reinforcement learning for end-to-end autonomous driving with impartial world models

2026

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …

2025

Transdiffuser: End-to-end trajectory generation with decorrelated multi-modal representation for autonomous driving

2025

arXiv preprint arXiv:2505.09315

Transdiffuser: Diverse trajectory generation with decorrelated multi-modal representation for end-to-end autonomous driving

2025

arXiv preprint arXiv:2505.09315

The better you learn, the smarter you prune: Towards efficient vision-language-action models via differentiable token pruning

2025

arXiv preprint arXiv:2509.12594

StyledStreets: Multi-style Street Simulator with Spatial and Temporal Consistency

2025

arXiv preprint arXiv:2503.21104

Streetcrafter: Street view synthesis with controllable video diffusion models

2025

Proceedings of the Computer Vision and Pattern Recognition Conference, 822-832

Street Gaussians: Modeling Dynamic Urban Scenes With Gaussian Primitives

2025

IEEE Transactions on Pattern Analysis and Machine Intelligence

RoboPearls: editable video simulation for robot manipulation

2025

Proceedings of the IEEE/CVF International Conference on Computer Vision …

Recondreamer: Crafting world models for driving scene reconstruction via online restoration

2025

Proceedings of the Computer Vision and Pattern Recognition Conference, 1559-1569

PosePilot: Steering camera pose for generative world models with self-supervised depth

2025

IEEE/RSJ International Conference on Intelligent Robots and Systems …

Omnigen: Unified multimodal sensor generation for autonomous driving

2025

Proceedings of the 33rd ACM International Conference on Multimedia, 9365-9374

Learning Personalized Driving Styles via Reinforcement Learning from Human Feedback

2025

arXiv preprint arXiv:2503.10434

HiNeuS: High-fidelity Neural Surface Mitigating Low-texture and Reflective Ambiguity

2025

Proceedings of the IEEE/CVF International Conference on Computer Vision …

Hierarchy UGP: Hierarchy Unified Gaussian Primitive for Large-Scale Dynamic Scene Reconstruction

2025

Proceedings of the IEEE/CVF International Conference on Computer Vision …

Geodrive: 3d geometry-informed driving world model with precise action control

2025

arXiv preprint arXiv:2505.22421

Generalizing motion planners with mixture of experts for autonomous driving

2025

IEEE International Conference on Robotics and Automation (ICRA), 6033-6039

Finetuning generative trajectory model with reinforcement learning from human feedback

2025

arXiv preprint arXiv:2503.10434

Drivingsphere: Building a high-fidelity 4d world for closed-loop simulation

2025

Proceedings of the Computer Vision and Pattern Recognition Conference, 27531 …

DriveAgent-R1: Advancing VLM-based autonomous driving with hybrid thinking and active perception

2025

arXiv preprint arXiv:2507.20879

DriveAgent-R1: Advancing VLM-based Autonomous Driving with Active Perception and Hybrid Thinking

2025

arXiv preprint arXiv:2507.20879

Dive: Efficient multi-view driving scenes generation based on video diffusion transformer

2025

arXiv preprint arXiv:2504.19614

Discrete diffusion for reflective vision-language-action models in autonomous driving

2025

arXiv preprint arXiv:2509.20109

Bev-tsr: Text-scene retrieval in bev space for autonomous driving

2025

Proceedings of the AAAI Conference on Artificial Intelligence 39 (7), 7275-7283

3drealcar: An in-the-wild rgb-d car dataset with 360-degree views

2025

Proceedings of the IEEE/CVF International Conference on Computer Vision …

2024

Xiaoxiao Long, Yilun Chen, and Hao Zhao. Tod3cap: Towards 3d dense captioning in outdoor scenes

2024

Computer Vision–ECCV, 367-384

Unleashing generalization of end-to-end autonomous driving with controllable long video generation

2024

arXiv preprint arXiv:2406.01349

Ua-track: Uncertainty-aware end-to-end 3d multi-object tracking

2024

arXiv preprint arXiv:2406.02147

Tod3cap: Towards 3d dense captioning in outdoor scenes

2024

European Conference on Computer Vision, 367-384

Street gaussians: Modeling dynamic urban scenes with gaussian splatting

2024

European Conference on Computer Vision, 156-173

S2-track: A simple yet strong approach for end-to-end 3d multi-object tracking

2024

arXiv preprint arXiv:2406.02147

Drivevlm: The convergence of autonomous driving and large vision-language models

2024

arXiv preprint arXiv:2402.12289

Dive: Dit-based video generation with enhanced control

2024

arXiv preprint arXiv:2409.01595

Bev-clip: Multi-modal bev retrieval methodology for complex scene in autonomous driving

2024

Balanced 3DGS: Gaussian-wise parallelism rendering with fine-grained tiling

2024

arXiv preprint arXiv:2412.17378

2023

Street gaussians for modeling dynamic urban scenes

2023

arXiv preprint arXiv:2401.01339

2015

Joint tracking and classification with constraints and reassignment by radar and ESM

L Xu, H Jiang

2015

Digital Signal Processing 40, 213-223

2014

Particle filter based joint tracking and classification

2014

Proceedings of IEEE Chinese Guidance, Navigation and Control Conference …

Joint tracking and classification on aerodynamic model and RCS by ground-based passive radar

2014

Proceedings of IEEE Chinese Guidance, Navigation and Control Conference …

Joint tracking and classification based on aerodynamic model and radar cross section

K Zhan, H Jiang

2014

Pattern Recognition 47 (9), 3096-3105
