Jianbiao Mei

my_photo.jpg

I am a Ph.D. candidate in the Department of Control Science and Engineering at Zhejiang University, where I have been since 2021, under the supervision of Prof. Yong Liu at the APRIL Lab. Prior to this, I obtained my B.Eng from the same department with an honors degree at Chu Kochen Honors College in 2021. Currently, I am an intern researcher at Shanghai AI Laboratory.

Research Interests

The following are my current research interests:

  • 3D Perception
  • World Models
  • Multimodal Large Language Models

News

Sep 26, 2025 🎉 Paper X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability is accepted by NeurIPS 2025 !
📢 We have also released the 3D and 4D World Modeling: A Survey !
Jun 26, 2025 🎉 Paper DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving is accepted by ICCV 2025 !
Dec 10, 2024 🎉 Paper Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving is accepted by AAAI 2025 Oral !
Oct 2, 2024 🎉 Paper Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving is accepted by NeurIPS 2024 !
📢 We have also developed a closed-loop high-fidelity simulation platform called DriveArena!
Sep 11, 2024 🎉 Paper Camera-Based 3D Semantic Scene Completion With Sparse Guidance Network is accepted by 2024 IEEE Transactions on Image Processing (TIP) !

Selected Publications

* denotes equal contribution.

2025

  1. Preprint
    Vision-Centric 4D Occupancy Forecasting and Planning via Implicit Residual World Models
    Jianbiao Mei*, Yu Yang*, Xuemeng Yang, and 4 more authors
    arXiv preprint arXiv:2510.16729, 2025
  2. Preprint
    RE-Searcher: Robust Agentic Search with Goal-oriented Planning and Self-reflection
    Daocheng Fu*, Jianbiao Mei*, Licheng Wen, and 8 more authors
    arXiv preprint arXiv:2509.26048, 2025
  3. Preprint
    3d and 4d world modeling: A survey
    Lingdong Kong*, Wesley Yang*, Jianbiao Mei*, and 8 more authors
    arXiv preprint arXiv:2509.07996, 2025
  4. NeurIPS
    X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability
    Yu Yang, Alan Liang, Jianbiao Mei, and 3 more authors
    In Advances in Neural Information Processing Systems (NeurIPS), 2025
  5. Preprint
    O^2-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering
    Jianbiao Mei*, Tao Hu*, Daocheng Fu*, and 11 more authors
    2025
  6. AAAI Oral
    Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving
    Yu Yang*, Jianbiao Mei*, Yukai Ma, and 5 more authors
    AAAI Conference on Artificial Intelligence (AAAI), 2025
  7. ICCV
    Drivearena: A closed-loop generative simulation platform for autonomous driving
    Xuemeng Yang*, Licheng Wen*, Tiantian Wei*, and 8 more authors
    International Conference on Computer Vision (ICCV), 2025

2024

  1. Preprint
    DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
    Jianbiao Mei*, Tao Hu*, Licheng Wen, and 7 more authors
    arXiv preprint arXiv:2409.04003, 2024
  2. NeurIPS
    Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving
    Jianbiao Mei*, Yukai Ma*, Xuemeng Yang, and 8 more authors
    In Advances in Neural Information Processing Systems (NeurIPS), 2024
  3. TIP
    Camera-based 3d semantic scene completion with sparse guidance network
    Jianbiao Mei, Yu Yang, Mengmeng Wang, and 5 more authors
    IEEE Transactions on Image Processing (TIP), 2024
  4. RAL
    LiCROcc: Teach Radar for Accurate Semantic Occupancy Prediction using LiDAR and Camera
    Yukai Ma*, Jianbiao Mei*, Xuemeng Yang, and 6 more authors
    IEEE Robotics and Automation Letters (RAL), 2024
  5. AAAI Oral
    A Multimodal, Multi-Task Adapting Framework for Video Action Recognition
    Mengmeng Wang, Jiazheng Xing, Boyuan Jiang, and 6 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2024
  6. ICRA
    A Coarse-to-Fine Place Recognition Approach using Attention-guided Descriptors and Overlap Estimation
    Chencan Fu, Lin Li, Jianbiao Mei, and 4 more authors
    In 2024 IEEE International Conference on Robotics and Automation (ICRA), 2024
  7. 3DV
    Exploit Spatiotemporal Contextual Information for 3D Single Object Tracking via Memory Networks
    Jongwon Ra, MengMeng Wang, Jianbiao Mei, and 3 more authors
    In 2024 International Conference on 3D Vision (3DV), 2024

2023

  1. ACM MM
    Centerlps: Segment instances by centers for lidar panoptic segmentation
    Jianbiao Mei*, Yu Yang*, Mengmeng Wang, and 5 more authors
    In Proceedings of the 31st ACM International Conference on Multimedia (ACM MM), 2023
  2. IROS
    SSC-RS: Elevate LiDAR semantic scene completion with representation separation and BEV fusion
    Jianbiao Mei, Yu Yang, Mengmeng Wang, and 3 more authors
    In 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023
  3. ACM TIST
    Fast real-time video object segmentation with a tangled memory network
    Jianbiao Mei*, Mengmeng Wang*, Yu Yang, and 2 more authors
    ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023
  4. TNNLS
    Actionclip: Adapting language-image pretrained models for video action recognition
    Mengmeng Wang, Jiazheng Xing, Jianbiao Mei, and 2 more authors
    IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023

2022

  1. TIP
    Delving deeper into mask utilization in video object segmentation
    Mengmeng Wang*, Jianbiao Mei*, Lina Liu, and 3 more authors
    IEEE Transactions on Image Processing (TIP), 2022
  2. ECCV
    E-nerv: Expedite neural video representation with disentangled spatial-temporal context
    Zizhang Li, Mengmeng Wang, Huaijin Pi, and 3 more authors
    In European Conference on Computer Vision (ECCV), 2022

2021

  1. Preprint
    Transvos: Video object segmentation with transformers
    Jianbiao Mei*, Mengmeng Wang*, Yeneng Lin, and 2 more authors
    arXiv preprint arXiv:2106.00588, 2021