Publications

Publications are listed in reverse chronological order.
* denotes equal contribution.

2025

  1. AAAI
    Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving
    Yu Yang*, Jianbiao Mei*, Yukai Ma, and 5 more authors
    AAAI Conference on Artificial Intelligence (AAAI), 2025

2024

  1. Preprint
    DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
    Jianbiao Mei, Xuemeng Yang, Licheng Wen, and 7 more authors
    arXiv preprint arXiv:2409.04003, 2024
  2. Preprint
    DriveArena: A closed-loop generative simulation platform for autonomous driving
    Xuemeng Yang*, Licheng Wen*, Yukai Ma*, and 8 more authors
    arXiv preprint arXiv:2408.00415, 2024
  3. NeurIPS
    Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving
    Jianbiao Mei*, Yukai Ma*, Xuemeng Yang, and 8 more authors
    In Advances in Neural Information Processing Systems (NeurIPS), 2024
  4. TIP
    Camera-based 3D semantic scene completion with sparse guidance network
    Jianbiao Mei, Yu Yang, Mengmeng Wang, and 5 more authors
    IEEE Transactions on Image Processing (TIP), 2024
  5. APIN
    Learning spatiotemporal relationships with a unified framework for video object segmentation
    Jianbiao Mei, Mengmeng Wang, Yu Yang, and 2 more authors
    Applied Intelligence (APIN), 2024
  6. PRL
    LiDAR video object segmentation with dynamic kernel refinement
    Jianbiao Mei, Yu Yang, Mengmeng Wang, and 3 more authors
    Pattern Recognition Letters (PRL), 2024
  7. Preprint
    DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries
    Yu Yang*, Jianbiao Mei*, Liang Liu, and 6 more authors
    arXiv preprint arXiv:2408.15813, 2024
  8. RAL
    LiCROcc: Teach Radar for Accurate Semantic Occupancy Prediction using LiDAR and Camera
    Yukai Ma*, Jianbiao Mei*, Xuemeng Yang, and 6 more authors
    IEEE Robotics and Automation Letters (RAL), 2024
  9. AAAI
    A Multimodal, Multi-Task Adapting Framework for Video Action Recognition
    Mengmeng Wang, Jiazheng Xing, Boyuan Jiang, and 6 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2024
  10. ICRA
    A Coarse-to-Fine Place Recognition Approach using Attention-guided Descriptors and Overlap Estimation
    Chencan Fu, Lin Li, Jianbiao Mei, and 4 more authors
    In 2024 IEEE International Conference on Robotics and Automation (ICRA), 2024
  11. 3DV
    Exploit Spatiotemporal Contextual Information for 3D Single Object Tracking via Memory Networks
    Jongwon Ra, Mengmeng Wang, Jianbiao Mei, and 3 more authors
    In 2024 International Conference on 3D Vision (3DV), 2024

2023

  1. ACM MM
    CenterLPS: Segment instances by centers for LiDAR panoptic segmentation
    Jianbiao Mei*, Yu Yang*, Mengmeng Wang, and 5 more authors
    In Proceedings of the 31st ACM International Conference on Multimedia (ACM MM), 2023
  2. IROS
    PANet: LiDAR Panoptic Segmentation with Sparse Instance Proposal and Aggregation
    Jianbiao Mei*, Yu Yang*, Mengmeng Wang, and 3 more authors
    In 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023
  3. IROS
    SSC-RS: Elevate LiDAR semantic scene completion with representation separation and BEV fusion
    Jianbiao Mei, Yu Yang, Mengmeng Wang, and 3 more authors
    In 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023
  4. ACM TIST
    Fast real-time video object segmentation with a tangled memory network
    Jianbiao Mei*, Mengmeng Wang*, Yu Yang, and 2 more authors
    ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2023
  5. APIN
    Exploiting semantic-level affinities with a mask-guided network for temporal action proposal in videos
    Yu Yang, Mengmeng Wang, Jianbiao Mei, and 1 more author
    Applied Intelligence (APIN), 2023
  6. RAL
    Geo-localization with transformer-based 2D-3D match network
    Laijian Li, Yukai Ma, Kai Tang, and 5 more authors
    IEEE Robotics and Automation Letters (RAL), 2023
  7. TNNLS
    ActionCLIP: Adapting language-image pretrained models for video action recognition
    Mengmeng Wang, Jiazheng Xing, Jianbiao Mei, and 2 more authors
    IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
  8. Preprint
    CR-SFP: Learning Consistent Representation for Soft Filter Pruning
    Jingyang Xiang, Zhuangzhi Chen, Jianbiao Mei, and 3 more authors
    arXiv preprint arXiv:2312.11555, 2023

2022

  1. TIP
    Delving deeper into mask utilization in video object segmentation
    Mengmeng Wang*, Jianbiao Mei*, Lina Liu, and 3 more authors
    IEEE Transactions on Image Processing (TIP), 2022
  2. ECCV
    E-NeRV: Expedite neural video representation with disentangled spatial-temporal context
    Zizhang Li, Mengmeng Wang, Huaijin Pi, and 3 more authors
    In European Conference on Computer Vision (ECCV), 2022

2021

  1. Preprint
    TransVOS: Video object segmentation with transformers
    Jianbiao Mei*, Mengmeng Wang*, Yeneng Lin, and 2 more authors
    arXiv preprint arXiv:2106.00588, 2021
  2. Preprint
    MaIL: A unified mask-image-language trimodal network for referring image segmentation
    Zizhang Li, Mengmeng Wang, Jianbiao Mei, and 1 more author
    arXiv preprint arXiv:2111.10747, 2021