Our group actively publishes in top-tier venues in machine learning, computer vision, natural language processing, and interdisciplinary data science. Below is a list of recent and selected papers.

Preprint

  • R. Li, P. Pan, B. Yang, D. Xu, S. Zhou, X. Zhang, Z. Li, A. Kadambi, Z. Wang, Z. Tu, Z. Fan
    "4K4DGen: Panoramic 4D Generation at 4K Resolution"
    arXiv 2024. [Paper] [Project]
  • T. Zhu, Q. Liu, F. Wang, Z. Tu, M. Chen
    "Unraveling Cross-Modality Knowledge Conflict in Large Vision-Language Models"
    arXiv 2024. [Paper] [Code]
  • J. Li, X. Liu, B. Li, R. Xu, J. Li, H. Yu, Z. Tu
    "CoMamba: Real-time Cooperative Perception Unlocked with State Space Models"
    arXiv 2024. [Paper] [Code]
  • K. Mei, Z. Tu, M. Delbracio, H. Talebi, V.M. Patel, P. Milanfar
    "Bigger is not Always Better: Scaling Properties of Latent Diffusion Models"
    arXiv 2024. [Paper]
  • B. Li, J. Li, X. Liu, R. Xu, Z. Tu, J. Guo, X. Li, H. Yu
    "V2X-DGW: Domain Generalization for Multi-agent Perception under Adverse Weather Conditions"
    arXiv 2024. [Paper]

Journal Paper

  • R. Zhu, Z. Tu, J. Liu, A.C. Bovik, Y. Fan
    "MWFormer: Multi-Weather Image Restoration Using Degradation-Aware Transformers"
    IEEE Transactions on Image Processing, 2024. [Paper] [Code]
  • K. Mei, Z. Tu, M. Delbracio, H. Talebi, V.M. Patel, P. Milanfar
    "Bigger is not Always Better: Scaling Properties of Latent Diffusion Models"
    Transactions on Machine Learning Research, 2024. [Paper]
  • R. Xu, C.J. Chen, Z. Tu, M.H. Yang
    "V2X-ViTv2: Improved Vision Transformers for Vehicle-to-Everything Cooperative Perception"
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024. [Paper] [Code]
  • Q. Zheng*, Z. Tu*, P.C. Madhusudana, X. Zeng, A.C. Bovik, Y. Fan
    "FAVER: Blind Quality Prediction of Variable Frame Rate Videos"
    Signal Processing: Image Communication, 2024. [Paper] [Code]
  • Q. Zheng, Z. Tu, X. Zeng, A.C. Bovik, Y. Fan
    "A completely blind video quality evaluator"
    IEEE Signal Processing Letters, 2022. [Paper] [Code]
  • Z. Tu, X. Yu, Y. Wang, N. Birkbeck, B. Adsumilli, A.C. Bovik
    "RAPIQUE: Rapid and accurate video quality prediction of user generated content"
    IEEE Open Journal of Signal Processing, 2021. [Paper] [Code] [IEEE SPS Webinar]
    Highlighted in the OJSP 2022-2023 newsletter; featured talk at an IEEE SPS Webinar
  • Z. Tu, Y. Wang, N. Birkbeck, B. Adsumilli, A.C. Bovik
    "UGC-VQA: Benchmarking blind video quality assessment for user generated content"
    IEEE Transactions on Image Processing, 2021. [Paper] [Code]
  • Z. Tu, J. Lin, Y. Wang, B. Adsumilli, A.C. Bovik
    "Adaptive Debanding Filter"
    IEEE Signal Processing Letters, 2020. [Paper] [Code]

Conference Paper

2024

  • C. Qi, Z. Tu, K. Ye, M. Delbracio, P. Milanfar, Q. Chen, H. Talebi
    "SPIRE: Semantic Prompt-Driven Image Restoration"
    European Conference on Computer Vision (ECCV), 2024. [Project] [Paper]
  • K. Mei, M. Delbracio, H. Talebi, Z. Tu, V.M. Patel, P. Milanfar
    "CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation"
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024. [Project] [Paper] [Code]
  • J. Li, B. Li, Z. Tu, X. Liu, Q. Guo, F. Juefei-Xu, R. Xu, H. Yu
    "Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving"
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024. [Project] [Paper] [Code]

2023

  • Z. Tu, P. Milanfar, H. Talebi
    "MULLER: Multilayer Laplacian Resizer for Vision"
    IEEE/CVF International Conference on Computer Vision (ICCV), 2023. [Paper] [Code]
  • R. Xu, X. Xia, J. Li, H. Li, S. Zhang, Z. Tu, Z. Meng, H. Xiang, X. Dong, R. Song, H. Yu, B. Zhou, J. Ma
    "V2V4Real: A real-world large-scale dataset for vehicle-to-vehicle cooperative perception"
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023. Highlight [Project] [Paper] [Code]

2022

  • Q. Zheng, Z. Tu, Z. Hao, X. Zeng, A.C. Bovik, Y. Fan
    "Blind Video Quality Assessment via Space-Time Slice Statistics"
    IEEE International Conference on Image Processing (ICIP), 2022. [Paper] [Code]
  • R. Xu*, Z. Tu*, H. Xiang, W. Shao, B. Zhou, J. Ma
    "CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers"
    Conference on Robot Learning (CoRL), 2022. [Paper] [Code]
  • Q. Zheng, Z. Tu, Y. Fan, X. Zeng, A.C. Bovik
    "No-Reference Quality Assessment of Variable Frame-Rate Videos Using Temporal Bandpass Statistics"
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022. [Paper] [Code]
  • R. Xu*, Z. Tu*, Y. Du*, X. Dong, J. Li, Z. Meng, J. Ma, A. Bovik, H. Yu
    "Pik-Fix: Restoring and Colorizing Old Photos"
    IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022. [Paper] [Code]
  • Z. Tu, H. Talebi, H. Zhang, F. Yang, P. Milanfar, A. Bovik, Y. Li
    "MaxViT: Multi-axis Vision Transformer"
    European Conference on Computer Vision (ECCV), 2022. [Paper] [Code]
    Highlighted in Jeff Dean's 2022 annual Google Research blog post; selected as one of the top-3 papers of the year in Ahead of AI #4: A Big Year for AI; retweeted by Yann LeCun: link; Top-15 Most Influential ECCV Papers (2023-01)
  • R. Xu*, H. Xiang*, Z. Tu*, X. Xia, M.H. Yang, J. Ma
    "V2X-ViT: Vehicle-to-everything cooperative perception with vision transformer"
    European Conference on Computer Vision (ECCV), 2022. [Paper] [Code]
  • Z. Tu, H. Talebi, H. Zhang, F. Yang, P. Milanfar, A. Bovik, Y. Li
    "MAXIM: Multi-Axis MLP for Image Processing"
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022. [Paper] [Code]
    Best paper nomination award (0.4% of 8161 submissions)

2021

  • Z. Tu, C.J. Chen, Y. Wang, N. Birkbeck, B. Adsumilli, A.C. Bovik
    "Video quality assessment of user generated content: A benchmark study and a new model"
    IEEE International Conference on Image Processing (ICIP), 2021. [Paper] [Code]
  • Z. Tu, C.J. Chen, Y. Wang, N. Birkbeck, B. Adsumilli, A.C. Bovik
    "A Temporal Statistics Model For UGC Video Quality Prediction"
    IEEE International Conference on Image Processing (ICIP), 2021. [Paper]
  • Z. Tu, C.J. Chen, Y. Wang, N. Birkbeck, B. Adsumilli, A.C. Bovik
    "Efficient user-generated video quality prediction"
    Picture Coding Symposium (PCS), 2021. [Paper] [Code]
  • Z. Tu, C.J. Chen, L.H. Chen, Y. Wang, N. Birkbeck, B. Adsumilli, A.C. Bovik
    "Regression or classification? new methods to evaluate no-reference picture and video quality models"
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021. [Paper]

2020

  • Z. Tu, L.H. Chen, Y. Wang, N. Birkbeck, B. Adsumilli, A.C. Bovik
    "A comparative evaluation of temporal pooling methods for blind video quality assessment"
    IEEE International Conference on Image Processing (ICIP), 2020. [Paper]
  • Z. Tu, J. Lin, Y. Wang, N. Birkbeck, B. Adsumilli, A.C. Bovik
    "BBAND Index: a No-Reference Banding Artifact Predictor"
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020. [Paper] [Code]

Workshop Paper