Our group actively publishes in top-tier venues in the fields of machine learning, computer vision, natural language processing, and interdisciplinary data science.
Below is a list of recent and selected papers.
Preprint
- R. Li, P. Pan, B. Yang, D. Xu, S. Zhou, X. Zhang, Z. Li, A. Kadambi, Z. Wang, Z. Tu, Z. Fan
"4K4DGen: Panoramic 4D Generation at 4K Resolution"
arXiv 2024. [Paper] [Project]
- T. Zhu, Q. Liu, F. Wang, Z. Tu, and M. Chen
"Unraveling Cross-Modality Knowledge Conflict in Large Vision-Language Models"
arXiv 2024. [Paper] [Code]
- J. Li, X. Liu, B. Li, R. Xu, J. Li, H. Yu, and Z. Tu
"CoMamba: Real-time Cooperative Perception Unlocked with State Space Models"
arXiv 2024. [Paper] [Code]
- K. Mei, Z. Tu, M. Delbracio, H. Talebi, V.M. Patel, P. Milanfar
"Bigger is not Always Better: Scaling Properties of Latent Diffusion Models"
arXiv 2024. [Paper]
- B. Li, J. Li, X. Liu, R. Xu, Z. Tu, J. Guo, X. Li, H. Yu
"V2X-DGW: Domain Generalization for Multi-agent Perception under Adverse Weather Conditions"
arXiv 2024. [Paper]
Journal Paper
- R. Zhu, Z. Tu, J. Liu, A.C. Bovik, Y. Fan
"MWFormer: Multi-Weather Image Restoration Using Degradation-Aware Transformers"
IEEE Transactions on Image Processing, 2024. [Paper] [Code]
- K. Mei, Z. Tu, M. Delbracio, H. Talebi, V.M. Patel, P. Milanfar
"Bigger is not Always Better: Scaling Properties of Latent Diffusion Models"
Transactions on Machine Learning Research, 2024. [Paper]
- R. Xu, C.J. Chen, Z. Tu, M.H. Yang
"V2X-ViTv2: Improved Vision Transformers for Vehicle-to-Everything Cooperative Perception"
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024. [Paper] [Code]
- Q. Zheng*, Z. Tu*, P.C. Madhusudana, X. Zeng, A.C. Bovik, Y. Fan
"FAVER: Blind Quality Prediction of Variable Frame Rate Videos"
Signal Processing: Image Communication, 2024. [Paper] [Code]
- Q. Zheng, Z. Tu, X. Zeng, A.C. Bovik, Y. Fan
"A completely blind video quality evaluator"
IEEE Signal Processing Letters, 2022. [Paper] [Code]
- Z. Tu, X. Yu, Y. Wang, N. Birkbeck, B. Adsumilli, A.C. Bovik
"RAPIQUE: Rapid and accurate video quality prediction of user generated content"
IEEE Open Journal of Signal Processing, 2021. [Paper] [Code] [IEEE SPS Webinar]
Highlighted in the OJSP 2022-2023 newsletter; featured talk at an IEEE SPS Webinar
- Z. Tu, Y. Wang, N. Birkbeck, B. Adsumilli, A.C. Bovik
"UGC-VQA: Benchmarking blind video quality assessment for user generated content"
IEEE Transactions on Image Processing, 2021. [Paper] [Code]
- Z. Tu, J. Lin, Y. Wang, B. Adsumilli, A.C. Bovik
"Adaptive Debanding Filter"
IEEE Signal Processing Letters, 2020. [Paper] [Code]
Conference Paper
2024
- C. Qi, Z. Tu, K. Ye, M. Delbracio, P. Milanfar, Q. Chen, H. Talebi
"SPIRE: Semantic Prompt-Driven Image Restoration"
European Conference on Computer Vision (ECCV), 2024.
[Project]
[Paper]
- K. Mei, M. Delbracio, H. Talebi, Z. Tu, V.M. Patel, P. Milanfar
"CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation"
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
[Project]
[Paper] [Code]
- J. Li, B. Li, Z. Tu, X. Liu, Q. Guo, F. Juefei-Xu, R. Xu, H. Yu
"Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving"
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
[Project]
[Paper] [Code]
2023
- Z. Tu, P. Milanfar, H. Talebi
"MULLER: Multilayer Laplacian Resizer for Vision"
IEEE/CVF International Conference on Computer Vision (ICCV), 2023.
[Paper] [Code]
- R. Xu, X. Xia, J. Li, H. Li, S. Zhang, Z. Tu, Z. Meng, H. Xiang, X. Dong, R. Song, H. Yu, B. Zhou, J. Ma
"V2V4Real: A real-world large-scale dataset for vehicle-to-vehicle cooperative perception"
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023. Highlight
[Project]
[Paper] [Code]
2022
- Q. Zheng, Z. Tu, Z. Hao, X. Zeng, A.C. Bovik, Y. Fan
"Blind Video Quality Assessment via Space-Time Slice Statistics"
IEEE International Conference on Image Processing (ICIP), 2022.
[Paper] [Code]
- R. Xu*, Z. Tu*, H. Xiang, W. Shao, B. Zhou, J. Ma
"CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers"
Conference on Robot Learning (CoRL), 2022.
[Paper] [Code]
- Q. Zheng, Z. Tu, Y. Fan, X. Zeng, A.C. Bovik
"No-Reference Quality Assessment of Variable Frame-Rate Videos Using Temporal Bandpass Statistics"
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
[Paper] [Code]
- R. Xu*, Z. Tu*, Y. Du*, X. Dong, J. Li, Z. Meng, J. Ma, A. Bovik, H. Yu
"Pik-Fix: Restoring and Colorizing Old Photos"
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022.
[Paper] [Code]
- Z. Tu, H. Talebi, H. Zhang, F. Yang, P. Milanfar, A. Bovik, Y. Li
"MaxViT: Multi-axis Vision Transformer"
European Conference on Computer Vision (ECCV), 2022.
[Paper] [Code]
Highlighted at the top of Jeff Dean's 2022 annual Google Research blog post; selected as one of the top-3 papers of the year in Ahead of AI #4: A Big Year for AI; retweeted by Yann LeCun: link;
Top-15 Most Influential ECCV Papers (2023-01)
- R. Xu*, H. Xiang*, Z. Tu*, X. Xia, M.H. Yang, J. Ma
"V2X-ViT: Vehicle-to-everything cooperative perception with vision transformer"
European Conference on Computer Vision (ECCV), 2022.
[Paper] [Code]
- Z. Tu, H. Talebi, H. Zhang, F. Yang, P. Milanfar, A. Bovik, Y. Li
"MAXIM: Multi-Axis MLP for Image Processing"
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
[Paper] [Code]
Best paper nomination award (0.4% of 8161 submissions)
2021
- Z. Tu, C.J. Chen, Y. Wang, N. Birkbeck, B. Adsumilli, A.C. Bovik
"Video quality assessment of user generated content: A benchmark study and a new model"
IEEE International Conference on Image Processing (ICIP), 2021.
[Paper] [Code]
- Z. Tu, C.J. Chen, Y. Wang, N. Birkbeck, B. Adsumilli, A.C. Bovik
"A Temporal Statistics Model For UGC Video Quality Prediction"
IEEE International Conference on Image Processing (ICIP), 2021.
[Paper]
- Z. Tu, C.J. Chen, Y. Wang, N. Birkbeck, B. Adsumilli, A.C. Bovik
"Efficient user-generated video quality prediction"
Picture Coding Symposium (PCS), 2021.
[Paper] [Code]
- Z. Tu, C.J. Chen, L.H. Chen, Y. Wang, N. Birkbeck, B. Adsumilli, A.C. Bovik
"Regression or classification? New methods to evaluate no-reference picture and video quality models"
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021.
[Paper]
2020
- Z. Tu, L.H. Chen, Y. Wang, N. Birkbeck, B. Adsumilli, A.C. Bovik
"A comparative evaluation of temporal pooling methods for blind video quality assessment"
IEEE International Conference on Image Processing (ICIP), 2020.
[Paper]
- Z. Tu, J. Lin, Y. Wang, N. Birkbeck, B. Adsumilli, A.C. Bovik
"BBAND Index: a No-Reference Banding Artifact Predictor"
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020.
[Paper] [Code]
Workshop Paper
- C. He, Q. Zheng, R. Zhu, X. Zeng, Y. Fan, Z. Tu
"COVER: A comprehensive video quality evaluator"
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024.
[Paper] [Code]
🏆 1st place solution for AIS 2024 UGC Video Quality Assessment Challenge
3rd place solution for AIM 2024 Challenge on Compressed Video Quality Assessment
- X. Yu, Z. Tu, Z. Ying, A.C. Bovik, N. Birkbeck, Y. Wang, B. Adsumilli
"Subjective quality assessment of user-generated content gaming videos"
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops, 2022.
[Paper] [Dataset]