
Ce Zhu, University of Electronic Science & Technology of China
IEEE/Optica/IET/AAIA Fellow, ChangJiang Distinguished
Dean, Glasgow College, University of Electronic Science and Technology of China (UESTC), China
Ce ZHU is a Changjiang (Cheung Kong) Distinguished Professor at the University of Electronic Science and Technology of China (UESTC), China, and has been serving as the Dean of Glasgow College, a joint school between the University of Glasgow, UK and UESTC, China, since 2022. He has also been an Affiliate Professor of James Watt School of Engineering, University of Glasgow, UK, since 2023. His research interest lies in the general areas of visual information processing, multimedia signal processing and systems, specializing in image/video coding and processing, 3D video, (AI-enabled) visual analysis, perception and applications.
He is a Fellow of IEEE (2017) and a Fellow of Optica (2024). He was an APSIPA Distinguished Lecturer (2021-2022), and also an IEEE Distinguished Lecturer of Circuits and Systems Society (2019-2020). He is now serving as the Chair of IEEE ICME Steering Committee (2024-2025), and the Chair of IEEE Chengdu Section (2024-2028). He is a co-recipient of over 10 paper/demo awards at international conferences, including the most recent Best Demo Award in IEEE ICME 2025 and in IEEE MMSP 2022, Best Paper Award in IEEE BMSB 2025, and Best Paper Runner-Up Award in IEEE ICME 2020.
Title: Deep-Learning-Empowered Super-Resolution: Architectures and Efficiency
Abstract: The pursuit of higher performance in deep-learning-empowered super-resolution has led to increasingly complex models, creating a central challenge of balancing reconstruction quality with computational efficiency. The talk begins a systematic review of the evolution in the field, highlighting the key architectural shifts and the resulting trade-offs between model performance and complexity. The talk subsequently presents a novel architecture designed to improve this trade-off, achieving higher reconstruction quality with greater efficiency. Finally, the talk explores the topic of model compression, introducing an effective post-training quantization strategy that minimizes performance loss, thereby improving the practicality of super-resolution models.

Jun Li , China University of Geosciences, Wuhan, China
TBA.

Hongyan Zhang, China University of Geosciences, Wuhan, China
Dean, School of Computer Science, CUG (Wuhan)
Boi: Limin Wang received the B.Sc. degree from Nanjing University, Nanjing, China, in 2011, and the Ph.D. degree from The Chinese University of Hong Kong, Hong Kong, in 2015. From 2015 to 2018, he was a Postdoctoral Researcher with the Computer Vision Laboratory, ETH Zu rich. He is currently a Professor with the Department of Computer Science and Technology, Nanjing University. His research interests include computer vision and deep learning. He was the first runner-up at the ImageNet Large Scale Visual Recognition Challenge 2015 in scene recognition and the winner at the ActivityNet Large Scale Activity Recognition Challenge 2016 in video classification. He served as the Area Chair for CVPR, ICCV, and NeurIPS. He is on the Editorial Board of IJCV and T-PAMI.
Title: InternVideo: A Family of Multimodal Foundation Models for Video Understanding
Abstract: How to build a foundation model for video understanding has become a very challenging task. This talk mainly introduces a new family of multimodal foundation model, coined as InternVideo, and the key technologies behind it, including the unimodal video self-supervised pre-training method of VideoMAE, the multimodal video weakly supervised pre-training method of UMT, and the video-centric chat model of VideoChat. At the same time, the multimodal video dataset InternVid and the multimodal video evaluation benchmark MVBench, CGBench, VRBench will also be introduced. Finally, we will discuss the future trend of multimodal video understanding foundation model.







