Biography

My research aims at (1) developing Differentiable/Meta/Reinforcement Learning algorithms that enable machines and devices to solve complex tasks with greater autonomy, (2) understanding the foundations of deep learning algorithms, and (3) enabling applications in Machine Vision and Artificial Intelligence such as text-to-image/video generation, 3D vision, scene and video understanding, and medical image analysis.

Ping Luo is an Associate Professor in the Department of Computer Science at the University of Hong Kong, an Associate Director of the HKU Musketeers Foundation Institute of Data Science (HKU IDS), and a Deputy Director of the Joint Research Lab of HKU and Shanghai AI Lab. He obtained his Ph.D. in Information Engineering from the Chinese University of Hong Kong in 2014, under the supervision of Prof. Xiaoou Tang (founder of SenseTime) and Prof. Xiaogang Wang. Before joining HKU in 2019, he was a Research Director at SenseTime. He has published 100+ papers in international conferences and journals such as TPAMI, ICML, ICLR, NeurIPS, and CVPR, with over 50,000 citations on Google Scholar. He received the 2015 AAAI Easily Accessible Paper award, was nominated for the 2022 Computational Visual Media Journal's Best Paper of the Year, won the 2022 ACL Outstanding Paper and the 2023 World Artificial Intelligence Conference (WAIC) Outstanding Paper awards, and was a Best Paper candidate at ICCV’23. He was recognized as one of the innovators under 35 in the Asia-Pacific region by the MIT Technology Review (MIT TR35) in 2020. He has mentored 30 Ph.D. students, many of whom have received significant awards such as the Nvidia Fellowship, the Baidu Fellowship, and the WAIC Yunfan Award.

Recent Publications

(2024). RegionGPT: Towards Region Understanding Vision Language Model. Computer Vision and Pattern Recognition (CVPR) 2024.

(2024). SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution. Computer Vision and Pattern Recognition (CVPR) 2024.

(2024). GenTron: Diffusion Transformers for Image and Video Generation. Computer Vision and Pattern Recognition (CVPR) 2024.

(2024). MotionCtrl: A Unified and Flexible Motion Controller for Video Generation. SIGGRAPH 2024.

(2024). Part123: Part-aware 3D Reconstruction from a Single-view Image. SIGGRAPH 2024.

(2024). PixArt-alpha: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis. International Conference on Learning Representations (ICLR) 2024.

(2024). VDT: General-purpose Video Diffusion Transformers via Mask Modeling. International Conference on Learning Representations (ICLR) 2024.

(2024). OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models. International Conference on Learning Representations (ICLR) 2024.

(2023). VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks. Thirty-seventh Annual Conference on Neural Information Processing Systems (NeurIPS) 2023.

(2023). EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought. Thirty-seventh Annual Conference on Neural Information Processing Systems (NeurIPS) 2023.

Principal Investigator

Advisory Committee

Wenping Wang

Professor, IEEE Fellow

Xiaoou Tang

In Forever Memory of Professor Sean Tang

PhD Candidates

Anran Liu

PhD, since 2019 (HKPFS), co-supervised with Prof. Wenping Wang

Low-Level Vision, Deep Learning

Chaofan Tao

PhD, since 2020. Co-supervised with Prof. Ngai Wong

Model Compression and Acceleration, Hardware-efficient AI

Chengyue Wu

PhD (HKPFS), 2023-

Multimodality

Chonghao Si Ma

PhD, 2023-

Autonomous Driving

Chongjian GE

PhD, since 2020 (HKPFS).

Object Detection, Visual Question Answering, Deep Learning

Fanqing Meng

PhD, 2023-, Shanghai AI Lab Joint PhD Program

Text-to-Image, LLM

Haibao Yu

PhD, since 2022.

V2X, Autonomous Driving, Computer Vision, Efficient AI

Jiahao Wang

PhD, 2023-

Fast Neural Architecture Design

Jiannan Wu

PhD, since 2020 (HKPFS).

Math Exercise Representation, Visual Question Answering, Deep Learning

Jin Wang

PhD, 2023-

Deepfake Detection, Explainable AI

Li Chen

PhD, 2023-

Autonomous Driving

Mengkang Hu

PhD, 2023-

NLP, Multimodality, Robotics Learning

Peize Sun

PhD, since 2020 (HKPFS).

Computer Vision

Peng Xu

PhD, since 2021 (HKU-SUSTech Joint PhD Programme). Co-supervised with Prof. Fengwei An

Computer Vision, Edge Computing

Qiushan Guo

PhD, since 2020. Co-supervised with Prof. Yizhou Yu

Knowledge Distillation, Object Detection, Deep Learning

Runjian Chen

PhD, since 2021 (HKPFS).

Representation Learning, Deep Learning, Autonomous Driving, 3D Computer Vision

Sheng Jin

PhD, since 2020 (HKPFS).

Human Pose Estimation, Deep Learning

Shilong Zhang

PhD, 2023-

Computer Vision

Shoufa Chen

PhD, since 2021 (HKPFS).

Video Understanding, Deep Learning

Teng Wang

PhD, since 2020 (HKU-SUSTech Joint PhD Programme). Co-supervised with Prof. Feng Zheng

Neural Architecture Search, Deep Learning

Tianqi Wang

PhD, since 2020 (HKU-PS).

Autonomous Driving, 3D Object Detection

Yao Lai

PhD, since 2021 (HKPFS).

AI Security, Electronic Design Automation, High Performance Computing

Yao Mu

PhD, since 2021 (HKPFS).

Unsupervised Representation Learning, Reinforcement Learning

Yizhuo Li

PhD, since 2022.

Video Understanding, Self-supervised Learning

Yuanfeng Ji

PhD, since 2020.

Medical Image Analysis, Deep Learning

Yue Yang

PhD, 2022-, Shanghai AI Lab Joint PhD Program

Text-to-Image, LLM

Yuheng Lei

PhD (HKPFS), 2023-

Embodied AI, Reinforcement Learning, Robotics, Autonomous Driving

Zeyue Xue

PhD, since 2022.

Large-scale Deep Learning, Computer Vision

Zhanglin Peng

PhD, since 2020 (University Fellowship UPF). Co-supervised with Prof. Wenping Wang

Normalization Methods, Image Recognition, Object Detection and Semantic Segmentation, Image Demosaicing and Denoising, Deep Learning

Zhixuan Liang

PhD, since 2022 (HKPFS).

Active Learning and Incremental Learning, Open World Detection, Autonomous Driving

Alumni

Enze Xie

PhD, 2019-2022.

Instance-level Detection and Segmentation, Text Understanding, Deep Learning

Jiaming Xie

PhD, 2017-2023, co-supervised with Prof. Wenping Wang

Medical Image, VR/AR

Mingyu Ding

PhD, 2019-2023.

3D Vision, Autonomous Driving, Deep Learning

Nenglun Chen

PhD, 2017-2023. Co-supervised with Prof. Wenping Wang

Geometric Deep Learning, Multimodal Learning

Qiang Zhai

Visitor, 2021-2022.

Autonomous Driving, Robotics

Wenhai Wang

RA, 2019-2020.

Text Understanding, Instance-level Detection and Segmentation, Deep Learning

Wenqi Shao

PhD, since 2018. Co-supervised with Prof. Xiaogang Wang

Normalization Methods, Efficient Neural Nets, Deep Learning

Xingang Pan

PhD, 2017-2021. Co-supervised with Prof. Xiaoou Tang

Generative Models, Deep Learning

Yangyang Xu

Postdoc Fellow, 2021-2023.

Generative Models, Image Editing, Transfer Learning

Yutao Hu

Postdoc Fellow, 2022-2023.

AI for Healthcare, Computer Vision

Yuying Ge

PhD, 2019-2023.

Fashion AI, Deep Learning

Zhaoyang Zhang

PhD, 2019-2023. Co-supervised with Prof. Xiaogang Wang

Efficient Algorithm Design, Optimization, Computer Vision

Zhouxia Wang

PhD, 2020-2023. Co-supervised with Prof. Wenping Wang

Exposure Bracketing Selection, Multi-exposure Fusion and Image Denoising, Image Recognition and Object Detection, Deep Learning

Projects

DeepFashion2

The second edition of DeepFashion, with a full spectrum of fashion image analyses.

Switchable Normalization

Meta-learning to learn a normalization method for each hidden layer in a ConvNet.

Regularization in BN

Understanding Batch Normalization in deep learning.

Traffic Scene Segmentation

Fast scene segmentation by layer cascade deep networks.

Lane Detection

Spatial CNN for Lane Detection.

Understanding Normalization

Understanding Normalization Methods in Deep Learning.

Face Image Generation

Image Generation via GANs.

CUImage Dataset

A large-scale dataset for learning general visual representation.

Face Relationship

A large-scale face relationship dataset.

Language Guided Image Segmentation

Joint learning of image and language.

WIDERFace

A large-scale dense face detection challenge.

DeepFashion

DeepFashion first edition.

Face Model Compression

An extremely fast face recognition system.

Comprehensive Car

A large-scale car re-identification benchmark.

CelebA

Celebrity face dataset for attribute recognition and GANs.

Deep Learning MRF for Image Segmentation

Deep learning for semantic image segmentation.

Pedestrian Detection

Pedestrian Detection via Rich Supervisions.

Pedestrian Parsing

A pedestrian parsing benchmark.

Contact

  • (+852) 2859 2190
  • Room 326, Department of Computer Science, Chow Yei Ching Building, The University of Hong Kong, Pokfulam Road, Hong Kong