Kai Chen avatar

Kai Chen (陈铠)

Ph.D. Candidate @ HKUST

Email  /  CV  /  Github  /  Google Scholar
About Me

I am currently a Ph.D. candidate in CSE department of Hong Kong University of Science and Technology (HKUST), supervised by Prof. Dit-Yan Yeung. Previously, I was an undergraduate student majoring in Computer Science in Fudan University honored as Outstanding Undergraduate of Shanghai, supervised by Prof. Yanwei Fu. My research interests include Machine Learning and Artificial Intelligence, especially in representation learning, vision perception and generative modeling (AIGC).

News
  • [2023.10] A recent survey shows the remarkble GPT-4V still suffers from corner cases from our CODA dataset!
  • [2023.10] The first round of our geometric-aware controllable generation series of works, GeoDiffusion (2D detection), MagicDrive (3D detection) and Geom-Erasing (implicit concept removal), is on Arxiv!
  • [2023.10] Our MagicDrive is reported by Xinzhiyuan, and Mistake Analysis is reported by Liangziwei!
  • [2023.10] Invited to serve as a reviewer for CVPR 2024, ICLR 2024!
  • [2023.05] Our papers MixedAE (CVPR 2023), MoCE (ICLR 2023) and CODA (ECCV 2022) will be presented in VALSE 2023! See you in Wuxi!
  • [2023.05] One paper accepted by Workshop of Self-supervised Visual Representation Learning, VALSE 2023 (spotlight)!
  • [2023.05] One paper accepted by Workshop of Autonomous Driving, VALSE 2023 (spotlight)!
  • [2023.03] Invited to serve as a reviewer for NeurIPS 2023!
  • [2023.02] One paper accepted by CVPR 2023! See you in Vancouver!
  • [2023.01] One paper accepted by ICLR 2023 (spotlight Top25%)! Happy Lunar New Year!
  • [2023.01] Invited to serve as a reviewer for ICCV 2023, IJCAI 2023!
  • [2022.11] Invited to serve as a reviewer for CVPR 2023!
  • [2022.08] Our CODA dataset will be utilized to hold the 2nd SSLAD ECCV 2022 workshop and competition at CodaLab!
  • [2022.08] Invited to serve as a reviewer for ICLR 2023!
  • [2022.07] One paper accepted by ECCV 2022!
  • [2022.06] Invited to serve as a reviewer for TIP!
  • [2022.05] Invited to serve as a reviewer for NeurIPS 2022, ECCV 2022!
  • [2021.12] One paper accepted by AAAI 2022!
  • [2021.11] Invited to serve as a reviewer for CVPR 2022, ICRA 2022 and AAAI 2022!
  • [2021.10] One paper accepted by NeurIPS 2021!
  • [2021.07] One paper accepted by ICCV 2021!
  • [2021.07] Our SODA10M dataset will be utilized to hold the SSLAD ICCV 2021 workshop on Self-supervised Learning for Next-Generation Industry-level Autonomous Driving. All are welcome!
  • [2021.06] Invited to serve as a reviewer for NeurIPS 2021!
  • [2020.06] Successful undergrad thesis defend!
  • [2020.03] One paper accepted by IEEE Access!
  • [2019.06] One paper accepted by IROS 2019!
Publications

(* denotes equal contribution)

AIGC Harmfulness
mistake_analysis.png

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis

Kai Chen*, Chunwei Wang*, Kuo Yang, Jianhua Han, Lanqing Hong, Fei Mi, Hang Xu, Zhengying Liu, Wenyong Huang, Zhenguo Li, Dit-Yan Yeung, Lifeng Shang, Xin Jiang, Qun Liu.

"A fall into the pit is a gain in your wit" - LLM can learn from mistakes like human beings!

Arxiv preprint, 2023.

[PDF] [Wechat Post]
AIGC Helpfulness - Geometric-aware Perception Data Generation
magicdrive.png

MagicDrive: Street View Generation with Diverse 3D Geometry Control

Ruiyuan Gao*, Kai Chen*, Enze Xie, Lanqing Hong, Zhenguo Li, Dit-Yan Yeung, Qiang Xu.

Fully advanced geometric-aware diffusion model for object detection systems!

Arxiv preprint, 2023.

[PDF][Project page][Wechat Post]
geom_erasing.png

Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models

Zhili Liu*, Kai Chen*, Yifan Zhang, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung, James Kwok.

First utilization of controllable diffusion models for concept removal!

Arxiv preprint, 2023.

[PDF]
geodiffusion.png

GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data Generation

Kai Chen*, Enze Xie*, Zhe Chen, Yibo Wang, Lanqing Hong, Zhenguo Li, Dit-Yan Yeung.

First geometric-aware AIGC work beneficial for object detection systems!

Arxiv preprint, 2023.

[PDF][Project page]
AIGC Helpfulness - Efficient Self-supervised Learning
mixedae.png

Mixed Autoencoder for Self-supervised Visual Representation Learning

Kai Chen, Zhili Liu, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung.

IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2023.

Workshop of Self-supervised Visual Representation Learning, Vision and Learning Seminar (VALSE), 2023 (spotlight).

[PDF][Wechat Post][Video]
moce.png

Task-customized Masked Autoencoder via Mixture of Cluster-conditional Experts

Zhili Liu, Kai Chen, Jianhua Han, Lanqing HONG, Hang Xu, Zhenguo Li, James Kwok.

International Conference on Learning Representations (ICLR), 2023 (spotlight Top25%).

[PDF][Wechat Post]
sdr.png

Task-Customized Self-Supervised Pre-training with Scalable Dynamic Routing.

Zhili Liu, Jianhua Han, Kai Chen, Lanqing Hong, Hang Xu, Chunjing Xu, Zhenguo Li.

AAAI Conference on Artificial Intelligence (AAAI), 2022.

[PDF]
MultiSiam.png

MultiSiam: Self-supervised Multi-instance Siamese Representation Learning for Autonomous Driving

Kai Chen, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung.

IEEE/CVF International Conference on Computer Vision (ICCV), 2021.

[PDF] [Zhihu]
Others
coda.png

CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving

Kaican Li*, Kai Chen*, Haoyu Wang*, Lanqing Hong, Chaoqiang Ye, Jianhua Han, Yukuai Chen, Wei Zhang, Chunjing Xu, Dit-Yan Yeung, Xiaodan Liang, Zhenguo Li, Hang Xu.

First open-sourced large-scale real-life corner case dataset!

European Conference on Computer Vision (ECCV), 2022.

Workshop of Automonous Driving, Vision and Learning Seminar (VALSE), 2023 (spotlight).

[PDF] [Website] [ECCV 2022 Workshop] [GPT-4V still suffers from CODA]
SODA10M.png

SODA10M: A Large-Scale 2D Self/Semi-Supervised Object Detection Dataset for Autonomous Driving.

Jianhua Han, Xiwen Liang, Hang Xu, Kai Chen, Lanqing Hong, Jiageng Mao, Chaoqiang Ye, Wei Zhang, Zhenguo Li, Xiaodan Liang, Chunjing Xu.

Datasets and Benchmarks Track, Neural Information Processing Systems (NeurIPS), 2021.

[PDF] [Website] [ICCV 2021 Workshop]
Access2020.png

Automatic Dense Annotation for Monocular 3D Scene Understanding.

Md. Alimoor Reza, Kai Chen, Akshay Naik, David Crandall, and Soon-Heung Jung.

IEEE Access Journal (IEEE Access), 2020.

[PDF]
IROS2019.png

Automatic Annotation for Semantic Segmentation in Indoor Scenes.

Md Alimoor Reza, Akshay Naik, Kai Chen, David Crandall.

IEEE International Conference on Intelligent Robots and Systems (IROS), 2019.

[PDF]
Talks
  • [VALSE 2023@Wuxi] Mixed Autoencoder for Self-supervised Visual Representation Learning. [Recording]
  • [VALSE 2023@Wuxi] CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving. [Recording]
Experiences
Sensetime
Shanghai, China
Oct. 2019 - Apr. 2020
Research Intern, supervised by Dr. Wenxiu Sun
Indiana University Bloomington
Indiana, U.S.A.
June 2019 - Sep. 2019
Visiting Scholar at Computer Vision lab, supervised by Prof. David Crandall
University of Manchester
Manchester, U.K.
Sep. 2018 - Jan. 2019
International exchange student, supervised by Dr. Tingting Mu
Services

Program committee: 2nd SSLAD workshop at ECCV 2022.

Reviewer:
  • Conference: CVPR 2024/2023/2022, ICLR 2024/2023, NeurIPS 2023/2022/2021, ICCV 2023, IJCAI 2023, ECCV 2022, ICRA 2022, AAAI 2022.
  • Journal: TIP and IEEE Access.
Awards

HKUST Postgraduate Scholarship

2020

Scholarship for Outstanding Graduates of Fudan University [post]

2020

Joel & Ruth Spira Scholarship

2019

Oversea Visiting Student Stipend of Fudan University

2019

National Scholarship

2018

Scholarship for Outstanding Undergraduates of Fudan University

2017
Interest

I love basketball and I'm also a big fan of Stepfen Curry, MVP point guard of Golden State Warriors, NBA. I'm a team member of my class's basketball team and often play Score / Power forward (SF/PF). In my spare time, I also play the role of a basketball game referee. Hope one day I can have a chance to see a home game of Warriors in Chase Center San Francisco!