Bio

I am a second-year Master's student at the Soolab at ShanghaiTech University, advised by Prof. Sibei Yang. Previously, I obtained my Bachelor's degree from ShanghaiTech University in 2022. My research interests lie in computer vision, natural language processing, and their intersection. My current research focuses on open-vocabulary detection and prompt tuning for vision-language models.

Publications

* denotes equal contribution and † denotes corresponding author

Zip-Your-CLIP: CLIP Itself is a Good Object-detector
ICLR 2024
Cheng Shi and Sibei Yang†
Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator
NeurIPS 2023
Hanzhuo Huang*, Yufan Feng*, Cheng Shi, Lan Xu, Jingyi Yu, and Sibei Yang†
LoGoPrompt: Synthetic Text Images Can Be Good Visual Prompts for Vision-Language Models
ICCV 2023
Cheng Shi and Sibei Yang†
EdaDet: Open-Vocabulary Object Detection Using Early Dense Alignment
ICCV 2023
Cheng Shi and Sibei Yang†
Contrastive Grouping with Transformer for Referring Image Segmentation
CVPR 2023
Jiajin Tang, Ge Zheng, Cheng Shi, and Sibei Yang†
DreamFace: Progressive Generation of Animatable 3D Faces under Text Guidance
SIGGRAPH 2023
Longwen Zhang*, Qiwei Qiu*, Hongyang Lin*, Qixuan Zhang, Cheng Shi, Wei Yang, Ye Shi, Sibei Yang†, Lan Xu†, and Jingyi Yu†
Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding
ECCV 2022
Cheng Shi and Sibei Yang†