Bio

I am a second-year Master's student at the Soolab at ShanghaiTech University, advised by Prof. Sibei Yang. Previously, I obtained my Bachelor's degree from ShanghaiTech University in 2022. My research interests lie in computer vision, natural language processing, and their intersection. My current research focuses on open-vocabulary detection and prompt tuning for vision-language models.

Publications

* denotes equal contribution; † denotes corresponding author.

Zip-Your-CLIP: CLIP Itself is a Good Object-detector

ICLR 2024

Cheng Shi and Sibei Yang†

[Paper] [Code]

Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator

NeurIPS 2023

Hanzhuo Huang*, Yufan Feng*, Cheng Shi, Lan Xu, Jingyi Yu, and Sibei Yang†

[Paper] [Code]

LoGoPrompt: Synthetic Text Images Can Be Good Visual Prompts for Vision-Language Models

ICCV 2023

Cheng Shi and Sibei Yang†

[Project page] [Paper]

EdaDet: Open-Vocabulary Object Detection Using Early Dense Alignment

ICCV 2023

Cheng Shi and Sibei Yang†

[Project page] [Paper]

Contrastive Grouping with Transformer for Referring Image Segmentation

CVPR 2023

Jiajin Tang, Ge Zheng, Cheng Shi, and Sibei Yang†

[Paper] [Code]

DreamFace: Progressive Generation of Animatable 3D Faces under Text Guidance

SIGGRAPH 2023

Longwen Zhang*, Qiwei Qiu*, Hongyang Lin*, Qixuan Zhang, Cheng Shi, Wei Yang, Ye Shi, Sibei Yang†, Lan Xu†, and Jingyi Yu†

[Project page] [Paper] [Video]

Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding

ECCV 2022

Cheng Shi and Sibei Yang†

[Paper] [Code]