IF-VidCap: Can Video Caption Models Follow Instructions?

Shihao Li
Shihao Li
Master Student

Master student at NJU-LINK Lab, passionate about artificial intelligence research.

Yanghai Wang
Yanghai Wang
Master Student

Master student at NJU-LINK Laboratory, focused on artificial intelligence and machine learning research.

Qianqian Xie
Qianqian Xie
硕士研究生

Master student at NJU-LINK Lab, passionate about artificial intelligence research.

Zhaoxiang Zhang
Zhaoxiang Zhang
Professor, PhD Supervisor

Cheung Kong Scholar of the Ministry of Education, Young Top-notch Talent of the National Ten Thousand Talents Program, New Century Excellent Talent of the Ministry of Education.

Jiaheng Liu
Jiaheng Liu
Assistant Professor, PhD Supervisor

Alibaba Star, one of the founding members of Multimodal Art Projection (M-A-P). Expert in large language models and multimodal large models.