Home
News
People
Publications
Recruitment
Light
Dark
Automatic
English
中文 (简体)
An Ping
Latest
IF-VidCap: Can Video Caption Models Follow Instructions?
MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues
Cite
×