ACL2025

Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation

Qiyue Gao, Xinyu Pi, Kevin Liu, Junrong Chen, Ruolan Yang, Xinqi Huang, Xinyu Fang, Lu Sun, Gautham Kishore, Bo Ai, Stone Tao, Mengyang Liu, Jiaxi Yang, Chao-Jung Lai, Chuanyang Jin, Jiannan Xiang, Benhao Huang, Zeming Chen, David Danks, Hao Su, Tianmin Shu, Ziqiao Ma, Lianhui Qin, Zhiting Hu

Abstract

OpenAI. 2025b. Openai o3 and o4-mini system card. https://cdn.openai.com/pdf/2221c875-02d c-4789-800b-e7758f3722c1/o3-and-o4-min i-system-card.pdf . Accessed May 31, 2025.