EMNLP 2025

MovieCORE: COgnitive REasoning in Movies

Gueter Josmy Faure, Min-Hung Chen, Jia-Fong Yeh, Ying Cheng, Hung-Ting Su, Yung-Hao Tang, Shang-Hong Lai, Winston H. Hsu

Abstract

Figure 1: Beyond Shallow Video Understanding. The proposed benchmark, MovieCORE, challenges vision-language models (VLMs) to understand the subtle interplay between emotions (Top, Middle), character dynamics and causality (Middle, Bottom), and psychological complexity (Top, Middle). From empathy to introspection, from wisdom to curiosity, MovieCORE tests VLMs' ability to comprehend the deeper elements of movies.

Panel labels: Emotional/Psychological States; Character Contrasts; Cause-Effect Relationships.

Example annotations shown in the figure:

Contrasts wisdom and warmth (old age) with energy and curiosity (youth). (Q: How are intergenerational themes demonstrated through specific scenes in the video?)

Captures shift in emotional state due to external factors. (Q: How do changes in settings impact the elderly character's emotions and sense of identity?)