CVPR2025
The Photographer's Eye: Teaching Multimodal Large Language Models to See, and Critique Like Photographers
Daiqing Qi, Handong Zhao, Jing Shi, Simon Jenni, Yifei Fan, Franck Dernoncourt, Scott Cohen, Sheng Li
摘要
User: Are there any weaknesses in this photograph? Q-Instruct: The image features a beautiful sunset over a lush green hillside, with the sun's rays shining through the trees. The scene captures the vibrant colors of the sunset, creating a serene and picturesque atmosphere. The hillside is covered in lush green trees, providing a sense of tranquility and natural beauty. The combination of the sunset and the verdant landscape creates a captivating and visually stunning scene. AesExpert: This image looks very clear. The colors in this image are rich and vibrant, with a good color combination, making the picture appear more vivid and bright. The lighting in this image is strong, creating a warm and soft atmosphere, with a strong contrast between light and dark. The composition of this image is a high horizon composition, highlighting the texture in the lower half of the picture. Therefore, this image looks quite beautiful. PhotoEye: The composition of this image could be improved by straightening the horizon to create a more balanced and visually appealing frame. Additionally, the presence of lens flare may be distracting and detract from the overall impact of the image. To address this, the photographer could consider using a lens hood or adjusting their shooting angle to minimize the effect of the flare. Furthermore, the tilted horizon could be corrected to create a more stable and visually appealing composition. Finally, the lighting could be more nuanced, with a more subtle transition between highlights and shadows to create a more natural and inviting atmosphere.