CVPR 2025

A Unified Image-Dense Annotation Generation Model for Underwater Scenes

Hongkai Lin, Dingkang Liang, Zhenghao Qi, Xiang Bai

Abstract

Figure 1. We present TIDE, a unified underwater image-dense annotation generation model. Its core lies in the shared layout information and the natural complementarity between multimodal features. Derived from a text-to-image model and fine-tuned on underwater data, our model generates highly consistent underwater image-dense annotations from text conditions alone.