TY - JOUR
T1 - All-in-one Weather-degraded Image Restoration via Adaptive Degradation-aware Self-prompting Model
AU - Wen, Yuanbo
AU - Gao, Tao
AU - Li, Ziqi
AU - Zhang, Jing
AU - Zhang, Kaihao
AU - Chen, Ting
N1 - © 2025 The Author(s)
PY - 2025/1
Y1 - 2025/1
N2 - Existing approaches for all-in-one weather-degraded image restoration suffer from inefficiencies in leveraging degradation-aware priors, resulting in sub-optimal performance in adapting to different weather conditions. To this end, we develop an adaptive degradation-aware self-prompting model (ADSM) for all-in-one weather-degraded image restoration. Specifically, our model employs the contrastive language-image pre-training model (CLIP) to facilitate the training of our proposed latent prompt generators (LPGs), which represent three types of latent prompts to characterize the degradation type, degradation property and image caption. Moreover, we integrate the acquired degradation-aware prompts into the time embedding of diffusion model to improve degradation perception. Meanwhile, we employ the latent caption prompt to guide the reverse sampling process using the cross-attention mechanism, thereby guiding the accurate image reconstruction. Furthermore, to accelerate the reverse sampling procedure of diffusion model and address the limitations of frequency perception, we introduce a wavelet-oriented noise estimating network (WNE-Net). Extensive experiments conducted on eight publicly available datasets demonstrate the effectiveness of our proposed approach in both task-specific and all-in-one applications.
AB - Existing approaches for all-in-one weather-degraded image restoration suffer from inefficiencies in leveraging degradation-aware priors, resulting in sub-optimal performance in adapting to different weather conditions. To this end, we develop an adaptive degradation-aware self-prompting model (ADSM) for all-in-one weather-degraded image restoration. Specifically, our model employs the contrastive language-image pre-training model (CLIP) to facilitate the training of our proposed latent prompt generators (LPGs), which represent three types of latent prompts to characterize the degradation type, degradation property and image caption. Moreover, we integrate the acquired degradation-aware prompts into the time embedding of diffusion model to improve degradation perception. Meanwhile, we employ the latent caption prompt to guide the reverse sampling process using the cross-attention mechanism, thereby guiding the accurate image reconstruction. Furthermore, to accelerate the reverse sampling procedure of diffusion model and address the limitations of frequency perception, we introduce a wavelet-oriented noise estimating network (WNE-Net). Extensive experiments conducted on eight publicly available datasets demonstrate the effectiveness of our proposed approach in both task-specific and all-in-one applications.
KW - all-in-one method
KW - computer vision
KW - diffusion model
KW - prompt learning
KW - weather-degraded image restoration
UR - https://www.scopus.com/pages/publications/85216858575
U2 - 10.1109/TMM.2025.3535316
DO - 10.1109/TMM.2025.3535316
M3 - Article
AN - SCOPUS:85216858575
SN - 1520-9210
VL - 27
JO - IEEE Transactions on Multimedia
JF - IEEE Transactions on Multimedia
ER -