MOWA: Multiple-in-One Image Warping Model

Liao, Kang; Yue, Zongsheng; Wu, Zhonghua; Loy, Chen Change

arXiv:2404.10716 (cs)

[Submitted on 16 Apr 2024 (v1), last revised 17 Jun 2024 (this version, v2)]

Abstract: While recent image warping approaches have achieved remarkable success on existing benchmarks, they still require a separate model to be trained for each specific task and do not generalize well to different camera models or customized manipulations. To address the diverse types of warping encountered in practice, we propose a Multiple-in-One image WArping model (named MOWA). Specifically, we mitigate the difficulty of multi-task learning by disentangling motion estimation at both the region level and the pixel level. To further enable dynamic task-aware image warping, we introduce a lightweight point-based classifier that predicts the task type, which serves as a prompt to modulate the feature maps for more accurate estimation. To our knowledge, this is the first work that solves multiple practical warping tasks with a single model. Extensive experiments demonstrate that MOWA, which is trained on six tasks for multiple-in-one image warping, outperforms state-of-the-art task-specific models across most tasks. Moreover, MOWA shows promising potential to generalize to unseen scenes, as evidenced by cross-domain and zero-shot evaluations. The code and more visual results can be found on the project page: this https URL.

Comments:	Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2404.10716 [cs.CV]
	(or arXiv:2404.10716v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2404.10716

Submission history

From: Kang Liao
[v1] Tue, 16 Apr 2024 16:50:35 UTC (21,793 KB)
[v2] Mon, 17 Jun 2024 14:57:39 UTC (8,571 KB)
