Revision Matters: Generative Design Guided by Revision Edits

Li, Tao; Cheng, Chin-Yi; Xie, Amber; Li, Gang; Li, Yang

Computer Science > Human-Computer Interaction

arXiv:2406.18559 (cs)

[Submitted on 27 May 2024]

Title:Revision Matters: Generative Design Guided by Revision Edits

Authors:Tao Li, Chin-Yi Cheng, Amber Xie, Gang Li, Yang Li

View PDF HTML (experimental)

Abstract:Layout design, such as user interface or graphical layout in general, is fundamentally an iterative revision process. Through revising a design repeatedly, the designer converges on an ideal layout. In this paper, we investigate how revision edits from human designer can benefit a multimodal generative model. To do so, we curate an expert dataset that traces how human designers iteratively edit and improve a layout generation with a prompted language goal. Based on such data, we explore various supervised fine-tuning task setups on top of a Gemini multimodal backbone, a large multimodal model. Our results show that human revision plays a critical role in iterative layout refinement. While being noisy, expert revision edits lead our model to a surprisingly strong design FID score ~10 which is close to human performance (~6). In contrast, self-revisions that fully rely on model's own judgement, lead to an echo chamber that prevents iterative improvement, and sometimes leads to generative degradation. Fortunately, we found that providing human guidance plays at early stage plays a critical role in final generation. In such human-in-the-loop scenario, our work paves the way for iterative design revision based on pre-trained large multimodal models.

Subjects:	Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2406.18559 [cs.HC]
	(or arXiv:2406.18559v1 [cs.HC] for this version)
	https://doi.org/10.48550/arXiv.2406.18559

Submission history

From: Yang Li [view email]
[v1] Mon, 27 May 2024 17:54:51 UTC (509 KB)

Computer Science > Human-Computer Interaction

Title:Revision Matters: Generative Design Guided by Revision Edits

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Human-Computer Interaction

Title:Revision Matters: Generative Design Guided by Revision Edits

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators