CelebV-Text contains 70,000 video clips with a total duration of around 279 hours. Each video is accompanied by 20 sentences describing 6 designed attributes, ...
Mar 26, 2023 · CelebV-Text comprises 70,000 in-the-wild face video clips with diverse visual content, each paired with 20 texts generated using the proposed ...
CelebV-Text contains 70,000 in-the-wild face video clips covering diverse visual content. Each video clip is paired with 20 texts generated by the proposed semi ...
This paper presents CelebV-Text, a large-scale, diverse, and high-quality dataset of facial text-video pairs, to facilitate research on facial text-to-video ...
The effectiveness and potential of CelebV-Text are shown through extensive self-evaluation, and a benchmark is constructed with representative methods to ...
CelebV-Text comprises 70,000 in-the-wild face video clips with diverse visual content, each paired with 20 texts generated using the proposed semi-automatic ...
This paper presents CelebV-Text, a large-scale, diverse, and high-quality dataset of facial text-video pairs, to facilitate research on facial text-to-video ...
CelebV-Text comprises 70,000 in-the-wild face video clips with diverse visual content, each paired with 20 texts generated using the proposed semi-automatic ...
Jul 11, 2024 · This is a large, high-quality dataset of text and video pairs. CelebV-Text is a dataset of 70,000 diverse facial video clips, each with 20 ...