Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/29612
Full metadata record
DC FieldValueLanguage
dc.contributor.authorIslam, T-
dc.contributor.authorMiron, A-
dc.contributor.authorLiu, X-
dc.contributor.authorLi, Y-
dc.date.accessioned2024-08-27T11:03:23Z-
dc.date.available2024-08-27T11:03:23Z-
dc.date.issued2024-08-08-
dc.identifierORCiD: Tasin Islam https://orcid.org/0000-0001-7568-9322-
dc.identifierORCiD: Alina Miron https://orcid.org/0000-0002-0068-4495-
dc.identifierORCiD: Xiaohui Liu https://orcid.org/0000-0003-1589-1267-
dc.identifierORCiD: Yongmin Li https://orcid.org/0000-0003-1668-2440-
dc.identifier287-
dc.identifier.citationIslam, T. et al. (2024) 'Dynamic Fashion Video Synthesis from Static Imagery', Future Internet, 16 (8), 287, pp. 1 - 21. doi: 10.3390/fi16080287.en_US
dc.identifier.urihttps://bura.brunel.ac.uk/handle/2438/29612-
dc.descriptionData Availability Statement: This paper did not generate any new data.en_US
dc.description.abstractOnline shopping for clothing has become increasingly popular among many people. However, this trend comes with its own set of challenges. For example, it can be difficult for customers to make informed purchase decisions without trying on the clothes to see how they move and flow. We address this issue by introducing a new image-to-video generator called FashionFlow to generate fashion videos to show how clothing products move and flow on a person. By utilising a latent diffusion model and various other components, we are able to synthesise a high-fidelity video conditioned by a fashion image. The components include the use of pseudo-3D convolution, VAE, CLIP, frame interpolator and attention to generate a smooth video efficiently while preserving vital characteristics from the conditioning image. The contribution of our work is the creation of a model that can synthesise videos from images. We show how we use a pre-trained VAE decoder to process the latent space and generate a video. We demonstrate the effectiveness of our local and global conditioners, which help preserve the maximum amount of detail from the conditioning image. Our model is unique because it produces spontaneous and believable motion using only one image, while other diffusion models are either text-to-video or image-to-video using pre-recorded pose sequences. Overall, our research demonstrates a successful synthesis of fashion videos featuring models posing from various angles, showcasing the movement of the garment. Our findings hold great promise for improving and enhancing the online fashion industry’s shopping experience.en_US
dc.description.sponsorshipEngineering and Physical Sciences Research Council (EPSRC) grant number EP/T518116/1.en_US
dc.format.extent1 - 21-
dc.format.mediumElectronic-
dc.languageEnglish-
dc.language.isoen_USen_US
dc.rightsCopyright © 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).-
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/-
dc.subjectdiffusion modelsen_US
dc.subjectfashion synthesisen_US
dc.subjectgenerative AIen_US
dc.subjectimage-to-video synthesisen_US
dc.titleDynamic Fashion Video Synthesis from Static Imageryen_US
dc.typeArticleen_US
dc.date.dateAccepted2024-08-07-
dc.identifier.doihttps://doi.org/10.3390/fi16080287-
dc.relation.isPartOfFuture Internet-
pubs.issue8-
pubs.publication-statusPublished online-
pubs.volume16-
dc.identifier.eissn1999-5903-
dc.rights.licensehttps://creativecommons.org/licenses/by/4.0/legalcode.en-
dc.rights.holderThe authors-
Appears in Collections:Dept of Computer Science Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).17.11 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons