Generative AI Shanshui animation enhancement using Perlin noise and diffusion models

dc.contributor.authorWattanachote K.
dc.contributor.authorLin C.Y.
dc.contributor.authorHsu S.E.
dc.contributor.authorShih T.K.
dc.contributor.correspondenceWattanachote K.
dc.contributor.otherMahidol University
dc.date.accessioned2026-02-16T18:12:53Z
dc.date.available2026-02-16T18:12:53Z
dc.date.issued2026-12-01
dc.description.abstractDeep learning models have achieved remarkable advancements in image generation but face persistent challenges in synthesizing traditional Shanshui (mountain-water) landscape paintings due to limited domain-specific training data and the complexity of aesthetic principles. This study integrated Perlin Noise, Stable Diffusion, ControlNet, and AnimateDiff to enhance Shanshui landscape generation and animation. Perlin Noise constructs naturalistic skeletal structures, which are further refined using ControlNet for precise structural control. Advanced prompt engineering with GPT-4 and Textual Inversion improved prompt descriptiveness and mitigated low-quality outputs. Furthermore, LoRA fine-tuning improved the adaptability of our Shanshui landscapes model. Integrating I2V Encoders and AnimateDiff enabled the seamless transformation of static landscape images into dynamic animations, preserving artistic authenticity while introducing motion consistency. The experimental results demonstrated significant improvements in realism, stylistic fidelity, and diversity, addressing key limitations in existing generative approaches. This framework not only advances the field of generative AI in digital art but also offers new opportunities for the creation of multimedia content and cultural preservation through the synthesis of computational Shanshui animation.
dc.identifier.citationDiscover Artificial Intelligence Vol.6 No.1 (2026)
dc.identifier.doi10.1007/s44163-025-00797-6
dc.identifier.eissn27310809
dc.identifier.scopus2-s2.0-105029636764
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/115094
dc.rights.holderSCOPUS
dc.subjectComputer Science
dc.titleGenerative AI Shanshui animation enhancement using Perlin noise and diffusion models
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105029636764&origin=inward
oaire.citation.issue1
oaire.citation.titleDiscover Artificial Intelligence
oaire.citation.volume6
oairecerif.author.affiliationMahidol University
oairecerif.author.affiliationNational Central University

Files

Collections