Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

dc.contributor.authorCahyawijaya S.
dc.contributor.authorLovenia H.
dc.contributor.authorMoniz J.R.A.
dc.contributor.authorWong T.H.
dc.contributor.authorFarhansyah M.R.
dc.contributor.authorMaung T.T.
dc.contributor.authorHudi F.
dc.contributor.authorAnugraha D.
dc.contributor.authorHabibi M.R.S.
dc.contributor.authorQorib M.R.
dc.contributor.authorAgarwal A.
dc.contributor.authorImperial J.M.
dc.contributor.authorPatel H.L.
dc.contributor.authorFeliren V.
dc.contributor.authorNasution B.I.
dc.contributor.authorRufino M.A.
dc.contributor.authorWinata G.I.
dc.contributor.authorRajagede R.A.
dc.contributor.authorCatalan C.R.
dc.contributor.authorImam M.F.
dc.contributor.authorPattnayak P.
dc.contributor.authorPranida S.Z.
dc.contributor.authorPratama K.
dc.contributor.authorBangera Y.
dc.contributor.authorNa-Thalang A.
dc.contributor.authorMonderin P.N.
dc.contributor.authorSong Y.
dc.contributor.authorSimon C.
dc.contributor.authorNg L.H.X.
dc.contributor.authorSapan R.L.
dc.contributor.authorRafi T.H.
dc.contributor.authorWang B.
dc.contributor.authorSupryadi
dc.contributor.authorVeerakanjana K.
dc.contributor.authorIttichaiwong P.
dc.contributor.authorRoque M.T.
dc.contributor.authorVincentio K.
dc.contributor.authorKreangphet T.
dc.contributor.authorArtkaew P.
dc.contributor.authorPalgunadi K.H.
dc.contributor.authorYu Y.
dc.contributor.authorHastuti R.P.
dc.contributor.authorNixon W.
dc.contributor.authorBangera M.
dc.contributor.authorLim A.X.W.
dc.contributor.authorKhine A.H.
dc.contributor.authorZhafran H.M.
dc.contributor.authorFerdinan T.
dc.contributor.authorIzzani A.A.
dc.contributor.authorSingh A.
dc.contributor.authorEvan
dc.contributor.authorKrito J.A.
dc.contributor.authorAnugraha M.
dc.contributor.authorIlasariya F.A.
dc.contributor.authorLi H.
dc.contributor.authorDaniswara J.A.
dc.contributor.authorTjiaranata F.A.
dc.contributor.authorYulianrifat E.P.
dc.contributor.authorUdomcharoenchaikit C.
dc.contributor.authorAnsori F.R.
dc.contributor.authorIhsani M.K.
dc.contributor.authorNguyen G.
dc.contributor.authorBarik A.M.
dc.contributor.authorVelasco D.J.
dc.contributor.authorGenadi R.A.
dc.contributor.authorSaha S.
dc.contributor.authorWei C.
dc.contributor.authorFlores I.
dc.contributor.authorChen K.K.H.
dc.contributor.authorSantos A.G.
dc.contributor.authorLim W.S.
dc.contributor.authorPhyo K.S.
dc.contributor.authorSantos T.
dc.contributor.authorDwiastuti M.
dc.contributor.authorLuo J.
dc.contributor.authorCruz J.C.B.
dc.contributor.authorHee M.S.
dc.contributor.authorHanif I.A.
dc.contributor.authorAlif Al Hakim M.
dc.contributor.authorSya'ban M.R.
dc.contributor.authorKerdthaisong K.
dc.contributor.authorMiranda L.J.V.
dc.contributor.authorKoto F.
dc.contributor.authorFatyanosa T.N.
dc.contributor.authorAji A.F.
dc.contributor.authorRosal J.J.
dc.contributor.authorKevin J.
dc.contributor.authorWijaya R.
dc.contributor.authorKampman O.P.
dc.contributor.authorZhang R.
dc.contributor.authorKarlsson B.F.
dc.contributor.authorLimkonchotiwat P.
dc.contributor.correspondenceCahyawijaya S.
dc.contributor.otherMahidol University
dc.date.accessioned2025-11-18T18:18:42Z
dc.date.available2025-11-18T18:18:42Z
dc.date.issued2025-01-01
dc.description.abstractDespite Southeast Asia's (SEA) extraordinary linguistic and cultural diversity, the region remains significantly underrepresented in vision-language (VL) research, resulting in AI models that inadequately capture SEA cultural nuances. To fill this gap, we present SEA-VL, an open-source initiative dedicated to developing culturally relevant high-quality datasets for SEA languages. By involving contributors from SEA countries, SEA-VL ensures better cultural relevance and diversity, fostering greater inclusivity of underrepresented languages and cultural depictions in VL research. Our methodology employed three approaches: community-driven crowdsourcing with SEA contributors, automated image crawling, and synthetic image generation. We evaluated each method's effectiveness in capturing cultural relevance. We found that image crawling achieves approximately ∼85% cultural relevance while being more cost- and time-efficient than crowdsourcing, whereas synthetic image generation failed to accurately reflect SEA cultural nuances and contexts. Collectively, we gathered 1.28 million SEA culturally relevant images, more than 50 times larger than other existing datasets. This work bridges the representation gap in SEA, establishes a foundation for developing culturally aware AI systems for this region, and provides a replicable framework for addressing representation gaps in other underrepresented regions.
dc.identifier.citationProceedings of the Annual Meeting of the Association for Computational Linguistics Vol.1 (2025) , 18685-18717
dc.identifier.issn0736587X
dc.identifier.scopus2-s2.0-105021028710
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/113066
dc.rights.holderSCOPUS
dc.subjectComputer Science
dc.subjectSocial Sciences
dc.subjectArts and Humanities
dc.titleCrowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia
dc.typeConference Paper
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105021028710&origin=inward
oaire.citation.endPage18717
oaire.citation.startPage18685
oaire.citation.titleProceedings of the Annual Meeting of the Association for Computational Linguistics
oaire.citation.volume1
oairecerif.author.affiliationUniversity of Toronto
oairecerif.author.affiliationUniversity of Illinois Urbana-Champaign
oairecerif.author.affiliationThe University of Manchester
oairecerif.author.affiliationNational University of Singapore
oairecerif.author.affiliationMonash University
oairecerif.author.affiliationTianjin University
oairecerif.author.affiliationNew York University
oairecerif.author.affiliationCarnegie Mellon University
oairecerif.author.affiliationBrown University
oairecerif.author.affiliationAuburn University
oairecerif.author.affiliationHanyang University
oairecerif.author.affiliationChulalongkorn University
oairecerif.author.affiliationUniversity of Bath
oairecerif.author.affiliationUniversitas Indonesia
oairecerif.author.affiliationPolytechnique Montréal
oairecerif.author.affiliationUniversitas Gadjah Mada
oairecerif.author.affiliationInstitut Teknologi Bandung
oairecerif.author.affiliationNara Institute of Science and Technology
oairecerif.author.affiliationThammasat University
oairecerif.author.affiliationInstitut Teknologi Sepuluh Nopember
oairecerif.author.affiliationMacau University of Science and Technology
oairecerif.author.affiliationBrawijaya University
oairecerif.author.affiliationSiriraj Hospital
oairecerif.author.affiliationKing Mongkut's University of Technology Thonburi
oairecerif.author.affiliationBina Nusantara University
oairecerif.author.affiliationIndian Statistical Institute, Kolkata
oairecerif.author.affiliationTon-Duc-Thang University
oairecerif.author.affiliationSeoul National University of Science and Technology
oairecerif.author.affiliationA-Star, Institute for Infocomm Research
oairecerif.author.affiliationSingapore University of Technology and Design
oairecerif.author.affiliationSrinakharinwirot University
oairecerif.author.affiliationUniversitas Islam Indonesia
oairecerif.author.affiliationAteneo de Manila University
oairecerif.author.affiliationUniversity of New Haven
oairecerif.author.affiliationMohamed Bin Zayed University of Artificial Intelligence
oairecerif.author.affiliationMontreal Institute for Learning Algorithms
oairecerif.author.affiliationUniversitas Pelita Harapan
oairecerif.author.affiliationOracle Corporation
oairecerif.author.affiliationUniversity of the Philippines
oairecerif.author.affiliationVidyasirimedhi Institute of Science and Technology
oairecerif.author.affiliationSingapore Polytechnic
oairecerif.author.affiliationNational University, Philippines
oairecerif.author.affiliationSony Group Corporation
oairecerif.author.affiliationGraphcore Limited
oairecerif.author.affiliationMOH Office for Healthcare Transformation
oairecerif.author.affiliationAllen Institute for AI
oairecerif.author.affiliationAI Singapore
oairecerif.author.affiliationBeijing Academy of Artificial Intelligence (BAAI)
oairecerif.author.affiliationWroclaw Tech
oairecerif.author.affiliationCohere
oairecerif.author.affiliationSCB 10X
oairecerif.author.affiliationMeta
oairecerif.author.affiliationSamsung R&D Institute Philippines
oairecerif.author.affiliationCapital One
oairecerif.author.affiliationWorks Applications Lab
oairecerif.author.affiliationSEACrowd
oairecerif.author.affiliationDataxet:Sonar
oairecerif.author.affiliationIndoNLP

Files

Collections