Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

1

Suggested Citation

Cahyawijaya S., Lovenia H., Moniz J.R.A., Wong T.H., Farhansyah M.R., Maung T.T., Hudi F., Anugraha D., Habibi M.R.S., Qorib M.R., Agarwal A., Imperial J.M., Patel H.L., Feliren V., Nasution B.I., Rufino M.A., Winata G.I., Rajagede R.A., Catalan C.R., Imam M.F., Pattnayak P., Pranida S.Z., Pratama K., Bangera Y., Na-Thalang A., Monderin P.N., Song Y., Simon C., Ng L.H.X., Sapan R.L., Rafi T.H., Wang B., Supryadi, Veerakanjana K., Ittichaiwong P., Roque M.T., Vincentio K., Kreangphet T., Artkaew P., Palgunadi K.H., Yu Y., Hastuti R.P., Nixon W., Bangera M., Lim A.X.W., Khine A.H., Zhafran H.M., Ferdinan T., Izzani A.A., Singh A., Evan, Krito J.A., Anugraha M., Ilasariya F.A., Li H., Daniswara J.A., Tjiaranata F.A., Yulianrifat E.P., Udomcharoenchaikit C., Ansori F.R., Ihsani M.K., Nguyen G., Barik A.M., Velasco D.J., Genadi R.A., Saha S., Wei C., Flores I., Chen K.K.H., Santos A.G., Lim W.S., Phyo K.S., Santos T., Dwiastuti M., Luo J., Cruz J.C.B., Hee M.S., Hanif I.A., Alif Al Hakim M., Sya'ban M.R., Kerdthaisong K., Miranda L.J.V., Koto F., Fatyanosa T.N., Aji A.F., Rosal J.J., Kevin J., Wijaya R., Kampman O.P., Zhang R., Karlsson B.F., Limkonchotiwat P. Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia. Proceedings of the Annual Meeting of the Association for Computational Linguistics Vol.1 (2025) , 18685-18717. 18717. Retrieved from: https://repository.li.mahidol.ac.th/handle/123456789/113066

Availability

Collections