TrafficInternVL: Spatially-Guided Fine-Tuning with Caption Refinement for Fine-Grained Traffic Safety Captioning and Visual Question Answering

1

Suggested Citation

Phimsiri S., Sunpawatr S., Cherdchusakulchai R., Kiawjak P., Tosawadi T., Tungjitnob S., Trairattanapa V., Vatathanavaro S., Kudisthalert W., Utintu C., Saetan W., Kongsawat N., Borisuitsawat P., Mahakijdechachai K., Su-Inn N., Thamwiwatthana E., Suttichaya V. TrafficInternVL: Spatially-Guided Fine-Tuning with Caption Refinement for Fine-Grained Traffic Safety Captioning and Visual Question Answering. Proceedings 2025 IEEE Cvf International Conference on Computer Vision Workshops Iccv W 2025 (2025) , 5358-5365. 5365. doi:10.1109/ICCVW69036.2025.00559 Retrieved from: https://repository.li.mahidol.ac.th/handle/123456789/116233

Availability

Collections