Publication: Set Prediction in the Latent Space
Issued Date
2021-01-01
Resource Type
ISSN
10495258
Other identifier(s)
2-s2.0-85131963371
Rights
Mahidol University
Rights Holder(s)
SCOPUS
Bibliographic Citation
Advances in Neural Information Processing Systems. Vol.31, (2021), 25516-25527
Suggested Citation
Konpat Preechakul, Chawan Piansaddhayanon, Burin Naowarat, Tirasan Khandhawit, Sira Sriswasdi, Ekapol Chuangsuwanich Set Prediction in the Latent Space. Advances in Neural Information Processing Systems. Vol.31, (2021), 25516-25527. Retrieved from: https://repository.li.mahidol.ac.th/handle/20.500.14594/76688
Research Projects
Organizational Units
Authors
Journal Issue
Thesis
Title
Set Prediction in the Latent Space
Other Contributor(s)
Abstract
Set prediction tasks require the matching between predicted set and ground truth set in order to propagate the gradient signal. Recent works have performed this matching in the original feature space thus requiring predefined distance functions. We propose a method for learning the distance function by performing the matching in the latent space learned from encoding networks. This method enables the use of teacher forcing which was not possible previously since matching in the feature space must be computed after the entire output sequence is generated. Nonetheless, a naive implementation of latent set prediction might not converge due to permutation instability. To address this problem, we provide sufficient conditions for permutation stability which begets an algorithm to improve the overall model convergence. Experiments on several set prediction tasks, including image captioning and object detection, demonstrate the effectiveness of our method. Code is available at https://github.com/phizaz/latent-set-prediction.