A Discriminant Correlation Neural Network for Feature Representation Learning

dc.contributor.authorGao L.
dc.contributor.authorLiu K.
dc.contributor.authorTermritthikun C.
dc.contributor.authorMuneesawang P.
dc.contributor.authorHnoohom N.
dc.contributor.authorGuan L.
dc.contributor.correspondenceGao L.
dc.contributor.otherMahidol University
dc.date.accessioned2025-12-25T18:42:47Z
dc.date.available2025-12-25T18:42:47Z
dc.date.issued2025-01-01
dc.description.abstractThe progress of multimedia computing, particularly through deep neural networks (DNNs), has gained remarkable attention across a wide array of research domains and practical applications, such as image classification, video computing, natural language processing, among others. However, the inherent black-box characteristics and concern for sustainability of DNNs have emerged as significant challenges, especially when dealing with multi-view data. To tackle these challenges, alternative learning models have been actively explored in recent years, with green learning and interpretable learning standing out as two key directions. In this study, we explore one such model aimed at two-view feature representation learning. This proposed model leverages a perceptron-style neural network grounded in discriminant correlation analysis, which is named DC-PNN. The model incorporates statistical machine learning (SML) techniques to optimize network training. To evaluate its effectiveness and generalizability, we conduct experiments on audio emotion recognition and action recognition on two public data sets (i.e., RML emotional database and NTU RGB+D 120 database). Then, move forward to a real application, Glaucoma image classification. Experimental results demonstrate that the DC-PNN model outperforms state-of-the-art (SOTA) methods. In addition, the parameters and FLOPs analysis are implemented, validating the advantage of the model in terms of efficiency and resource requirements.
dc.identifier.citationProceedings of the International Conference on Multimedia Information Processing and Retrieval Mipr (2025) , 277-282
dc.identifier.doi10.1109/MIPR67560.2025.00051
dc.identifier.eissn27704319
dc.identifier.issn27704327
dc.identifier.scopus2-s2.0-105025009240
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/113656
dc.rights.holderSCOPUS
dc.subjectComputer Science
dc.subjectEngineering
dc.titleA Discriminant Correlation Neural Network for Feature Representation Learning
dc.typeConference Paper
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105025009240&origin=inward
oaire.citation.endPage282
oaire.citation.startPage277
oaire.citation.titleProceedings of the International Conference on Multimedia Information Processing and Retrieval Mipr
oairecerif.author.affiliationMahidol University
oairecerif.author.affiliationToronto Metropolitan University
oairecerif.author.affiliationNaresuan University

Files

Collections