Similarity measurement for sentiment classification on textual reviews

Tan Thongtan; Tanasanee Phienthrakul

Publication:
Similarity measurement for sentiment classification on textual reviews

dc.contributor.author	Tan Thongtan	en_US
dc.contributor.author	Tanasanee Phienthrakul	en_US
dc.contributor.other	Mahidol University	en_US
dc.date.accessioned	2019-08-23T10:57:36Z
dc.date.available	2019-08-23T10:57:36Z
dc.date.issued	2018-03-24	en_US
dc.description.abstract	© 2018 Association for Computing Machinery. Sentiment classification on textual reviews refers to classifying textual reviews based on whether they are positive or negative. This research focuses on classifying movie reviews, and is benchmarked on the IMDB dataset, which consists of long movie reviews, using accuracy as the evaluation metric. In sentiment classification, each document must be mapped to a fixed length vector. Document embedding models map each document to a dense, low-dimensional vector in continuous vector space. This research proposes to train document embedding using cosine similarity instead of dot product. Experiments on the IMDB dataset show that accuracy is improved when using cosine similarity compared to using dot product, while using feature combination with Naïve-Bayes weighted bag of n-grams achieves a new state of the art accuracy of 97.4%.	en_US
dc.identifier.citation	ACM International Conference Proceeding Series. (2018), 24-28	en_US
dc.identifier.doi	10.1145/3206185.3206204	en_US
dc.identifier.other	2-s2.0-85057605424	en_US
dc.identifier.uri	https://repository.li.mahidol.ac.th/handle/123456789/45644
dc.rights	Mahidol University	en_US
dc.rights.holder	SCOPUS	en_US
dc.source.uri	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85057605424&origin=inward	en_US
dc.subject	Computer Science	en_US
dc.title	Similarity measurement for sentiment classification on textual reviews	en_US
dc.type	Conference Paper	en_US
dspace.entity.type	Publication
mu.datasource.scopus	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85057605424&origin=inward	en_US

Collections

Scopus 2018

	Office Hour: Monday-Friday 08.30-12.00 and 13.00-16.30 hrs.
	Phutthamonthon Sai 4 Rd. Salaya, Nakhon Pathom 73170, Thailand
	The office: +66 (2) 800 2680 ext.4306
	thipsuda.van@mahidol.ac.th
	https://repository.li.mahidol.ac.th

Publication: Similarity measurement for sentiment classification on textual reviews

Files

Collections

Publication:
Similarity measurement for sentiment classification on textual reviews