Publication:
A graph-based approach to topic clustering of tourist attraction reviews

dc.contributor.authorNuttha Sirilertworakulen_US
dc.contributor.authorBoonsit Yimwadsanaen_US
dc.contributor.otherMahidol Universityen_US
dc.date.accessioned2020-01-27T08:22:56Z
dc.date.available2020-01-27T08:22:56Z
dc.date.issued2019-01-01en_US
dc.description.abstract© Springer Nature Switzerland AG 2019. A large volume of user reviews on tourist attractions can prohibit travel businesses from acquiring overall consumers’ expectations and consumers themselves from seeing the big picture and making thoughtful decisions on trip planning. Summarization of the reviews allows both parties to catch the main themes and underlying tones of the attractions. In this paper, we address the task of topic clustering, by applying a graph-based approach to group the reviews into clusters. To interpret the resulting review clusters, WordNet and Inverse Document Frequency (IDF) are utilized to extract keywords from each cluster which represents the topic. We evaluate the graph-based clustering approach against gold standard data annotated by human and the results are compared against Latent Dirichlet Allocation (LDA), a widely used algorithm for topic discovery. The approach is shown to be competitive to LDA in terms of clustering user reviews on tourist attractions. The graph-based approach, unlike LDA which requires the number of clusters as an input, can dynamically clusters the reviews into groups, revealing the number of clusters.en_US
dc.identifier.citationCommunications in Computer and Information Science. Vol.1078 CCIS, (2019), 343-354en_US
dc.identifier.doi10.1007/978-3-030-30275-7_26en_US
dc.identifier.issn18650937en_US
dc.identifier.issn18650929en_US
dc.identifier.other2-s2.0-85076834505en_US
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/50670
dc.rightsMahidol Universityen_US
dc.rights.holderSCOPUSen_US
dc.source.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85076834505&origin=inwarden_US
dc.subjectComputer Scienceen_US
dc.subjectMathematicsen_US
dc.titleA graph-based approach to topic clustering of tourist attraction reviewsen_US
dc.typeConference Paperen_US
dspace.entity.typePublication
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85076834505&origin=inwarden_US

Files

Collections