Publication:
Identifying Important Citations Using Contextual Information from Full Text

dc.contributor.authorSaeed Ul Hassanen_US
dc.contributor.authorAnam Akramen_US
dc.contributor.authorPeter Haddawyen_US
dc.contributor.otherInformation Technology Universityen_US
dc.contributor.otherMahidol Universityen_US
dc.date.accessioned2018-12-21T07:35:44Z
dc.date.accessioned2019-03-14T08:03:37Z
dc.date.available2018-12-21T07:35:44Z
dc.date.available2019-03-14T08:03:37Z
dc.date.issued2017-07-25en_US
dc.description.abstract© 2017 IEEE. In this paper we address the problem of classifying cited work into important and non-important to the developments presented in a research publication. This task is vital for the algorithmic techniques that detect and follow emerging research topics and to qualitatively measure the impact of publications in increasingly growing scholarly big data. We consider cited work as important to a publication if that work is used or extended in some way. If a reference is cited as background work or for the purpose of comparing results, the cited work is considered to be non-important. By employing five classification techniques (Support Vector Machine, Naïve Bayes, Decision Tree, K-Nearest Neighbors and Random Forest) on an annotated dataset of 465 citations, we explore the effectiveness of eight previously published features and six novel features (including context based, cue words based and textual based). Within this set, our new features are among the best performing. Using the Random Forest classifier we achieve an overall classification accuracy of 0.91 AUC.en_US
dc.identifier.citationProceedings of the ACM/IEEE Joint Conference on Digital Libraries. (2017)en_US
dc.identifier.doi10.1109/JCDL.2017.7991558en_US
dc.identifier.issn15525996en_US
dc.identifier.other2-s2.0-85027975200en_US
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/42595
dc.rightsMahidol Universityen_US
dc.rights.holderSCOPUSen_US
dc.source.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85027975200&origin=inwarden_US
dc.subjectEngineeringen_US
dc.titleIdentifying Important Citations Using Contextual Information from Full Texten_US
dc.typeConference Paperen_US
dspace.entity.typePublication
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85027975200&origin=inwarden_US

Files

Collections