Publication:
Detecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network model

dc.contributor.authorIqra Safderen_US
dc.contributor.authorJunaid Sarfrazen_US
dc.contributor.authorSaeed Ul Hassanen_US
dc.contributor.authorMohsen Alien_US
dc.contributor.authorSuppawong Tuaroben_US
dc.contributor.otherInformation Technology Universityen_US
dc.contributor.otherMahidol Universityen_US
dc.date.accessioned2018-12-21T07:23:38Z
dc.date.accessioned2019-03-14T08:03:28Z
dc.date.available2018-12-21T07:23:38Z
dc.date.available2019-03-14T08:03:28Z
dc.date.issued2017-01-01en_US
dc.description.abstract© 2017, Springer International Publishing AG. We are observing an exponential growth of scientific literature since the last few decades. Tapping on the advancement of web-enabled tools and technologies, millions of articles are stored and indexed in the digital libraries. Among this archived scientific literature, thousands of newly emerging algorithms, mostly illustrated with pseudo-codes, are published every year in the area of Computer Science and other related computational fields. Previously, an array of techniques has been deployed to retrieve information related to these algorithms by indexing their pseudo-codes and metadata from a vast pool of scholarly documents. Unfortunately, existing search engines are only limited to indexing a textual description of each pseudo-code and are unable to provide simple algorithm-specific information such as run-time complexity, performance evaluation (such as precision, recall, or f-measure), and the size of the dataset it can effectively process, etc. In this paper, we propose a set of algorithms that extract information pertaining to the performance of algorithm(s) presented and/or discussed in the research article. Specifically, sentences in the paper that convey information about the efficiency of the corresponding algorithm are identified and extracted, using the Recurrent Convolutional Neural Network (RCNN) model. To evaluate the performance of our algorithm, we have collected a dataset of 258 manually annotated scholarly documents by four experts, originally downloaded from CiteseerX. Our proposed RCNN based model achieves encouraging 77.65% f-measure and 76.35% accuracy.en_US
dc.identifier.citationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol.10647 LNCS, (2017), 30-40en_US
dc.identifier.doi10.1007/978-3-319-70232-2_3en_US
dc.identifier.issn16113349en_US
dc.identifier.issn03029743en_US
dc.identifier.other2-s2.0-85034018669en_US
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/42427
dc.rightsMahidol Universityen_US
dc.rights.holderSCOPUSen_US
dc.source.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85034018669&origin=inwarden_US
dc.subjectComputer Scienceen_US
dc.titleDetecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network modelen_US
dc.typeConference Paperen_US
dspace.entity.typePublication
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85034018669&origin=inwarden_US

Files

Collections