Detecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network model

Iqra Safder; Junaid Sarfraz; Saeed Ul Hassan; Mohsen Ali; Suppawong Tuarob

Publication:
Detecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network model

dc.contributor.author	Iqra Safder	en_US
dc.contributor.author	Junaid Sarfraz	en_US
dc.contributor.author	Saeed Ul Hassan	en_US
dc.contributor.author	Mohsen Ali	en_US
dc.contributor.author	Suppawong Tuarob	en_US
dc.contributor.other	Information Technology University	en_US
dc.contributor.other	Mahidol University	en_US
dc.date.accessioned	2018-12-21T07:23:38Z
dc.date.accessioned	2019-03-14T08:03:28Z
dc.date.available	2018-12-21T07:23:38Z
dc.date.available	2019-03-14T08:03:28Z
dc.date.issued	2017-01-01	en_US
dc.description.abstract	© 2017, Springer International Publishing AG. We are observing an exponential growth of scientific literature since the last few decades. Tapping on the advancement of web-enabled tools and technologies, millions of articles are stored and indexed in the digital libraries. Among this archived scientific literature, thousands of newly emerging algorithms, mostly illustrated with pseudo-codes, are published every year in the area of Computer Science and other related computational fields. Previously, an array of techniques has been deployed to retrieve information related to these algorithms by indexing their pseudo-codes and metadata from a vast pool of scholarly documents. Unfortunately, existing search engines are only limited to indexing a textual description of each pseudo-code and are unable to provide simple algorithm-specific information such as run-time complexity, performance evaluation (such as precision, recall, or f-measure), and the size of the dataset it can effectively process, etc. In this paper, we propose a set of algorithms that extract information pertaining to the performance of algorithm(s) presented and/or discussed in the research article. Specifically, sentences in the paper that convey information about the efficiency of the corresponding algorithm are identified and extracted, using the Recurrent Convolutional Neural Network (RCNN) model. To evaluate the performance of our algorithm, we have collected a dataset of 258 manually annotated scholarly documents by four experts, originally downloaded from CiteseerX. Our proposed RCNN based model achieves encouraging 77.65% f-measure and 76.35% accuracy.	en_US
dc.identifier.citation	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol.10647 LNCS, (2017), 30-40	en_US
dc.identifier.doi	10.1007/978-3-319-70232-2_3	en_US
dc.identifier.issn	16113349	en_US
dc.identifier.issn	03029743	en_US
dc.identifier.other	2-s2.0-85034018669	en_US
dc.identifier.uri	https://repository.li.mahidol.ac.th/handle/123456789/42427
dc.rights	Mahidol University	en_US
dc.rights.holder	SCOPUS	en_US
dc.source.uri	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85034018669&origin=inward	en_US
dc.subject	Computer Science	en_US
dc.title	Detecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network model	en_US
dc.type	Conference Paper	en_US
dspace.entity.type	Publication
mu.datasource.scopus	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85034018669&origin=inward	en_US

Collections

Scopus 2016-2017

	Office Hour: Monday-Friday 08.30-12.00 and 13.00-16.30 hrs.
	Phutthamonthon Sai 4 Rd. Salaya, Nakhon Pathom 73170, Thailand
	The office: +66 (2) 800 2680 ext.4306
	thipsuda.van@mahidol.ac.th
	https://repository.li.mahidol.ac.th

Publication: Detecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network model

Files

Collections

Publication:
Detecting target text related to algorithmic efficiency in scholarly big data using recurrent convolutional neural network model