Publication:
Parallel Streaming Random Sampling

dc.contributor.authorKanat Tangwongsanen_US
dc.contributor.authorSrikanta Tirthapuraen_US
dc.contributor.otherMahidol Universityen_US
dc.contributor.otherIowa State Universityen_US
dc.date.accessioned2020-01-27T08:23:25Z
dc.date.available2020-01-27T08:23:25Z
dc.date.issued2019-01-01en_US
dc.description.abstract© 2019, Springer Nature Switzerland AG. This paper investigates parallel random sampling from a potentially-unending data stream whose elements are revealed in a series of element sequences (minibatches). While sampling from a stream was extensively studied sequentially, not much has been explored in the parallel context, with prior parallel random-sampling algorithms focusing on the static batch model. We present parallel algorithms for minibatch-stream sampling in two settings: (1) sliding window, which draws samples from a prespecified number of most-recently observed elements, and (2) infinite window, which draws samples from all the elements received. Our algorithms are computationally and memory efficient: their work matches the fastest sequential counterpart, their parallel depth is small (polylogarithmic), and their memory usage matches the best known.en_US
dc.identifier.citationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol.11725 LNCS, (2019), 451-465en_US
dc.identifier.doi10.1007/978-3-030-29400-7_32en_US
dc.identifier.issn16113349en_US
dc.identifier.issn03029743en_US
dc.identifier.other2-s2.0-85077127039en_US
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/50677
dc.rightsMahidol Universityen_US
dc.rights.holderSCOPUSen_US
dc.source.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85077127039&origin=inwarden_US
dc.subjectComputer Scienceen_US
dc.subjectMathematicsen_US
dc.titleParallel Streaming Random Samplingen_US
dc.typeConference Paperen_US
dspace.entity.typePublication
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85077127039&origin=inwarden_US

Files

Collections