Publication: Parallel Streaming Random Sampling
dc.contributor.author | Kanat Tangwongsan | en_US |
dc.contributor.author | Srikanta Tirthapura | en_US |
dc.contributor.other | Mahidol University | en_US |
dc.contributor.other | Iowa State University | en_US |
dc.date.accessioned | 2020-01-27T08:23:25Z | |
dc.date.available | 2020-01-27T08:23:25Z | |
dc.date.issued | 2019-01-01 | en_US |
dc.description.abstract | © 2019, Springer Nature Switzerland AG. This paper investigates parallel random sampling from a potentially-unending data stream whose elements are revealed in a series of element sequences (minibatches). While sampling from a stream was extensively studied sequentially, not much has been explored in the parallel context, with prior parallel random-sampling algorithms focusing on the static batch model. We present parallel algorithms for minibatch-stream sampling in two settings: (1) sliding window, which draws samples from a prespecified number of most-recently observed elements, and (2) infinite window, which draws samples from all the elements received. Our algorithms are computationally and memory efficient: their work matches the fastest sequential counterpart, their parallel depth is small (polylogarithmic), and their memory usage matches the best known. | en_US |
dc.identifier.citation | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol.11725 LNCS, (2019), 451-465 | en_US |
dc.identifier.doi | 10.1007/978-3-030-29400-7_32 | en_US |
dc.identifier.issn | 16113349 | en_US |
dc.identifier.issn | 03029743 | en_US |
dc.identifier.other | 2-s2.0-85077127039 | en_US |
dc.identifier.uri | https://repository.li.mahidol.ac.th/handle/20.500.14594/50677 | |
dc.rights | Mahidol University | en_US |
dc.rights.holder | SCOPUS | en_US |
dc.source.uri | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85077127039&origin=inward | en_US |
dc.subject | Computer Science | en_US |
dc.subject | Mathematics | en_US |
dc.title | Parallel Streaming Random Sampling | en_US |
dc.type | Conference Paper | en_US |
dspace.entity.type | Publication | |
mu.datasource.scopus | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85077127039&origin=inward | en_US |