Publication:
Column-based partitioning for data in high dimensional space

dc.contributor.authorEkasit Kijsipongseen_US
dc.contributor.authorSudsanguan Ngamsuriyarojen_US
dc.contributor.otherMahidol Universityen_US
dc.date.accessioned2018-08-24T01:48:01Z
dc.date.available2018-08-24T01:48:01Z
dc.date.issued2007-12-01en_US
dc.description.abstractSeveral scientific applications such as 3D Jacobi Iteration [17] and LQCD [5] demand high computing power, and run on parallel systems. Such applications mostly operate on high dimensional data, and partitioning them into smaller units would help reduce their execution time considerably. Many algorithms such as CBP [2], Dissect [15], and Bisection [3] are proposed to find an optimal partitioning for two dimensional data. Simply extending such algorithms to handle higher dimensional data does not guarantee the maximum efficiency since the number of data dimensions must be taken into account. In addition, the communication cost among data in high dimensions is increased since data have high interaction to each other. This paper proposes a new algorithm called HyperCBP which is a general optimal column-based partitioning in high dimensional space. The algorithm divides high dimensional data into rectangle blocks of different sizes according to the computing power of each computing node, and minimizes the communication time used in transferring data among rectangles. We evaluate our algorithm using the new defined performance metric called Communication Saving Ratio (CSR). When compared with Dissect [15] and Bisection [3], the results show that HyperCBP gives a higher CSR than those two algorithms, and thus results in a better partitioning. © 2007 IEEE.en_US
dc.identifier.citationProceedings of the International Conference on Parallel Processing. (2007)en_US
dc.identifier.doi10.1109/ICPP.2007.27en_US
dc.identifier.issn01903918en_US
dc.identifier.other2-s2.0-47249131266en_US
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/24383
dc.rightsMahidol Universityen_US
dc.rights.holderSCOPUSen_US
dc.source.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=47249131266&origin=inwarden_US
dc.subjectComputer Scienceen_US
dc.subjectEngineeringen_US
dc.titleColumn-based partitioning for data in high dimensional spaceen_US
dc.typeConference Paperen_US
dspace.entity.typePublication
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=47249131266&origin=inwarden_US

Files

Collections