您的位置:首页 > 大数据

超大数据之GPU聚类 (10亿量级)

2009-12-09 09:52 183 查看
先转一篇以前的文章.

 

Clustering billions of data points using GPUs

"In this paper, we report our research on using GPUs to accelerate
clustering of very large data sets, which are common in today's real
world applications. While many published works have shown that GPUs can
be used to accelerate various general purpose applications with
respectable performance gains, few attempts have been made to tackle
very large problems. Our goal here is to investigate if GPUs can be
useful accelerators even with very large data sets that cannot fit into
GPU's onboard memory.

 

Using a popular clustering algorithm,
K-Means, as an example, our results have been very positive. On a data
set with a billion data points, our GPU-accelerated implementation
achieved an order of magnitude performance gain over a highly optimized
CPU-only version running on 8 cores, and more than two orders of
magnitude gain over a popular benchmark, MineBench, running on a single
core."

 
http://portal.acm.org/citation.cfm?id=1531668
 

 

rw
内容来自用户分享和网络整理,不保证内容的准确性,如有侵权内容,可联系管理员处理 点击这里给我发消息