plda - A parallel C++ implementation of fast Gibbs sampling of Latent Dirichlet Allocation - Google Project Hosting
2013-03-18 16:46
639 查看
plda - A parallel C++ implementation of fast Gibbs sampling of Latent Dirichlet Allocation - Google Project Hosting
plda is a parallel C++ implementation of Latent Dirichlet Allocation (LDA) (1,2). We are expecting to present a highly optimized parallel implemention of the Gibbs sampling algorithm for the training/inference of LDA (3). The carefully designed architecture is expected to support extensions of this algorithm.
We will release an enhanced parallel implementation of LDA, named as PLDA+ (2), which can improve scalability of LDA by significantly reducing the unparallelizable communication bottleneck and achieve good load balancing.
If you wish to publish any work based on plda, please cite our paper as:
Zhiyuan Liu, Yuzhou Zhang, Edward Y. Chang, Maosong Sun, PLDA+: Parallel Latent Dirichlet Allocation with Data Placement and Pipeline Processing. ACM Transactions on Intelligent Systems and Technology, special issue on Large Scale Machine Learning. 2011. Software available at http://code.google.com/p/plda.
If you have any questions, please visit http://groups.google.com/group/plda
We will release an enhanced parallel implementation of LDA, named as PLDA+ (2), which can improve scalability of LDA by significantly reducing the unparallelizable communication bottleneck and achieve good load balancing.
If you wish to publish any work based on plda, please cite our paper as:
Zhiyuan Liu, Yuzhou Zhang, Edward Y. Chang, Maosong Sun, PLDA+: Parallel Latent Dirichlet Allocation with Data Placement and Pipeline Processing. ACM Transactions on Intelligent Systems and Technology, special issue on Large Scale Machine Learning. 2011. Software available at http://code.google.com/p/plda.
If you have any questions, please visit http://groups.google.com/group/plda
相关文章推荐
- Markov Chain - Monte Carlo method and Gibbs Sampling for Latent Dirichlet Allocation
- google-glog - Logging library for C++ - Google Project Hosting
- qdjango - QDjango, a Qt-based C++ web framework - Google Project Hosting
- google-glog - Logging library for C++ - Google Project Hosting
- darts-clone - A clone of the Darts (Double-ARray Trie System) - Google Project Hosting
- Fast implementation/approximation of pow() function in C/C++
- javamelody - monitoring of JavaEE applications - Google Project Hosting
- xrelayer - A lightweight HTTP proxy written in C++ - Google Project Hosting
- py-webkit-html-manipulator - Server side rendering, extraction and manipulation of HTML over HTTP. - Google Project Hosting
- What is a good explanation of Latent Dirichlet Allocation
- gfwinterceptor - A set of tools to get around internet censorship for iOS device - Google Project Hosting
- googletest - Google C++ Testing Framework - Google Project Hosting
- cpp-btree - C++ B-tree - Google Project Hosting
- spserver - SPServer is a high concurrency server framework library written on C++ - Google Project Hosting
- pykoala - A simple, small and fast web crawler - Google Project Hosting
- gperftools - Fast, multi-threaded malloc() and nifty performance analysis tools - Google Project Hosting
- imageclipper - A tool to crop images manually fast - Google Project Hosting
- darts-clone - A clone of the Darts (Double-ARray Trie System) - Google Project Hosting
- aranduka - A simple e-book manager and reader - Google Project Hosting
- aoapc-book - Official resources for the book series - Google Project Hosting