MongoDB vs Hadoop
2013-07-18 09:42
106 查看
MongoDB has its own MapReduce framework and Hadoop has HBase. HBase is a scalable database similar to MongoDB.
The main flaw in Hadoop is that it has a single point of failure, namely the “NameNode”. If the NameNode goes down, the entire system becomes unavailable.
MongoDB has no such single point of failure. If at any point in time, one of the primaries, config-servers, or nodes goes down, there is a replicated resource which can take over the responsibility of the system automatically
MongoDB supports rich queries like traditional RDBMS systems and is written in a standard JavaScript shell.
Hadoop has two different components for writing MapReduce (MR) code, Pig and Hive. Pig is a scripting language (similar to python, perl) that generates MR code, while Hive is a more SQL-like language. Hive is mainly used to structure the data and provides a
rich set of queries.
Data has to be in JSON or CSV format to be imported into MongoDB. Hadoop, on the other hand can accept data in almost any format.
Hadoop structures data using Hive, but can handle unstructured data easily using Pig. With the help of Apache Sqoop, Pig can even translate between RDBMS and Hadoop.
MongoDB (written in C++) manages memory more cost-efficiently than Hadoop’s HBase (written in Java).
Both systems also take a different approach to space utilization. MongoDB pre-allocates space for storage, improving performance, but wasting space. Hadoop optimizes space usage, but ends up with lower write performance by comparison with MongoDB.
Typically, MongoDB is used with systems less than approximately 5 TB of data. Hadoop, on the other hand, has been used for systems larger than 100 TB, including systems containing petabytes of data.
http://osintegrators.com/MongoAndHadoop
The main flaw in Hadoop is that it has a single point of failure, namely the “NameNode”. If the NameNode goes down, the entire system becomes unavailable.
MongoDB has no such single point of failure. If at any point in time, one of the primaries, config-servers, or nodes goes down, there is a replicated resource which can take over the responsibility of the system automatically
MongoDB supports rich queries like traditional RDBMS systems and is written in a standard JavaScript shell.
Hadoop has two different components for writing MapReduce (MR) code, Pig and Hive. Pig is a scripting language (similar to python, perl) that generates MR code, while Hive is a more SQL-like language. Hive is mainly used to structure the data and provides a
rich set of queries.
Data has to be in JSON or CSV format to be imported into MongoDB. Hadoop, on the other hand can accept data in almost any format.
Hadoop structures data using Hive, but can handle unstructured data easily using Pig. With the help of Apache Sqoop, Pig can even translate between RDBMS and Hadoop.
MongoDB (written in C++) manages memory more cost-efficiently than Hadoop’s HBase (written in Java).
Both systems also take a different approach to space utilization. MongoDB pre-allocates space for storage, improving performance, but wasting space. Hadoop optimizes space usage, but ends up with lower write performance by comparison with MongoDB.
Typically, MongoDB is used with systems less than approximately 5 TB of data. Hadoop, on the other hand, has been used for systems larger than 100 TB, including systems containing petabytes of data.
http://osintegrators.com/MongoAndHadoop
相关文章推荐
- MongoDB VS hadoop
- Hbase-MongoDB-MemCache-Redis-PostgreSQL-Hadoop/Spark如何选择
- 第四章 查询语句:MongoDb VS MySql 4.1
- MongoDB vs Cassandra
- MongoDB与Hadoop结合之使用MapReduce官方实例
- MongoDB vs TokuMX 性能测试
- 大数据架构开发 挖掘分析 Hadoop HBase Hive Storm Spark Java Flume ZooKeeper Kafka Redis MongoDB 机器学习 云计算 视频教程
- 大数据架构开发 挖掘分析 Hadoop HBase Hive Storm Spark Flume ZooKeeper Kafka Redis MongoDB Java 机器学习 云计算 视频教程
- hadoop学习之MongoDB增删改查Java实现
- Cassandra vs MongoDB vs CouchDB vs Redis vs Riak vs HBase vs Membase vs Neo4j comparison
- hadoop+spark+mongodb+mysql+c#
- MongoDB vs Redis vs Tokyo Tyrant
- Choosing Between ElasticSearch, MongoDB & Hadoop
- Cassandra vs MongoDB vs CouchDB vs Redis vs Riak vs HBase comparison :: KKovacs
- 第四章 查询语句:MongoDb VS MySql 4.2
- hadoop vs spark
- 设计实例对比:MySQL vs MongoDB
- NoSQL 比较 - Cassandra vs MongoDB vs Redis vs ElasticSearch vs HBase
- Mongodb VS Hbase