HDFS integration with Openstack Swift
2015-10-18 11:07
417 查看
The overall model in HDFS data flow with Openstack Swift integration.
The works of HDFS and Openstack Swift have been done, refer to HADOOP-8545. It introduces an new concept in big data storage. And benefit from this marriage,Hadoop cluster would be more agile and easier to be managed.
However, there are still some debates about this new model. One of the most controversial issues are the data
locality and performance, and there is an report to address this issue, refer to
AMPLab
analysis. The second one is the security and how to manage user's privacy. To date, the new model doesn't have
any mechanism of permission control, refer to
this warning.
The user specifies the connection URI, such as
hdfs://namenode:8020or
swift://store.example.org
The HDFS client would initialize the corresponding implements for the class FileSystem.
The class FileSystem defines the interfaces of file operations.
Next we can drill down and take "file read" as an example to explain the data flow.
Anatomy of a File Read in case of the Swift backend
Before any operation of HDFS, the application should initialize the class FileSystem by following, for example in java
String uri = "swift://storage.example.org"; FileSystem fs = FileSystem(uri, new Configuration());
Because the scheme is specified as "swift" by user, the instance of FileSystem would be SwiftNativeFileSystem.
相关文章推荐
- 详解HDFS Short Circuit Local Reads
- Hadoop_2.1.0 MapReduce序列图
- 使用Hadoop搭建现代电信企业架构
- 单机版搭建Hadoop环境图文教程详解
- hadoop常见错误以及处理方法详解
- hadoop 单机安装配置教程
- hadoop的hdfs文件操作实现上传文件到hdfs
- hadoop实现grep示例分享
- Apache Hadoop版本详解
- linux下搭建hadoop环境步骤分享
- hadoop client与datanode的通信协议分析
- hadoop中一些常用的命令介绍
- Hadoop单机版和全分布式(集群)安装
- 用PHP和Shell写Hadoop的MapReduce程序
- hadoop map-reduce中的文件并发操作
- Hadoop1.2中配置伪分布式的实例
- java结合HADOOP集群文件上传下载
- 用python + hadoop streaming 分布式编程(一) -- 原理介绍,样例程序与本地调试
- Hadoop安装感悟
- hadoop安装lzo