您的位置:首页 > 数据库

重新编译spark源码,使CDH支持spark sql

2016-12-28 15:12 337 查看
1、编辑$MAVEN_HOME/bin/mvn文件,增加配置:

MAVEN_OPTS="-Xmx2g -XX:MaxPermSize=512M -XX:ReservedCodeCacheSize=512m"


2、执行mvn命令:

mvn -Pyarn -PHadoop-2.6 -Dhadoop.version=2.6.0-cdh5.8.3 -Dscala-2.10.5 -Phive -Phive-thriftserver -DskipTests install


编译成功截图:



3、复制jar包:

cp spark-1.6.0/assembly/target/scala-2.10/spark-assembly-1.6.0-hadoop2.6.0-cdh5.8.3.jar /opt/cloudera/parcels/CDH-5.8.3-1.cdh5.8.3.p0.2/jars


4、修改jar包软链接(/opt/cloudera/parcels/CDH/lib/spark/lib):

ln -s ../../../jars/spark-assembly-1.6.0-hadoop2.6.0-cdh5.8.3.jar spark-assembly-1.6.0-cdh5.8.3-hadoop2.6.0-cdh5.8.3.jar
ln -s spark-assembly-1.6.0-cdh5.8.3-hadoop2.6.0-cdh5.8.3.jar spark-assembly.jar


5、复制jar包到hdfs:

hdfs dfs -put /opt/cloudera/parcels/CDH/jars/spark-assembly-1.6.0-hadoop2.6.0-cdh5.8.3.jar /user/spark/lib


查看jar包:

[root@cdh1 lib]# hdfs dfs -ls /user/spark/lib
Found 1 items
-rwxr-xr-x   3 hdfs spark  192854141 2016-12-28 13:54 /user/spark/lib/spark-assembly-1.6.0-hadoop2.6.0-cdh5.8.3.jar


6、复制spark-sql文件:

cp spark-1.6.0/bin/spark-sql /opt/cloudera/parcels/CDH/lib/spark/bin


7、配置CM:



内容来自用户分享和网络整理,不保证内容的准确性,如有侵权内容,可联系管理员处理 点击这里给我发消息
标签: