基因数据处理29之avocado运行snap-basic有问题
2016-05-27 23:08
417 查看
hadoop@Master:~/xubo/data/testTools/se$ avocado-submit /xubo/avocado/hs2.fq /xubo/avocado/hs38DH.fa /xubo/avocado/test20160527NUMhs2snap /home/hadoop/xubo/data/testTools/se/snap-basic.properties Using SPARK_SUBMIT=/home/hadoop/cloud/spark-1.5.2//bin/spark-submit Command body threw exception: java.lang.IllegalArgumentException: No input stage with name: /home/hadoop/xubo/data/testTools/se/hs2.fq Exception in thread "main" java.lang.IllegalArgumentException: No input stage with name: /home/hadoop/xubo/data/testTools/se/hs2.fq at org.bdgenomics.avocado.input.Input$.apply(Input.scala:57) at org.bdgenomics.avocado.cli.Avocado$$anonfun$4.apply(Avocado.scala:208) at org.bdgenomics.avocado.cli.Avocado$$anonfun$4.apply(Avocado.scala:208) at org.apache.spark.rdd.Timer.time(Timer.scala:57) at org.bdgenomics.avocado.cli.Avocado.run(Avocado.scala:207) at org.bdgenomics.utils.cli.BDGSparkCommand$class.run(BDGCommand.scala:54) at org.bdgenomics.avocado.cli.Avocado.run(Avocado.scala:82) at org.bdgenomics.utils.cli.BDGCommandCompanion$class.main(BDGCommand.scala:32) at org.bdgenomics.avocado.cli.Avocado$.main(Avocado.scala:52) at org.bdgenomics.avocado.cli.Avocado.main(Avocado.scala) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:606) at org.apache.spark.deploy.SparkSubmit$.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:674) at org.apache.spark.deploy.SparkSubmit$.doRunMain$1(SparkSubmit.scala:180) at org.apache.spark.deploy.SparkSubmit$.submit(SparkSubmit.scala:205) at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:120) at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
问题2:
hadoop@Master:~/xubo/data/testTools$ avocado-submit /xubo/avocado/hs2.fq /xubo/avocado/hs38DH.fa /xubo/avocado/test20160527NUMhs2snap /home/hadoop/xubo/data/testTools/se/snap-basic.propertiesUsing SPARK_SUBMIT=/home/hadoop/cloud/spark-1.5.2//bin/spark-submit Command body threw exception: java.lang.IllegalArgumentException: No input stage with name: SnapInputStage Exception in thread "main" java.lang.IllegalArgumentException: No input stage with name: SnapInputStage at org.bdgenomics.avocado.input.Input$.apply(Input.scala:57) at org.bdgenomics.avocado.cli.Avocado$$anonfun$4.apply(Avocado.scala:208) at org.bdgenomics.avocado.cli.Avocado$$anonfun$4.apply(Avocado.scala:208) at org.apache.spark.rdd.Timer.time(Timer.scala:57) at org.bdgenomics.avocado.cli.Avocado.run(Avocado.scala:207) at org.bdgenomics.utils.cli.BDGSparkCommand$class.run(BDGCommand.scala:54) at org.bdgenomics.avocado.cli.Avocado.run(Avocado.scala:82) at org.bdgenomics.utils.cli.BDGCommandCompanion$class.main(BDGCommand.scala:32) at org.bdgenomics.avocado.cli.Avocado$.main(Avocado.scala:52) at org.bdgenomics.avocado.cli.Avocado.main(Avocado.scala) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:606) at org.apache.spark.deploy.SparkSubmit$.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:674) at org.apache.spark.deploy.SparkSubmit$.doRunMain$1(SparkSubmit.scala:180) at org.apache.spark.deploy.SparkSubmit$.submit(SparkSubmit.scala:205) at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:120) at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
配置:
问题1的:
hadoop@Master:~/xubo/data/testTools/se$ vi snap-basic.properties { SnapInputStage = { indexDirectory = /Users/fnothaft/IdeaProjects/avocado2/avocado/chrM snapPath = /Users/fnothaft/IdeaProjects/snap/snap numMachines = 1 coresPerMachine = 4 } readExplorer = { } biallelicGenotyper = { } defPart = { } #inputStage = SnapInputStage inputStage = /home/hadoop/xubo/data/testTools/se/hs2.fq preprocessorNames = ( ); preprocessorAlgorithms = ( ); explorerName = readExplorer; explorerAlgorithm = ReadExplorer; genotyperName = biallelicGenotyper; genotyperAlgorithm = BiallelicGenotyper; postprocessorNames = ( ); postprocessorAlgorithms = ( ); } "snap-basic.properties" 32L, 640C 1,1 All
问题2的:
hadoop@Master:~/xubo/data/testTools$ cat snap-basic.properties { SnapInputStage = { indexDirectory = /home/xubo/xubo/data/testTools snapPath = /home/xubo/xubo/tools/snap numMachines = 1 coresPerMachine = 1 } readExplorer = { } biallelicGenotyper = { } defPart = { } inputStage = SnapInputStage preprocessorNames = ( ); preprocessorAlgorithms = ( ); explorerName = readExplorer; explorerAlgorithm = ReadExplorer; genotyperName = biallelicGenotyper; genotyperAlgorithm = BiallelicGenotyper; postprocessorNames = ( ); postprocessorAlgorithms = ( ); }
问题1是由于修改了snapinputstage
问题2定位:
/** * Licensed to Big Data Genomics (BDG) under one * or more contributor license agreements. See the NOTICE file * distributed with this work for additional information * regarding copyright ownership. The BDG licenses this file * to you under the Apache License, Version 2.0 (the * "License"); you may not use this file except in compliance * with the License. You may obtain a copy of the License at * * http://www.apache.org/licenses/LICENSE-2.0 * * Unless required by applicable law or agreed to in writing, software * distributed under the License is distributed on an "AS IS" BASIS, * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. * See the License for the specific language governing permissions and * limitations under the License. */ package org.bdgenomics.avocado.input import org.bdgenomics.formats.avro.{ AlignmentRecord, NucleotideContigFragment } import org.apache.commons.configuration.HierarchicalConfiguration import org.apache.spark.SparkContext import org.apache.spark.rdd.RDD object Input { // all our input stages val stages = List(AlignedReadsInputStage) /** * Builds the input stage that corresponds to the given stage name, and returns the read data * that the stage provides. The input stage name to use is collected from the provided * configuration. * * @param sc A SparkContext * @param inputPath Path to input read data. * @param reference * @param config Configuration file containing the necessary data. * @return Returns an RDD of read data. */ def apply(sc: SparkContext, inputPath: String, reference: RDD[NucleotideContigFragment], config: HierarchicalConfiguration): RDD[AlignmentRecord] = { // get input stage to use; if none is specified, default to input being aligned reads val stageName: String = config.getString("inputStage", "AlignedReads") val stage = stages.find(_.stageName == stageName) stage match { case Some(s: InputStage) => { val stageConfig = config.configurationAt(stageName) s.apply(sc, inputPath, stageConfig, reference) } case None => { throw new IllegalArgumentException("No input stage with name: " + stageName) } } } }
原因在分析
相关文章推荐
- 基因数据处理28之avocado运行
- 基因数据处理72之GATK安装成功
- 基因数据处理33之Avocado运行记录(参考基因组)
- 基因数据处理32之Avocado运行记录(人造数据集)
- Adam学习25之读取sam生成的alignmentRecord含recordGroupDictionary
- 基因数据处理31之avocado运行avocado-cli中的avocado问题3-变异识别找不到RecordGroupSample(null)
- 基因数据处理30之avocado运行avocado-cli中的avocado问题1和2
- writing avocado tests(写avocado测试用例)
- 30岁到35岁:为你的生命多积累一些厚度
- Linux环境下Python的安装过程
- POJ_1269_Intersecting_Lines_(计算几何基础)
- 笔记本就是力量
- IOS学习路线
- 1008. 数组元素循环右移问题 (20)
- three.js笔记
- python 练习 27
- PHP魔术常量:__LINE__ __FILE__ __CLASS__ ```
- setTimeout() 方法的运用
- setTimeout() 方法的运用
- setTimeout() 方法的运用