Error "[mem_sam_pe] paired reads have different names" when generating a SAM file during mapping
Method 1:
Fix it with the following command:
bbrename.sh in1=read1.fq in2=read2.fq out1=renamed1.fq out2=renamed2.fq
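bbrename.sh ships with the BBMap/BBTools package; search online for a download link.
For multiple fq files, the following command can be used:
while read line; do nohup bbrename.sh in=${line}_R1.fq in2=${line}_R2.fq out=${line}_R1.sh.fq out2=${line}_R2.sh.fq & done < name.txt
Here name.txt is a file containing the sample names, one per line.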
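The loop above starts one background job per sample, all at the same time, which may be too much for a machine with limited memory when name.txt is long. A minimal sketch of one way to cap the number of simultaneous jobs follows. The helper names run_batched and rename_one are hypothetical (they are not part of BBTools), and the batch size of 4 in the usage comment is only an illustrative assumption.

```shell
#!/bin/bash
# Hypothetical helper (not part of BBTools): read sample names from stdin and
# run "$@ <sample>" for each one, with at most $1 jobs in the background at once.
run_batched() {
    local max_jobs=$1
    shift
    local count=0
    local sample
    while read -r sample; do
        # Skip blank lines in the sample list.
        [ -z "$sample" ] && continue
        # Detach the job's stdin so it cannot consume the sample list.
        "$@" "$sample" < /dev/null &
        count=$((count + 1))
        if [ "$count" -ge "$max_jobs" ]; then
            wait    # block until the current batch has finished
            count=0
        fi
    done
    wait            # wait for the final, possibly partial, batch
}

# Hypothetical wrapper around the bbrename.sh call used in the loop above.
rename_one() {
    bbrename.sh in="$1_R1.fq" in2="$1_R2.fq" out="$1_R1.sh.fq" out2="$1_R2.sh.fq"
}

# Usage (assumes name.txt lists one sample name per line):
#   run_batched 4 rename_one < name.txt
```

Batching with wait is the simplest approach; it waits for the slowest job in each batch before starting the next, so a tool such as xargs -P or GNU parallel may keep the machine busier.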
Method 2:
Use repair.sh to fix the pairing:
while read line; do nohup repair.sh in=${line}_R1.fastq in2=${line}_R2.fastq out=${line}_R1.sh.fastq out2=${line}_R2.sh.fastq & done < name.txt
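Before rerunning the mapping, it can help to confirm that the two files of a pair now list the same read names in the same order. The sketch below is a rough check under several assumptions: the files are uncompressed FASTQ with exactly four lines per record, mate suffixes look like /1 and /2, and anything after the first whitespace in the header is a comment. The function names fastq_names and check_pair_names are hypothetical and not part of BBTools or bwa; the comparison only approximates what the aligner checks.

```shell
#!/bin/bash
# Hypothetical helper: print the read names of a FASTQ file, one per line,
# dropping the leading '@', any trailing /1 or /2, and any header comment.
fastq_names() {
    awk 'NR % 4 == 1 {
        name = $1                  # header text up to the first whitespace
        sub(/^@/, "", name)        # remove the leading @
        sub(/\/[12]$/, "", name)   # remove a trailing /1 or /2
        print name
    }' "$1"
}

# Hypothetical helper: print OK if both files list the same read names in the
# same order, MISMATCH otherwise. All names are held in memory, so this is
# meant for a quick check rather than for very large files.
check_pair_names() {
    if [ "$(fastq_names "$1")" = "$(fastq_names "$2")" ]; then
        echo "OK"
    else
        echo "MISMATCH"
    fi
}

# Usage (file names follow the output pattern of the loop above):
#   check_pair_names sample_R1.sh.fastq sample_R2.sh.fastq
```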
The repair.sh script:
#!/bin/bash
#repair in=<infile> out=<outfile>

usage(){
echo "
Written by Brian Bushnell
Last modified November 9, 2016

Description:  Re-pairs reads that became disordered or had some mates eliminated.

Usage:  repair.sh in=<input file> out=<pair output> outs=<singleton output>

Input may be fasta, fastq, or sam, compressed or uncompressed.

Parameters:
in=<file>       The 'in=' flag is needed if the input file is not the first parameter.  'in=stdin' will pipe from standard in.
in2=<file>      Use this if 2nd read of pairs are in a different file.
out=<file>      The 'out=' flag is needed if the output file is not the second parameter.  'out=stdout' will pipe to standard out.
out2=<file>     Use this to write 2nd read of pairs to a different file.
outs=<file>     (outsingle) Write singleton reads here.
overwrite=t     (ow) Set to false to force the program to abort rather than overwrite an existing file.
showspeed=t     (ss) Set to 'f' to suppress display of processing speed.
ziplevel=2      (zl) Set to 1 (lowest) through 9 (max) to change compression level; lower compression is faster.
fint=f          (fixinterleaving) Fixes corrupted interleaved files using read names.  Only use on files with broken interleaving - correctly interleaved files from which some reads were removed.
repair=t        (rp) Fixes arbitrarily corrupted paired reads by using read names.  Uses much more memory than 'fint' mode.
ain=f           (allowidenticalnames) When detecting pair names, allows identical names, instead of requiring /1 and /2 or 1: and 2:
monitor=f       Kill this process if it crashes.  monitor=600,0.01 would kill after 600 seconds under 1% usage.

Java Parameters:
-Xmx            This will be passed to Java to set memory usage, overriding the program's automatic memory detection.
                -Xmx20g will specify 20 gigs of RAM, and -Xmx200m will specify 200 megs.  The max is typically 85% of physical memory.

Please contact Brian Bushnell at bbushnell@lbl.gov if you encounter any problems.
"
}

pushd . > /dev/null
DIR="${BASH_SOURCE[0]}"
while [ -h "$DIR" ]; do
  cd "$(dirname "$DIR")"
  DIR="$(readlink "$(basename "$DIR")")"
done
cd "$(dirname "$DIR")"
DIR="$(pwd)/"
popd > /dev/null

#DIR="$( cd "$( dirname "${BASH_SOURCE[0]}" )" && pwd )/"
CP="$DIR""current/"

z="-Xmx4g"
z2="-Xms4g"
EA="-ea"
set=0

if [ -z "$1" ] || [[ $1 == -h ]] || [[ $1 == --help ]]; then
    usage
    exit
fi

calcXmx () {
    source "$DIR""/calcmem.sh"
    parseXmx "$@"
    if [[ $set == 1 ]]; then
        return
    fi
    freeRam 4000m 84
    z="-Xmx${RAM}m"
    z2="-Xms${RAM}m"
}
calcXmx "$@"

repair() {
    if [[ $NERSC_HOST == genepool ]]; then
        module unload oracle-jdk
        module load oracle-jdk/1.8_64bit
        module load pigz
    fi
    local CMD="java $EA $z -cp $CP jgi.SplitPairsAndSingles rp $@"
    echo $CMD >&2
    eval $CMD
}

repair "$@"
Script source: https://anaconda.org/bioconda/bbmap
This article is from cnblogs (博客园). Author: 橙子牛奶糖 (陈文燕). When reposting, please cite the original link: https://www.cnblogs.com/chenwenyan/p/7055637.html