缺少输入snakemake扩展错误混合输入

如何解决缺少输入snakemake扩展错误混合输入

我的 snakemake 工作流程打印混合输入。

/scratch/blumenscheitc/workplace_5z/diaa_nn_fasta/Snakefile 的第 21 行中的 MissingInputException：缺少规则 prokka 的输入文件：基因组/real_name_hypo_200707_20-07797_19-03322_S19.fasta.fasta

我不知道为什么难过。代码如下。我写了很多这样的工作流程，但这个不起作用。

感谢您的帮助！

import sys
import os

shell.executable('/bin/bash')

IDS,= glob_wildcards('genome/{sample}.fasta')

rule all:
    input:
        expand('results/proteome_DIA_input_{sample}.fasta',sample=IDS),expand('temp/real_name_hypo_{sample}.fasta',expand('temp/real_name_hypo_{sample}.tsv',expand('temp/names_hypo_{sample}.data',expand('temp/hypo_count_{sample}.fasta',expand('temp/non_hypo_{sample}.fasta',expand('temp/hypo_{sample}.fasta',expand('temp/{sample}.fasta',expand('proteomes/{sample}.gbk',expand('temp/{sample}',rule prokka:
    input:
        'genome/{sample}.fasta',output:
        'temp/{sample}',shell:
        '''
        prokka {input} -prefix proteome --addgenes --compliant --outdir {output} --cpu 40
        '''

rule prokka_rename:
    input:
        'temp/{sample}',output:
        'proteomes/{sample}.gbk',shell:
        '''
        cp '{input}/proteome.gbk' '{output}'
        '''

rule gbk2fasta:
    input:
        'proteomes/{sample}.gbk',output:
        'temp/{sample}.fasta',shell:
        '''
        python scripts/gbk2fa_prot.py {input} {output}
        '''

rule hypo_fasta:
    input:
        'temp/{sample}.fasta',output:
        'temp/hypo_{sample}.fasta',shell:
        '''
        seqkit grep -w 0 -rnip 'hypothetical' {input} > {output}
        '''

rule non_hypo_fasta:
    input:
        'temp/{sample}.fasta',output:
        'temp/non_hypo_{sample}.fasta',shell:
        '''
        cat {input} | grep -v '^>' | grep '^.' | tr -d '[:blank:]' | cat >( echo '>seq_name') - > {output}
        '''

rule hypo_fasta_count:
    input:
        'temp/hypo_{sample}.fasta',output:
        'temp/hypo_count_{sample}.fasta',shell:
        """
        seqkit replace -p .+ -r "hypothetical_protein_{{nr}}" {input} > {output}
        """

rule diamond_hypo:
    input:
        'temp/hypo_count_{sample}.fasta',output:
        'temp/blast_hypo_{sample}.data',shell:
        '''
        diamond blastp -b 20 --threads 40 --max-target-seqs 1 --db "/home/xxxxxxxxxxxxx/wissdaten/ZBS6_Orbitrap/xxxxxxxx/Databases/uni_bac.dmnd" -f 6 sseqid full_qseq --query {input} --out {output}
        '''

rule hypo_header:
    input:
        'temp/names_hypo_{sample}.data',output:
        'temp/real_name_hypo_{sample}.tsv',script:'scripts/hypo_header.R'

rule hypo_tsv2fasta:
    input:
        'temp/real_name_hypo_{sample}.tsv',output:
        'temp/real_name_hypo_{sample}.fasta',shell:
        '''
        seqkit tab2fx {input} > {output}
        '''

rule combine_fasta:
    input:
        I1='temp/real_name_hypo_{sample}.fasta',I2='temp/non_hypo_{sample}.fasta',output:
        'results/proteome_DIA_input_{sample}.fasta',shell:
        '''
        cat {input.I2} {input.I1} > {output}
        '''

缺少输入snakemake扩展错误混合输入

如何解决缺少输入snakemake扩展错误混合输入

相关推荐