Skip to content

MissingInputException in rule filter_reads_mhcII_SE #43

@rashidma

Description

@rashidma

I tested scanneo2 examples and it worked fine. Now, when i run on my data using below command:
snakemake --cores all --configfile /media/user/Drive1/NRC21R-082-03/FFPE/scanneo2/config.yaml --software-deployment-method conda

It gives me error as shown below:

Config file config/config.yaml is extended by additional config specified via the command line.
Assuming unrestricted shared filesystem usage for local execution.
Building DAG of jobs...
MissingInputException in rule filter_reads_mhcII_SE in file /media/user/Drive1/ScanNeo2/workflow/rules/hlatyping.smk, line 301:
Missing input files for rule filter_reads_mhcII_SE:
output: results/BC-FFPE-0003/hla/mhc-II/reads/dna_tumor_DNA_flt_SE.bam
wildcards: sample=BC-FFPE-0003, group=dna_tumor, nartype=DNA
affected files:
results/BC-FFPE-0003/hla/reads/dna_tumor_DNA_SE.fq

Please help to troubleshoot.
Thanks

######################## config.yaml #################################

General Settings

reference:
release: 111
nonchr: false
threads: 30
mapq: 30 # overall required mapping quality
basequal: 20 # overall required base quality

data

data:
name: BC-FFPE-0003
dnaseq:
dna_normal: /media/user/Drive1/NRC21R-082-03/FFPE/rawdata/BC-FFPE-0003-N-exome.fastq
dna_tumor: /media/user/Drive1/NRC21R-082-03/FFPE/rawdata/BC-FFPE-0003-T-exome.fastq
rnaseq:
rna_tumor: /media/user/Drive1/NRC21R-082-03/FFPE/rawdata/BC-FFPE-0003-T-rnaseq.fastq
normal: dna_normal

custom:
variants:
hlatyping:
MHC-I:
MHC-II:

pre-processing (only applied on fastq reads)

preproc:
activate: true # whether (=true) or not (=false) to include pre-processing
minlen: 10
qual: 20
slidingwindow:
activate: true
wsize: 3
wqual: 20

alingment

align:
minovlps: 10
chimsegmin: 20
chimoverhang: 10
chimmax: 50
chimmaxdrop: 30

variant calling

alternative splicing

altsplicing:
activate: true # whether (=true) or not (=false) to include alternative splicing events
confidence: 3 # confidence level (1,2 or 3) - filtering of input alignments
iterations: 5 # number of iteratios (when adding intro edges) - increases sensitivity
edgelimit: 250 # limit max number of edges in graph - affects the runtime

exitron splicing

exitronsplicing:
activate: true # whether (=true) or not (=false) to include exitron-splicing events
ao: 3 # allele observation
pso: 0.05 # percent spliced out
strand: 1 # strand specificity of library (0=unstranded, 1=forward, 2=reverse)

gene fusion

genefusion:
activate: true # whether (=true) or not (=false) to include gene fusion events
maxevalue: 0.3
suppreads: 2 # all fusions with less than suppreads are discarded
maxsuppreads: 1000
maxidentity: 0.3 # genes with fraction of identity are discarded (homologs)
hpolymerlen: 6 # removes breakpoints adjacent to homopolymers of length
readthroughdist: 10000 # distance between breakpoints with less than distance
minanchorlen: 20 # removes fusions whose segments are less than minchimlen
splicedevents: 4 # fusions between genes need at least this many spliced breakpoints
maxkmer: 0.6 # remove reads with repetitive 3-mer that make up more than maxkmer
fraglen: 200 # mean fragment length
maxmismatch: 0.01

indel

indel:
activate: true # whether (=true) or not (=false) to include indels
type: all # long, short, all
mode: BOTH # DNA, RNA or BOTH -

strategy for optimizing posterior probability threshold

strategy: OPTIMAL_F_SCORE # OPTIMAL_F_SCORE, FALSE_DISCOVERY_RATE, CONSTANT
fscorebeta: 1.0 # rel. weight of recall to precision (when OPTIMAL_F_SCORE is selected)
fdr: 0.05 # false discovery rate (when FALSE_DISCOVERY_RATE is selected)
sliplen: 8 # min number of reference bases to suspect slippage event
sliprate: 0.1 # frequency of slippage when it is supsected

quantification:
mode: BOTH # RNA, RNA or BOTH

hlatyping:
class: BOTH # I, II or BOTH

specific path for class II hlatyping (only required when class: II, or BOTH)

MHC-I_mode: DNA # DNA, RNA, or BOTH (if empty alleles have to be specified in custom)
MHC-II_mode: DNA # DNA, RNA, or BOTH (if empty alleles have to be specified in custom)

freqdata: ./hlahd.1.7.1/freq_data/
split: ./hlahd.1.7.1/HLA_gene.split.txt
dict: ./hlahd.1.7.1/dictionary/

prioritization:
class: BOTH # I, II or BOTH
lengths:
MHC-I: 8,9,10,11
MHC-II: 13,14,15

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions