Skip to content

bcftools fails to genotype MNP/indels from targeted-sequencing reads #1706

@haochenz96

Description

@haochenz96

Hi,

Given a list of known alleles, I was trying to genotype REF/ALT read counts in some samples' BAM files. Relevant to the issue below is that these are libraries generated by targeted amplicon-sequencing.
I used the following command:

bcftools mpileup \
            -Ou \
            -R {params.PANEL_AMPLICON} \
            -f {params.REF_GENOME} \
            --annotate FORMAT/AD,FORMAT/DP,INFO/AD \
            --max-depth 100000 \
            --max-idepth 100000 \
            --no-BAQ \
            --tandem-qual 10000 \
            {input.SC_BAM} | \
        bcftools call \
            --keep-alts \
            -C alleles \
            -T {input.CANDIDATE_ALLELE} \
            --multiallelic-caller \
            -Oz \
            -o {output.SC_MPILEUP_VCF} && \
        tabix {output.SC_MPILEUP_VCF}
        """

But for ALL of the MNP/Indels in the given allele list, the output VCF contains 0 ALT read in FORMAT/AD despite they show up in the INFO/AD field, as shown for the chr12:25398284 locus below:

12      25398284        .       CC      AT      146     .       DP=59;AD=36,19;VDB=1.29641e-11;SGB=-0.69168;RPB=1;MQB=1;BQB=0.825401;MQ0F=0;AC=0;AN=2;DP4=0,36,0,19;MQ=60       GT:PL:DP:AD     0/0:136,244,255:55:36,0
12      46230514        .       T       C       282     .       DP=599;AD=595,0;MQ0F=0;AC=0;AN=2;DP4=595,0,0,0;MQ=60    GT:PL:DP:AD     0/0:0,255,255:595:595,0
12      46243885        .       C       G       189     .       DP=418;AD=174,235;VDB=0;SGB=-0.693147;RPB=0.720526;MQB=1;BQB=0.999993;MQ0F=0;ICB=1;HOB=0.5;AC=1;AN=2;DP4=174,0,235,0;MQ=60      GT:PL:DP:AD     0/1:222,0,197:409:174,235

Note that this applies to every indel/MNP so is very likely caused by some call setting not compatible with the sequencing technology. I feel like this could be a easy fix by disabling some base/mapping quality filters, since my goal is to simply get the REF/ALT counts for each known allele. I would appreciate any help!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions