Skip to content
This repository was archived by the owner on Dec 11, 2022. It is now read-only.
This repository was archived by the owner on Dec 11, 2022. It is now read-only.

kaks.py script produces inflated SNP counts #31

@jfaberha

Description

@jfaberha

Thank you for the useful script, but I noticed an error in my output for kaks.py. Because the script totals the numbers of missense and sense SNPs per gene, I found the counts to be redundant for genes that have alternatively spliced transcripts annotated in the gff/gtf file used for SnpEff. For example, two missense SNPs in one gene were each counted 12 times (once per isoform that included the exon in question) giving me a kA of 24 and a kS of 0. I was able to modify the script on my own to give me kA/kS for each feature (mRNA accession) as opposed to each gene, but it might also be useful to output kA/kS per gene while accounting for SNP redundancy.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions