[BUG] MaskPrimers.py trims gapped match correctly only in the forward orientation despite correctly passing in both #110

@vladimirvig

Describe the bug
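When a read contains an indel within the primer site, MaskPrimers.py align still identifies the match correctly in both orientations. However, the primer is trimmed correctly only in the forward orientation; in the reverse orientation (--revpr), the gapped match is not trimmed properly.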

Steps to reproduce
Toy dataset:

>sample1
CTATCGCAAGGAACTGCCGATCTTAATGGATGGCCGGAGGTGGTATGGAAGCTATAAGCGCGGGTGAGAGGGTAATTAGGCGTGTTCACCTACGCTACGCTAACGGGCGATTCTATAAGATTGCACATTGCGTCAACTCATAAGATGTCTCAACGGCATGCGCAACTTGTGAAGTGTCTACTATCCTTAAACGCATATCTCGCACAGTAACTCCCGAATATGTCGGCATCTGATGTTGCCCGGGCCGAGTTAGTGTTGAGCTCACGGAACTTATTGTATGAGTAGTGATTTGTAAGAGTAATTATCTACTTTTTACAACAAATATAAAACAACTAGTATCGATGTCAGTTAGCTCGTTCAGGTAATAGTTGCCCACACAACGTCAAAATAAGAGAACGGTCGTAACATTATCCGTGATTTTCTCACTACTATCAGTACTCACGACTCGACTCTGCCGCAGCCACGTATCGCCTGAAAGCCAGTCAGCGTTAAGGAGTGCTCTGAGCAGGACAACTCGCGTAGTGAGAGTTACATGTTCGTTGGGCTCTTCCGACACGGACCTGAGTTGGCCAACGTCCCACCTGAGGTCTGTGCCCCGGTGATGAGAAGTGTGCATCTCGTTCTTGCAGCTCGTCAGTACTTTCAGAATCATGGCGTGCATGGTAGAATGACCCTTATAACGGACTTCGACATGGCAATAACCCCCCGTTTCTACTTCTAGAGGAGAAAAGTATTGACATGAGCGCTCCCGGCACAAGGGCCAAAGAAGTCTCCAATTTCTTATTTCCGAATGACATGCGTCTCCTTGCGGGTAAATCACCGACCGCAATTCATAGAAGCCTGGGGGAACAGATAGGTCTAATTAGCTTAAGAGAGTAAATCCTGGGATCATCGATACTAGTTGTAATTAAAACTTAGATTAGATTGCTATGCTTTCTTTCAGTAGTAACCATAAACTTACGCTGGGGCTTCTTCGGCGGATTTTTACAGTTACCAACCAGGAGATTTGAAGTAAATCAGTTGAGGATTTAGCCGCGCTATCCGGTAATCTCCAAATTAAAACATACCGTTCCATGAAGGCTAGAATTAC
>sample2
CTATCGCAAGGAACTGCCGATCTTAATGGATGGCCGGAGGTGGTATGGAAGCTATAAGCGCGGGTGAGAGGGTAATTAGGCGTGTTCACCTACGCTACGCTAACGGGCGATTCTATAAGATTGCACATTGCGTCAACTCATAAGATGTCTCAACGGCATGCGCAACTTGTGAAGTGTCTACTATCCTTAAACGCATATCTCGCACAGTAACTCCCGAATATGTCGGCATCTGATGTTGCCCGGGCCGAGTTAGTGTTGAGCTCACGGAACTTATTGTATGAGTAGTGATTTGTAAGAAGAAAGCATAGCAATCTAATCTAAGTTTTAATTACAACTAGTATCGATGTCAGTTAGCTCGTTCAGGTAATAGTTGCCCACACAACGTCAAAATAAGAGAACGGTCGTAACATTATCCGTGATTTTCTCACTACTATCAGTACTCACGACTCGACTCTGCCGCAGCCACGTATCGCCTGAAAGCCAGTCAGCGTTAAGGAGTGCTCTGAGCAGGACAACTCGCGTAGTGAGAGTTACATGTTCGTTGGGCTCTTCCGACACGGACCTGAGTTGGCCAACGTCCCACCTGAGGTCTGTGCCCCGGTGATGAGAAGTGTGCATCTCGTTCTTGCAGCTCGTCAGTACTTTCAGAATCATGGCGTGCATGGTAGAATGACCCTTATAACGGACTTCGACATGGCAATAACCCCCCGTTTCTACTTCTAGAGGAGAAAAGTATTGACATGAGCGCTCCCGGCACAAGGGCCAAAGAAGTCTCCAATTTCTTATTTCCGAATGACATGCGTCTCCTTGCGGGTAAATCACCGACCGCAATTCATAGAAGCCTGGGGGAACAGATAGGTCTAATTAGCTTAAGAGAGTAAATCCTGGGATCATCGATACTAGTTGTTTTATATTTGTTGTAAAAAGTAGATAATTACTTCAGTAGTAACCATAAACTTACGCTGGGGCTTCTTCGGCGGATTTTTACAGTTACCAACCAGGAGATTTGAAGTAAATCAGTTGAGGATTTAGCCGCGCTATCCGGTAATCTCCAAATTAAAACATACCGTTCCATGAAGGCTAGAATTAC
>sample3
CTATCGCAAGGAACTGCCGATCTTAATGGATGGCCGGAGGTGGTATGGAAGCTATAAGCGCGGGTGAGAGGGTAATTAGGCGTGTTCACCTACGCTACGCTAACGGGCGATTCTATAAGATTGCACATTGCGTCAACTCATAAGATGTCTCAACGGCATGCGCAACTTGTGAAGTGTCTACTATCCTTAAACGCATATCTCGCACAGTAACTCCCGAATATGTCGGCATCTGATGTTGCCCGGGCCGAGTTAGTGTTGAGCTCACGGAACTTATTGTATGAGTAGTGATTTGTAAGAGTAATTATCTACTTTTTACAACAAATATAAAACAACTATATCGATGTCAGTTAGCTCGTTCAGGTAATAGTTGCCCACACAACGTCAAAATAAGAGAACGGTCGTAACATTATCCGTGATTTTCTCACTACTATCAGTACTCACGACTCGACTCTGCCGCAGCCACGTATCGCCTGAAAGCCAGTCAGCGTTAAGGAGTGCTCTGAGCAGGACAACTCGCGTAGTGAGAGTTACATGTTCGTTGGGCTCTTCCGACACGGACCTGAGTTGGCCAACGTCCCACCTGAGGTCTGTGCCCCGGTGATGAGAAGTGTGCATCTCGTTCTTGCAGCTCGTCAGTACTTTCAGAATCATGGCGTGCATGGTAGAATGACCCTTATAACGGACTTCGACATGGCAATAACCCCCCGTTTCTACTTCTAGAGGAGAAAAGTATTGACATGAGCGCTCCCGGCACAAGGGCCAAAGAAGTCTCCAATTTCTTATTTCCGAATGACATGCGTCTCCTTGCGGGTAAATCACCGACCGCAATTCATAGAAGCCTGGGGGAACAGATAGGTCTAATTAGCTTAAGAGAGTAAATCCTGGGATCATCGATACTAGTTGTAATTAAAACTTAGATTAGATTGCTATGCTTTCTTTCAGTAGTAACCATAAACTTACGCTGGGGCTTCTTCGGCGGATTTTTACAGTTACCAACCAGGAGATTTGAAGTAAATCAGTTGAGGATTTAGCCGCGCTATCCGGTAATCTCCAAATTAAAACATACCGTTCCATGAAGGCTAGAATTAC
>sample4
CTATCGCAAGGAACTGCCGATCTTAATGGATGGCCGGAGGTGGTATGGAAGCTATAAGCGCGGGTGAGAGGGTAATTAGGCGTGTTCACCTACGCTACGCTAACGGGCGATTCTATAAGATTGCACATTGCGTCAACTCATAAGATGTCTCAACGGCATGCGCAACTTGTGAAGTGTCTACTATCCTTAAACGCATATCTCGCACAGTAACTCCCGAATATGTCGGCATCTGATGTTGCCCGGGCCGAGTTAGTGTTGAGCTCACGGAACTTATTGTATGAGTAGTGATTTGTAAGAAGAAAGCATAGCAATCTAATCTAAGTTTTAATTACAACTAGTATCGATGTCAGTTAGCTCGTTCAGGTAATAGTTGCCCACACAACGTCAAAATAAGAGAACGGTCGTAACATTATCCGTGATTTTCTCACTACTATCAGTACTCACGACTCGACTCTGCCGCAGCCACGTATCGCCTGAAAGCCAGTCAGCGTTAAGGAGTGCTCTGAGCAGGACAACTCGCGTAGTGAGAGTTACATGTTCGTTGGGCTCTTCCGACACGGACCTGAGTTGGCCAACGTCCCACCTGAGGTCTGTGCCCCGGTGATGAGAAGTGTGCATCTCGTTCTTGCAGCTCGTCAGTACTTTCAGAATCATGGCGTGCATGGTAGAATGACCCTTATAACGGACTTCGACATGGCAATAACCCCCCGTTTCTACTTCTAGAGGAGAAAAGTATTGACATGAGCGCTCCCGGCACAAGGGCCAAAGAAGTCTCCAATTTCTTATTTCCGAATGACATGCGTCTCCTTGCGGGTAAATCACCGACCGCAATTCATAGAAGCCTGGGGGAACAGATAGGTCTAATTAGCTTAAGAGAGTAAATCCTGGGATCATCGATACTAGTTGTTTTATATTTGTTGTAAAAAGTAGTAATTACTTCAGTAGTAACCATAAACTTACGCTGGGGCTTCTTCGGCGGATTTTTACAGTTACCAACCAGGAGATTTGAAGTAAATCAGTTGAGGATTTAGCCGCGCTATCCGGTAATCTCCAAATTAAAACATACCGTTCCATGAAGGCTAGAATTAC

primers.fasta

>PGK1
AGTAATTATCTACTTTTTACAACAAATATAAAACAACTAGTATCG
>TEF1
AAGAAAGCATAGCAATCTAATCTAAGTTTTAATTACAACTAGTATCG
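
For reference, a minimal sketch (plain Python, no pRESTO calls) that checks which tail-end primer sites in the toy dataset are exact matches. The primer sequences are copied from above; the two read strings are short excerpts around the tail-end PGK1 site of sample2 and sample4, not the full reads. The helper name `reverse_complement` and the dictionary names are only illustrative.

```python
# Primer sequences copied from the primer file above.
PRIMERS = {
    "PGK1": "AGTAATTATCTACTTTTTACAACAAATATAAAACAACTAGTATCG",
    "TEF1": "AAGAAAGCATAGCAATCTAATCTAAGTTTTAATTACAACTAGTATCG",
}

# Short excerpts (not full reads) around the tail-end primer site,
# copied from sample2 and sample4 of the toy dataset.
TAIL_EXCERPTS = {
    "sample2": "GGGATCATCGATACTAGTTGTTTTATATTTGTTGTAAAAAGTAGATAATTACTTCAGTAGTAACC",
    "sample4": "GGGATCATCGATACTAGTTGTTTTATATTTGTTGTAAAAAGTAGTAATTACTTCAGTAGTAACC",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


# The tail-end site is expected to carry the reverse complement of the primer.
pgk1_rc = reverse_complement(PRIMERS["PGK1"])

# True where the reverse-complemented primer occurs without any edit.
exact_hits = {name: pgk1_rc in seq for name, seq in TAIL_EXCERPTS.items()}
print(exact_hits)
```

In these excerpts the sample2 site is an exact match, while the sample4 site is missing a single base, so sample4 is the read that exercises the gapped match in the reverse orientation. Sample3 carries the analogous single-base deletion in its forward PGK1 site.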

Script used:

MaskPrimers.py align --mode cut -s sequence.fasta -p primer.fasta --skiprc --maxerror 0.2 --log MP1.log --failed --maxlen 800
MaskPrimers.py align --mode cut -s sequence_primers-pass.fasta -p primer.fasta --revpr --skiprc --maxerror 0.2 --log MP2.log --failed --maxlen 800
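
As a rough illustration of the trimming I would expect for the gapped reverse match (this is not pRESTO's implementation, and the actual cause in MaskPrimers.py is not established here), the sketch below locates the closest window to the reverse-complemented PGK1 primer in the sample4 tail excerpt and cuts there. It assumes that `--mode cut` combined with `--revpr` should keep only the sequence upstream of the primer site. `edit_distance` and `best_site` are hypothetical helpers written for this example.

```python
def edit_distance(a, b):
    """Levenshtein distance between two strings (row-by-row DP)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def best_site(seq, primer, slack=2):
    """Return (start, end, distance) of the window in seq closest to primer.

    Window lengths within +/- slack of the primer length are tried so that
    sites carrying a small indel can still be located.
    """
    best = None
    for length in range(len(primer) - slack, len(primer) + slack + 1):
        for start in range(len(seq) - length + 1):
            dist = edit_distance(seq[start:start + length], primer)
            if best is None or dist < best[2]:
                best = (start, start + length, dist)
    return best


# Reverse complement of the PGK1 primer from the primer file.
PGK1_RC = "CGATACTAGTTGTTTTATATTTGTTGTAAAAAGTAGATAATTACT"
# Excerpt (not the full read) around the tail-end primer site of sample4.
SAMPLE4_TAIL = "GGGATCATCGATACTAGTTGTTTTATATTTGTTGTAAAAAGTAGTAATTACTTCAGTAGTAACC"

start, end, dist = best_site(SAMPLE4_TAIL, PGK1_RC)
# Assumed expectation for cut mode at the tail end: drop the primer site
# and everything after it, keeping only the upstream sequence.
trimmed = SAMPLE4_TAIL[:start]
print(start, end, dist, trimmed)
```

The point of the sketch is that the cut boundary comes from where the aligned site starts in the read, so a one-base deletion inside the primer site shortens the site by one base but should not shift the upstream boundary.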

Environment and versions:

  • OS: Linux
  • Relevant immcantation packages and versions: pRESTO 0.7.8 (as well as 0.7.4)
  • If used, the version of the Immcantation container (in the container, use builds report): immcantation/suite:4.6.0

MP1.log

MP2.log

Labels: bug