Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Bowtie2 with tight matching constraints on repeat sequences

    Hi, all.

    I'm working with a tightly constrained sequence analysis project. The mostly unimportant part: I'm aligning repeat sequences (LINES) to the mouse genome using Bowtie2, with IonTorrent sequence.

    The important part is that I'm trying to use Bowtie2 with some serious constraints (high penalty for mismatches: mx 1000 and --ignore quals ) while allowing snps on "N" in the reference sequence with loose -np 0 and --n-ceil C,3. No indels (--rdg 1000,40, --rfg 1000,40). My mapping constraints are --score-min G,30,20.

    This would all work great if it wasn't for the fact that it's not really doing a good job matching sequences that are identical except for the variation in the reference sequence at the position where I've put an "N" to indicate an ambiguous character. Anything with an ambiguous character isn't getting aligned.

    For instance, AATAAGGACTAGGAC will align sequence, but AATAANGACTAGGAC will not.

    I was wondering if anyone else has had success with mapping highly similar sequences, while trying to allow at least one ambiguous character within the reference -- and what Bowtie parameters they used?

Latest Articles

Collapse

  • seqadmin
    Essential Discoveries and Tools in Epitranscriptomics
    by seqadmin




    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
    04-22-2024, 07:01 AM
  • seqadmin
    Current Approaches to Protein Sequencing
    by seqadmin


    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
    04-04-2024, 04:25 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Yesterday, 08:06 AM
0 responses
16 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-30-2024, 12:17 PM
0 responses
19 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-29-2024, 10:49 AM
0 responses
23 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-25-2024, 11:49 AM
0 responses
28 views
0 likes
Last Post seqadmin  
Working...
X