Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Fixed length trimming of paired end reads for MISO

    Hey y'all,

    I'm in possession of a dataset that needs some pretty aggressive trimming to get it to align properly, but because MISO requires fixed read lengths, I'm trying to trim a fixed length from all of my reads. However, I'm not really sure how to approach it! Should I trim a bit off the 5' or 3' or both of either or both pairs? Anyone have any advice on how to go about doing this?

  • #2
    If you map your reads with BBMap using the "mhist" flag:

    Code:
    bbmap.sh ref=ref.fa in=reads.fq mhist=mhist.txt out=mapped.sam
    ...you can plot that output to see the mismatch rate by position in the reads, which will help you determine the best fixed length to trim. Probably you only need to trim the right end; normally the left end is pretty good for Illumina randomly-sheared libraries.

    However, if you generate the sam files using BBMap in the first place, you don't really need to trim the reads first, as it's very sensitive and designed for low-quality data. It's still a good idea to trim if, say, the last 15 bases are complete garbage but it's very rare that that's necessary.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Advancing Precision Medicine for Rare Diseases in Children
      by seqadmin




      Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
      12-16-2024, 07:57 AM
    • seqadmin
      Recent Advances in Sequencing Technologies
      by seqadmin



      Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

      Long-Read Sequencing
      Long-read sequencing has seen remarkable advancements,...
      12-02-2024, 01:49 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 12-17-2024, 10:28 AM
    0 responses
    22 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 12-13-2024, 08:24 AM
    0 responses
    42 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 12-12-2024, 07:41 AM
    0 responses
    28 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 12-11-2024, 07:45 AM
    0 responses
    42 views
    0 likes
    Last Post seqadmin  
    Working...
    X