Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • bbmerge detects different adapters

    Hello,

    I have some bacterial genomes which were sequenced using the Nextera Flex library prep kit ( insert size ~350 bp) and MiSeq paired end reads (2x300). I want to merge these reads with bbmerge prior to adapter trimming and assembly.

    I have used bbmerge's adapter detection feature but have gotten some strange results.
    bbmerge seems to detect different adapters on each sample, and many have multiple N's.

    The first part of the detected adapter sequence is what Illumina provides as the Nextera Flex adapter sequence, but I'm not sure what the other stuff is.

    Here are some of the adapters that were detected:

    >Read1_adapter
    CTGTCNCTTATACACATCTCCGAGCCCACGAGACGGACTCCTANCTCGTATGCCGTCTTCTGCTTG

    >Read2_adapter
    CTGTCNCTTATACNCATCTGACGCTGCCGACGAAGAGGANAGNGNNNNNNNNGNNNGNNNC

    This is the Nextera Flex adapter sequence given by Illumina CTGTCTCTTATACACATCT

    as you can see the first part matches this except for an N stuck in.

    Across my 20 samples 16 unique Read1 adapters were detected and
    18 unique Read2 adapters were detected.

    Should I worry about this? should I feed the detected adapter sequences into bbmerge? should I just give bbmerge the Illumina provided sequence?

    Thanks!

  • #2
    I would suggest that you use the Illumina provided adapter sequence. BBMerge detection feature is good when you don't have that information a priori. There may be some sequencing errors in your reads which is leading to that N insertion.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin


      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
      Today, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    37 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    41 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 09:21 AM
    0 responses
    35 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-04-2024, 09:00 AM
    0 responses
    54 views
    0 likes
    Last Post seqadmin  
    Working...
    X