Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Per base sequence content error

    Hi,

    I have 3 questions related with my fails in FastQC report.


    1. I attached a printscreen presents graph with per base sequence content. Could tell me why I see deviation in the 3' end. Should I cut 3 bp on the 3' end?

    2. My report shows red flag in Per tile sequence quality. I see few red tiles on the heatmap. What I should do with it?

    3. I would like to use BWA-MEM to do alignment to reference genome. I heard that BWA-MEM expect min. length of read = 70 bp. Should I set MINLEN in Trimmomatic = 70?
    Attached Files

  • #2
    Fails (red X) does not automatically mean your data is bad. You need to take into account context of what kind of experiment this is. Can you tell us about that?

    1. Aligners should soft-clip (remove from alignment) bases that do not map, so those at the end of the reads should be taken care of.
    2. Do nothing specific. You don't have a high number of bad tiles.
    3. 70 is pretty high. What genome is this data from. Even for human genome reads > 40 bp will map fine.

    Comment


    • #3
      Thank you. I do not know the context. I have just obtained this raw data and I would like to do SNP calling.

      So in 1 and 2 cases I can just ignore without trimming?

      3. I read that BWA-MAM want min length = 70 (in manual). It is bovine genome (Bos taurus).

      Comment


      • #4
        Context was the type of experiment. Sounds like this is genome resequencing.

        If you want to reduce possibility of reads multi-mapping by chance then going with longer reads is better. Minlen of 70 should be strict enough.

        You would want to scan and trim your data so any extraneous sequence present (adapter etc.) can be removed. I recommend using "bbduk.sh" from BBMap suite (thread here as well as a guide). Set "minlen=70".

        Comment


        • #5
          Ah, ok I understand. It is WGS sequencing

          Comment


          • #6
            Hmm you recommend a BBDUK trimmer. What do you think about trimmomatic? I would like to use with this set of settings: TruSeq3-PE.fa:2:30:10 SLIDINGWINDOW:4:12 MINLEN:70

            My supervisor prefers Trimmomatic. Can I use it with settings above?

            Thank you!

            Comment


            • #7
              Sure you can use trimmomatic. The options looks right (I don't know them off the top of my head) and should work.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Current Approaches to Protein Sequencing
                by seqadmin


                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                04-04-2024, 04:25 PM
              • seqadmin
                Strategies for Sequencing Challenging Samples
                by seqadmin


                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                03-22-2024, 06:39 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 04-11-2024, 12:08 PM
              0 responses
              18 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 10:19 PM
              0 responses
              22 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 09:21 AM
              0 responses
              16 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-04-2024, 09:00 AM
              0 responses
              47 views
              0 likes
              Last Post seqadmin  
              Working...
              X