Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Too many reads containing all N's in Miseq read

    Dear All,
    I recently used Miseq for small RNA library sequencing (single end, 35 cycle). The library was prepared using size selected (19-24 nt) small RNA sequences. When I analyzed the result using Galaxy FastQC, 50 % of my reads contain all Ns. What could be the reason ? Does that mean my sequencing result was horrible ? Also, in the reads that contain readable sequences, all of them are followed at the end with about 7-9 N bases. Could this be because my actual small RNA sequences were shorter than the cycle number and somehow, the instrument started calling for N once it started reading adapter sequence that follows the sRNA insert sequence ?
    I am new with NGS. Any help will be highly appreciated.

    Cheers.

  • #2
    This indicates to me a high probability of a library without sufficient color-balancing. Can you post your fastQC graph of per-position base content, and another of per-position average Q-score? Also, I don't know a lot about small-RNA library-prep - does it involve custom amplification primers, or just size-selection?

    Comment


    • #3
      Hi Brian,
      Please find the dropbox link for FastQC report for my data.

      The 3' adapter ligated sRNA samples were size-selected by gel fractionation and used for 5' adapter ligation followed by reverse transcription and PCR amplification.
      Thanks again for your help.

      Lodoe

      Comment


      • #4
        There is definitely substantial base-content bias, but enough variation that lack of color-balancing shouldn't be the problem. And even if you had 0bp sequences, Illumina primers themselves would still go out past cycle 31 - furthermore, when you go off the end of the molecule, you don't get Ns, but typically bases called from noise from nearby clusters.

        Hmm... maybe someone else has a better idea of what went wrong, but it certainly looks like a failed run to me. I don't know if that's due to the machine or the library.

        Comment


        • #5
          Did you spike phiX in this run? There could be several things wrong here. Low nucleotide diversity, bad libraries are all possibilities. Have you contacted illumina tech support and have them take a look at the raw data?

          Comment


          • #6
            To my knowledge libraries prepared with TruSeq small RNA kit can not be sequenced on MiSeq due to lack of sequencing primers for them in primer mixes of kit.

            Comment


            • #7
              Originally posted by windhorse8 View Post
              50 % of my reads contain all Ns. What could be the reason ?
              Hi- This is just an hypothesis. What is the origin of the fastq files? If they went through the MiSeq processing pipeline it might be that adapter sequences have been masked with N, hence the large number of Ns you see (see also
              http://support.illumina.com/content/...15028392-j.pdf). Note also that you don't observe enrichment for adapter sequences further supporting the hypothesis that adapters have been removed from this file. Also, the raw fastq files I have seen from Illumina sequencers don't contain N no matter how bad the quality is, you get one of ACTG.

              Comment


              • #8
                Originally posted by nucacidhunter View Post
                To my knowledge libraries prepared with TruSeq small RNA kit can not be sequenced on MiSeq due to lack of sequencing primers for them in primer mixes of kit.
                Do you have more information about this? From this page it seems the MiSeq supports small RNA library preps.

                Comment


                • #9
                  Hi dariober
                  Once I tried to prepare a sample sheet with Illumina Experiment Manager for small RNA library, but there was not any option for small RNA under MiSeq. I do not know if new version have that option.

                  Comment


                  • #10
                    Hi Dariober,
                    The fastQ file was sent by our collaborator who did the sequencing. I don't know if they went through Miseq pipeline. However, based on what you are saying and info provided in the link you sent, it seems like adapter sequences are masked. The reads containing all N's could represent adapter dimer sequences. Also, the reads with the actual sequence trailed with 6-8 N's could be the sRNA sequence followed by adapter sequences which are again masked.
                    Thank you again for your help. I will contact our collaborator and ask more details about the sequencing protocol followed for my library.

                    Comment


                    • #11
                      Originally posted by nucacidhunter View Post
                      Hi dariober
                      Once I tried to prepare a sample sheet with Illumina Experiment Manager for small RNA library, but there was not any option for small RNA under MiSeq. I do not know if new version have that option.
                      Don't know what version you used but IEM v1.8.2 certainly does.

                      MiSeq --> RNA Sequencing (Category) --> Small RNA (Application) if you want MiSeq Reporter to run the small RNA analysis.

                      OR

                      MiSeq --> Other --> FASTQ Only and then choose TruSeq Small RNA from the Sample Prep Kit drop down on the next page if you just want the sequence files.

                      Comment


                      • #12
                        Reason for so many Ns

                        This same issue happened to me. What causes it is the minimum threshold setting for read length after adapter trimming. The default setting after trimming adapters is to label everything under 35 nucleotides to be Ns. So, if you used a read length of 75 nucleotides and tried to sequence miRNA, then the miRNA would be less than 35 nucleotides and show up all as Ns. This minimum threshold can be set by running bcl2fastq. However, if you are doing your analysis in basespace, you can either set a regular mRNA or miRNA library but not both. So, if you select a mRNA library, then the small miRNA reads will be masked as Ns. This happened to me when I combined ChIP-Seq data with miRNA data. There is NO way to fix this in basespace after the run. You have to download the raw bcl files and then convert bcl2fastq by changing the minimum threshold setting. I know this is way after you posted this, but if you couldn't figure out what was causing the problem... here it is.

                        Comment

                        Latest Articles

                        Collapse

                        • seqadmin
                          Strategies for Sequencing Challenging Samples
                          by seqadmin


                          Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                          03-22-2024, 06:39 AM
                        • seqadmin
                          Techniques and Challenges in Conservation Genomics
                          by seqadmin



                          The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                          Avian Conservation
                          Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                          03-08-2024, 10:41 AM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by seqadmin, Yesterday, 06:37 PM
                        0 responses
                        10 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, Yesterday, 06:07 PM
                        0 responses
                        10 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 03-22-2024, 10:03 AM
                        0 responses
                        51 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 03-21-2024, 07:32 AM
                        0 responses
                        67 views
                        0 likes
                        Last Post seqadmin  
                        Working...
                        X