  • bioinfosm
    Senior Member
    • Jan 2008
    • 483

    % unmapped reads

    Hi all,

    For any solexa run, we see 60-80% passing filter reads, of which 70-90% map to the reference sequence. There are quite a few reads that map to adaptors and other random sequences.

    Does anyone know of a resource that gives an average for the fraction of passing-filter reads that map to adapters, etc., and are essentially noise? I remember it being mentioned in some review as well, but cannot recall which one.
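
    To put the percentages quoted above on a common footing, here is a minimal sketch that converts them into fractions of all raw reads. The function name `fraction_of_raw` is made up for illustration, and the inputs are only the rough ranges mentioned in this post, not measured values.

```python
from itertools import product


def fraction_of_raw(pf_fraction, mapped_of_pf):
    """Return (mapped, unmapped_pf), both as fractions of all raw reads.

    pf_fraction  -- fraction of raw reads that pass filter
    mapped_of_pf -- fraction of passing-filter reads that map to the reference
    """
    mapped = pf_fraction * mapped_of_pf
    unmapped_pf = pf_fraction * (1.0 - mapped_of_pf)
    return mapped, unmapped_pf


# Corner cases of the ranges quoted in the post (60-80% PF, 70-90% mapped).
for pf, mp in product((0.60, 0.80), (0.70, 0.90)):
    mapped, unmapped_pf = fraction_of_raw(pf, mp)
    print(f"PF {pf:.0%}, mapped {mp:.0%} of PF: "
          f"{mapped:.0%} of raw reads map, "
          f"{unmapped_pf:.0%} pass filter but do not map")
```

    With those ranges, roughly 42-72% of raw reads end up mapped, and roughly 6-24% pass the filter but stay unmapped; that second group is the one being asked about here.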

    thanks..
    --
    bioinfosm
  • der_eiskern
    Member
    • Jul 2009
    • 46

    #2
    i'm not sure what exactly you want ("an average for those passing-filter reads")

    but i've also noticed that often two-thirds of the 10-30% of unmappable reads are alignable if you can tolerate a higher error rate.

    from cloning random fragments i've noticed that 1/10-1/100 of cloned fragments from a sequencing library will have a truncated adaptor or multiple adaptors. since i didn't get my adaptors from illumina directly this could reflect contamination of my modified IDT oligos. it varied between library preps using the same batch of adaptors.

    can you not just make a fasta file of the adaptor and iteratively truncate your read during mapping to such a fasta file to estimate it? i'd be more interested in seeing what doesn't pass filter looks like, any idea how to get that data from the pipeline?
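
    A rough sketch of that estimate, under stated assumptions: instead of truncating reads and running them through a real aligner, it compares each suffix of a read directly against the start of the adaptor and allows a small number of mismatches. The names `adapter_start` and `adapter_fraction`, the parameters `min_overlap` and `max_mismatches`, and the adaptor and read sequences are all hypothetical, made up for illustration; substitute the real adaptor from your fasta file and your own unmapped reads.

```python
def adapter_start(read, adapter, min_overlap=8, max_mismatches=1):
    """Return the first read position where the adapter appears to begin.

    At each position the rest of the read is compared with the start of
    the adapter (the adapter may run off the 3' end of the read).
    Returns None if no position has an overlap of at least `min_overlap`
    bases with at most `max_mismatches` mismatches.
    """
    for start in range(len(read) - min_overlap + 1):
        overlap = min(len(adapter), len(read) - start)
        window = read[start:start + overlap]
        # zip stops at the end of the window, so only the overlap is compared
        mismatches = sum(a != b for a, b in zip(window, adapter))
        if mismatches <= max_mismatches:
            return start
    return None


def adapter_fraction(reads, adapter, **kwargs):
    """Fraction of reads in which adapter_start() finds the adapter."""
    if not reads:
        return 0.0
    hits = sum(adapter_start(r, adapter, **kwargs) is not None for r in reads)
    return hits / len(reads)


# Hypothetical adaptor and toy reads, NOT real Illumina sequences.
ADAPTER = "ACGTTGCAAGGCTTAC"
READS = [
    "CCCTAAACCCTAAACCCTAACCCT",        # insert only, no adaptor
    "CCCTAAACCCTA" + ADAPTER[:12],     # short insert, then adaptor
    "CCCTAAACCCTAAACC" + "ACGTAGCA",   # adaptor start with one sequencing error
    ADAPTER + "AAAAAAAA",              # adaptor at position 0 (no insert)
]

print(adapter_fraction(READS, ADAPTER))                    # one mismatch allowed
print(adapter_fraction(READS, ADAPTER, max_mismatches=0))  # exact matches only
```

    Running it on the unmapped reads only, and comparing the result with and without mismatches, gives a first idea of how much of the unmapped fraction is adaptor. Short overlaps can match by chance, so `min_overlap` is worth tuning on real data.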


    • Xi Wang
      Senior Member
      • Oct 2009
      • 317

      #3
      Originally posted by der_eiskern
      from cloning random fragments i've noticed that 1/10-1/100 of cloned fragments from a sequencing library will have a truncated adaptor or multiple adaptors. since i didn't get my adaptors from illumina directly this could reflect contamination of my modified IDT oligos. it varied between library preps using the same batch of adaptors.
      I wonder what makes the sequencer skip the adaptor sequence and start reporting exactly at the beginning of the fragment we want to sequence. Or is it quite normal to sequence the tails of adaptors?

      Originally posted by der_eiskern
      can you not just make a fasta file of the adaptor and iteratively truncate your read during mapping to such a fasta file to estimate it? i'd be more interested in seeing what doesn't pass filter looks like, any idea how to get that data from the pipeline?
      Reads that do not pass the filter usually have low quality scores, which indicates that the base calls may be incorrect or that the bases were hard to call.
      Xi Wang


      • aleferna
        Senior Member
        • Sep 2009
        • 121

        #4
        Solexa Sequencing of Paired Ends

        Does anybody know what percentage of adapter sequence one would expect in a Solexa sequencing run?

        I'm mapping a Solexa paired-end run and I had to mask away about 50% of the sample because it maps to the adapters. Is this normal? If so, and we do this again, how would you minimize the amount of adapter sequence that ends up in the library?

        If it is not normal, is there any doc/spec that we can use to complain to the sequencing service and get it done properly?


        • Xi Wang
          Senior Member
          • Oct 2009
          • 317

          #5
          Originally posted by aleferna
          Does anybody know what percentage of adapter sequence one would expect in a Solexa sequencing run?

          I'm mapping a Solexa paired-end run and I had to mask away about 50% of the sample because it maps to the adapters. Is this normal? If so, and we do this again, how would you minimize the amount of adapter sequence that ends up in the library?

          If it is not normal, is there any doc/spec that we can use to complain to the sequencing service and get it done properly?
          It may depend on what you sequenced. RNA-seq? miRNA-seq?
          Xi Wang


          • aleferna
            Senior Member
            • Sep 2009
            • 121

            #6
            just plain human DNA


            • Xi Wang
              Senior Member
              • Oct 2009
              • 317

              #7
              I have no experience with plain DNA sequencing, but I think your problem may be largely due to sample preparation.
              Xi Wang


              • aleferna
                Senior Member
                • Sep 2009
                • 121

                #8
                But in which part? The main issue is that we provided the samples, but another lab did the paired-end preparation. If there was a problem in the paired-end preparation, we need to go back to these people and tell them that they made a mistake. Because it's such an expensive test, this can get really ugly, and I don't have any reference to make the point...


                • aleferna
                  Senior Member
                  • Sep 2009
                  • 121

                  #9
                  How much do you expect in RNA?
