  • samanta
    Senior Member
    • Feb 2010
    • 108

    Too many short reads and too little RAM?

    Someone asked me whether it makes sense to remove duplicate reads to shrink the library enough to fit within the RAM limit. I think it is a bad strategy, as explained here -

    http://homolog.us
  • zhidkov.ilia
    Member
    • Dec 2010
    • 25

    #2
    I think duplicated reads are removed to avoid biases introduced during library preparation (for example), not to reduce the amount of data for de novo assembly.

    Ilia


    • samanta
      Senior Member
      • Feb 2010
      • 108

      #3
      That's a good point. Some filtering is necessary to deal with read pileups caused by biases. I do that for alignment and SNP discovery, but I think twice about it for de novo assembly. If no underlying genome is known, it is hard to tell whether duplicated reads come from errors or from real sequence.
      http://homolog.us
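
For readers who want to see what "removing duplicate reads" amounts to when no reference is available, here is a minimal sketch in Python. It only collapses reads with exactly identical sequences and keeps a count for each, since sequence identity is the only signal one has without a genome. The function name `collapse_exact_duplicates` and the toy reads are hypothetical, made up for illustration; this is not the method of any particular tool.

```python
from collections import Counter

def collapse_exact_duplicates(reads):
    """Collapse reads with identical sequences (hypothetical helper).

    Returns the unique reads in first-seen order and a Counter holding
    how many times each sequence occurred in the input.
    """
    counts = Counter(reads)
    # Counter is a dict subclass, so it keeps insertion order.
    unique_reads = list(counts)
    return unique_reads, counts

# Toy data, invented for this example.
reads = ["ACGTAC", "ACGTAC", "TTGACA", "ACGTAC", "GGCATT"]
unique_reads, counts = collapse_exact_duplicates(reads)

for seq in unique_reads:
    print(seq, counts[seq])
```

Note that the counts alone do not say whether a sequence seen three times is a library-prep artifact or genuinely high-coverage sequence, which is the difficulty described above.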


      • zhidkov.ilia
        Member
        • Dec 2010
        • 25

        #4
        When you assemble reads into contigs, you want each part of the assembly to be supported by at least several reads. If you have identical reads, you might obtain false contigs.

        Ilia


        • samanta
          Senior Member
          • Feb 2010
          • 108

          #5
          It does not work that way for a k-mer-based assembler. Would you please explain your rationale? Why would one get false contigs?
          http://homolog.us
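
To make the k-mer point concrete, here is a small Python sketch under simplifying assumptions: reads are plain strings, and the "assembler" is reduced to the set of distinct k-mers it would build a graph from. The helper names `kmer_set` and `kmer_counts`, the toy reads, and the choice of k = 4 are all hypothetical. It shows that exact duplicate reads add no new k-mers; they only raise the multiplicity of k-mers that are already present.

```python
from collections import Counter

def kmers(read, k):
    """Yield every length-k substring of a read."""
    for i in range(len(read) - k + 1):
        yield read[i:i + k]

def kmer_set(reads, k):
    """Distinct k-mers across all reads (the graph's building blocks)."""
    return {km for read in reads for km in kmers(read, k)}

def kmer_counts(reads, k):
    """How many times each k-mer occurs across all reads."""
    return Counter(km for read in reads for km in kmers(read, k))

# Toy data, invented for this example; the first read is duplicated.
reads = ["ACGTAC", "ACGTAC", "CGTACG"]
deduplicated = list(dict.fromkeys(reads))  # drop exact duplicates, keep order

k = 4
same_kmers = kmer_set(reads, k) == kmer_set(deduplicated, k)
counts_with_dups = kmer_counts(reads, k)
counts_without_dups = kmer_counts(deduplicated, k)

print("same distinct k-mers:", same_kmers)
print("ACGT count with duplicates:", counts_with_dups["ACGT"])
print("ACGT count without duplicates:", counts_without_dups["ACGT"])
```

The set of k-mers is the same either way, so duplicates by themselves cannot create a new path. What does change is the per-k-mer count, which matters for any assembler that uses coverage to prune likely errors.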


          • zhidkov.ilia
            Member
            • Dec 2010
            • 25

            #6
            Let me rephrase my last comment:
            If duplicated reads don't contribute anything to the downstream de novo assembly pipeline, it would be a good idea to remove them.

            Ilia

