  • Please help with assembly 2x250 bp with 350 insert

    Hi folks,

    I have got data generated from 2x250 bp with 350 bp insert for fungal genome assembly. I got very good results from SPAdes, but later discovered that SPAdes recommends an insert of 550-700 bp for 2x250 bp sequencing.

    I have tried ABySS, Velvet and CLC so far, but they did not give results as good as SPAdes. Do you have any opinions or suggestions on what else I should try? Or would it be fine to just use SPAdes?

    Thanks in advance and have a great weekend!

  • #2
    Discovar de novo was designed specifically for 2x250 bp reads, so you could give it a try. Of course, it is recommended to have inserts longer than the combined length of the two reads (500 bp in your case), but that goes for all assemblers and doesn't mean your assembly is invalid. The best way to improve the assembly would likely be to add long or linked reads, though.

    • #3
      @Vinn: Have you tried merging your reads (they must overlap in the middle) and then assembling them as a single-end dataset?

      • #4
        Originally posted by Vinn
        I have got data generated from 2x250 bp with 350 bp insert for fungal genome assembly. I got very good results from SPAdes, but later discovered that SPAdes recommends an insert of 550-700 bp for 2x250 bp sequencing.

        I think that means "If you want to use SPAdes for 2x250 bp reads, we recommend you target your libraries for 550-700 bp" rather than "If you have a library outside of 550-700 bp, don't use SPAdes". Once you have the library, it's too late to change the insert size, but SPAdes is very flexible with insert sizes.

        As Genomax mentioned, you might try merging the reads first; I have found that to improve SPAdes assemblies.

        • #5
          Dear Genomax,

          Thanks for your reply and for the suggestion. I was thinking about that too, but since I just got another PE library (with a different insert size), I am not sure whether SPAdes can handle one single-end library and one paired-end library together.

          Have a great Easter holiday!

          • #6
            Dear Ola,

            Thanks for your reply and for your suggestion. I just received another library with another insert size and will try using it to improve the one I have.

            Have a great Easter holiday!

            • #7
              Dear Brian,

              Thanks for your reply and for the suggestion. I will try merging and reassembling again.
              Happy Easter holiday!

              • #8
                Hi Vinn,

                SPAdes can handle one paired and one single-ended set of reads. I recommend that anyway, even with a single library: when you merge reads, not all of them will merge, so give SPAdes the merged reads as single-end input alongside the unmerged pairs.
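
A minimal sketch of the merge-then-assemble workflow described above, written as Python helpers that only build the command lines (nothing is executed). The file names and output directory are hypothetical placeholders. The `in1`/`in2`/`out`/`outu1`/`outu2` parameters of `bbmerge.sh` and the `-1`/`-2`/`-s`/`-o` options of `spades.py` are used as I understand them from the tools' documentation, so check them against your installed versions.

```python
def bbmerge_command(r1, r2, merged, unmerged_r1, unmerged_r2):
    """Build a BBMerge command that writes merged reads and unmerged pairs separately."""
    return [
        "bbmerge.sh",
        f"in1={r1}",
        f"in2={r2}",
        f"out={merged}",          # reads that overlapped and were merged
        f"outu1={unmerged_r1}",   # R1 of pairs that did not merge
        f"outu2={unmerged_r2}",   # R2 of pairs that did not merge
    ]


def spades_command(unmerged_r1, unmerged_r2, merged, outdir):
    """Build a SPAdes command using unmerged pairs as paired-end and merged reads as single-end."""
    return [
        "spades.py",
        "-1", unmerged_r1,
        "-2", unmerged_r2,
        "-s", merged,
        "-o", outdir,
    ]


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    merge = bbmerge_command("R1.fq.gz", "R2.fq.gz",
                            "merged.fq.gz", "unmerged_R1.fq.gz", "unmerged_R2.fq.gz")
    assemble = spades_command("unmerged_R1.fq.gz", "unmerged_R2.fq.gz",
                              "merged.fq.gz", "spades_out")
    print(" ".join(merge))
    print(" ".join(assemble))
```

A second paired-end library with a different insert size would be passed as an additional library; the SPAdes manual describes the numbered library options for that case.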

                • #9
                  Hi Brian,

                  Thank you very much; I will try as you suggested. That said, I keep wondering: what if I trim both the R1 and R2 reads to 150 bp using bbduk (ftr=150) and use them as a 2x150 bp paired-end library?
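
To make the trimming idea concrete, here is a small pure-Python sketch of what truncating each read to its first 150 bases does to a FASTQ record. It is only an illustration, not BBDuk's implementation; the record layout (a 4-tuple of header, sequence, separator, qualities) and the function name are assumptions made for this example. As far as I recall, the BBDuk documentation describes `ftr` as a 0-based position, so it may be worth confirming whether `ftr=149` rather than `ftr=150` yields exactly 150 bases.

```python
def trim_record(record, keep=150):
    """Truncate a FASTQ record (header, seq, plus, qual) to its first `keep` bases."""
    header, seq, plus, qual = record
    if len(seq) != len(qual):
        raise ValueError("sequence and quality strings differ in length")
    # Slicing past the end is safe: shorter reads are returned unchanged.
    return (header, seq[:keep], plus, qual[:keep])


if __name__ == "__main__":
    # Synthetic 250 bp read, for illustration only.
    read = ("@read1", "A" * 250, "+", "I" * 250)
    trimmed = trim_record(read)
    print(len(read[1]), "->", len(trimmed[1]))
```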

                  • #10
                    Hi Vinn,

                    You could certainly do that, but unless your sequence quality is very bad at the ends, it won't give you a better assembly; it will mainly just reduce your sequence volume. In my testing, SPAdes produces the best assemblies when you merge reads (if they are overlapping) and feed it both the merged and unmerged reads. Remember that SPAdes supports k-mers up to 127 bp; with 150 bp reads, the k-mer depth at k=127 will be quite low, whereas with 250 bp reads (or merged 350+ bp reads) it will be much higher, potentially resulting in a superior 127-mer assembly.
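
The k-mer depth argument can be checked with a little arithmetic. A read of length L contains L - k + 1 k-mers, so under the usual simplifying assumptions (uniform coverage, error-free reads, all reads the same length) k-mer depth is roughly base depth times (L - k + 1) / L. The sketch below applies that approximation; the function names are made up for this example.

```python
def kmers_per_read(read_len, k):
    """Number of k-mers contained in a single read of length read_len."""
    return max(read_len - k + 1, 0)


def kmer_depth_fraction(read_len, k):
    """Approximate k-mer depth as a fraction of base depth: (L - k + 1) / L."""
    return kmers_per_read(read_len, k) / read_len


if __name__ == "__main__":
    k = 127  # largest k-mer size mentioned in the post above
    for read_len in (150, 250, 350):
        print(read_len, kmers_per_read(read_len, k),
              round(kmer_depth_fraction(read_len, k), 3))
```

At k=127 a 150 bp read yields only 24 k-mers, a 250 bp read yields 124, and a 350 bp merged read yields 224. Trimming from 250 bp to 150 bp also discards 40% of the bases, so the loss in 127-mer depth relative to the untrimmed data is larger than the per-read fraction alone suggests.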
