Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Please help with assembly 2x250 bp with 350 insert

    Hi folks,

    I have got data generated from 2x250 bp with 350 bp insert for fungal genome assembly. I got very good results from SPAdes, but later discovered that SPAdes recommends an insert of 550-700 bp for 2x250 bp sequencing.

    I have tried Abyss, Velvet and CLC so far, but they did not give as good results as SPAdes. Do you have opinion or suggestion what else I shall try? Or would it be just okay to use SPAdes?

    Thanks in advance and have a great weekend!

  • #2
    Discovar de novo was designed specifically for 2x250 bp reads so you could give it a try. Of course it is recommended to have inserts longer than the sequencing reads (500 bp in your case) but it goes for all assemblers and doesn't mean your assembly is not valid. Best way to improve the assembly would likely be to add long/linked reads though.

    Comment


    • #3
      @Vinn: Have you tried to merge your reads (since they must overlap in the middle) and then try assembly as a single end dataset?

      Comment


      • #4
        Originally posted by Vinn View Post
        I have got data generated from 2x250 bp with 350 bp insert for fungal genome assembly. I got very good results from SPAdes, but later discovered that SPAdes recommends an insert of 550-700 bp for 2x250 bp sequencing.
        I think that means "If you want to use SPAdes for 2x250bp reads, we recommend you target you libraries for 550-700bp" rather than "If you have a library outside of 550-700bp, don't use SPAdes". Once you have the library, it's too late, but SPAdes is very flexible with insert sizes.

        As Genomax mentioned, you might try merging the reads first; I have found that to improve SPAdes assemblies.

        Comment


        • #5
          Dear Genomax,

          Thanks for your reply and for the suggestion. I was thinking about that too, but since I just got another PE library (with another insert size), I am not sure if SPAdes can handle one single read and one paired-end?

          Have a great Easter holiday!

          Comment


          • #6
            Dear Ola,

            Thanks for your reply and for your suggestion. I just received another library with another insert size and will try using it to improve the one I have.

            Have a great Easter holiday!

            Comment


            • #7
              Dear Brian,

              Thanks for your reply and for the suggestion. I will try merging and reassembling again.
              Happy Easter holiday!

              Comment


              • #8
                Hi Vinn,

                SPAdes can handle one paired and one single-ended set of reads. I recommend that anyway when using a single library and merging reads, because not all the reads will merge.

                Comment


                • #9
                  Hi Brian,

                  Thank you very much; I will try as you suggested. Anyway, I couldn't stop wondering what if I trim both R1 and R2 reads to 150 bp using bbduk (ftr=150), and use them as a 150PE?

                  Comment


                  • #10
                    Hi Vinn,

                    You could certainly do that, but unless your sequence quality is very bad at the ends, it won't give you a better assembly; it will mainly just reduce your sequence volume. In my testing, SPAdes produces the best assemblies when you merge reads (if they are overlapping) and feed it both the merged and unmerged reads. Remember that SPAdes supports kmers up to 127bp; with 150bp reads, the kmer depth at k=127 will be quite low. Whereas with 250bp reads (or merged 350+bp reads) it will be much higher, potentially resulting in a superior 127-mer assembly.

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Advancing Precision Medicine for Rare Diseases in Children
                      by seqadmin




                      Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                      12-16-2024, 07:57 AM
                    • seqadmin
                      Recent Advances in Sequencing Technologies
                      by seqadmin



                      Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                      Long-Read Sequencing
                      Long-read sequencing has seen remarkable advancements,...
                      12-02-2024, 01:49 PM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, 12-17-2024, 10:28 AM
                    0 responses
                    34 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 12-13-2024, 08:24 AM
                    0 responses
                    49 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 12-12-2024, 07:41 AM
                    0 responses
                    35 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 12-11-2024, 07:45 AM
                    0 responses
                    46 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X