Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Sspace scaffolder : does it take the "insert size" or the "fragment size"

    Hi all,

    I am trying to use SSpace for scaffolding my already assembled contigs:

    I wanted to know if the insert size column in the library.txt file for SSpace is the "insert size" exclusive of the read length or infact the fragment size (with the read length),

    cheers,

    Nandan

  • #2
    Hi Nandan,

    sorry I missed your post yesterday. The insert size in SSPACE is referred as the fragment size, so including the read length.

    Regards,
    Boetsie

    Comment


    • #3
      Thanks

      Thanks Boetsie..this solves my confusion :-)

      cheers,

      nandan

      Comment


      • #4
        Hi Boetsie,

        One more quick question..

        I am trying to use Hawkeye visualisation for displaying my "Scaffolds/contigs/reads" in tandem..

        1) I have used velvet for assembly and created a ".afg" file. (1 paired-end library)

        2) I have then used Space with the same set of paired-end reads to scaffold the contigs.

        Is there a way you can suggest to provide the scaffold relationship to Hawkeye in addition to the contig information in .afg file form velvet?

        appreciate your help,

        cheers,

        Nandan

        Comment


        • #5
          Hi Nandan,

          I'm sorry, but I've never used the Hawkeye software, so I can't comment on that. Maybe someone else can help you with this? For example the developers of Hawkeye?

          Regards,
          Boetsie

          Comment


          • #6
            Thanks

            Hi Boetsie,

            No worries..thanks .. I will check if anyone else has any suggestion and also get back to Hawkeye developers,,

            cheers,

            Nandan

            Comment


            • #7
              I've also been dealing with the problem of getting the scaffolds of SSPACE displayed in Hawkeye for assessment of assembly quality.

              From my investigations, it seems that you have to parse the output of the SSPACE evidence file into a series of scaffold (SCF) and supporting contig edge (CTE) records for inclusion in the AMOS afg file that you load into Hawkeye.

              I'm working on a script that will hopefully accomplish this, but it's still early days. If I get it ironed out, I'll be happy to share it.

              Cheers,

              Anthony

              Comment


              • #8
                Thanks

                Hi Anthony,

                thanks for your response.. I will appreciate if u can share the script when you are ready with it.. I will also work from my end to check if I can get any solution ..now that there does not seem to be a readily available script/tool,

                cheers,

                Nandan

                Comment


                • #9
                  Hi Boetsie,

                  I have a question about a specific parameter in Sspace:

                  I am using SSPACE-BASIC-2.0_linux-x86_64

                  The ‘–m’ minimum overlap
                  ---------------
                  Minimum number of overlapping bases of the reads with the contig
                  during overhang consensus build up. Higher ‘-m’ values lead to more
                  accurate contigs at the cost of decreased contiguity. We suggest to take
                  a value close to the largest read length. For example, for a library with
                  36bp reads, we suggest to use a -m value between 32 and 35 for reliable contig extension.

                  Since I am using a library from illumina with a read length 102, I was trying to use m=90 but I could see from the error report that the maximum allowable value of m=50.

                  How do I get over this problem? Appreciate your assistance.

                  cheers,

                  nandan

                  Comment


                  • #10
                    Hi Nandan,

                    You could get a work-around for this by removing the number in the SSPACE main file (SSPACE_Basic_v2.0.pl). Please change this line in the code;

                    die "ERROR: -m must be a number between 15-50. Your inserted -m is $min_overlap ...Exiting.\n" if(!($min_overlap =~ /^\d+$/) || $min_overlap < 10 || $min_overlap > 50);

                    Set the '> 50' to your liking.

                    Regards,
                    Boetsie

                    Originally posted by ndeshpan View Post
                    Hi Boetsie,

                    I have a question about a specific parameter in Sspace:

                    I am using SSPACE-BASIC-2.0_linux-x86_64

                    The ‘–m’ minimum overlap
                    ---------------
                    Minimum number of overlapping bases of the reads with the contig
                    during overhang consensus build up. Higher ‘-m’ values lead to more
                    accurate contigs at the cost of decreased contiguity. We suggest to take
                    a value close to the largest read length. For example, for a library with
                    36bp reads, we suggest to use a -m value between 32 and 35 for reliable contig extension.

                    Since I am using a library from illumina with a read length 102, I was trying to use m=90 but I could see from the error report that the maximum allowable value of m=50.

                    How do I get over this problem? Appreciate your assistance.

                    cheers,

                    nandan

                    Comment


                    • #11
                      Thanks

                      Thanks Boetsie..Appreciate your help.

                      cheers,

                      Nandan

                      Comment


                      • #12
                        Hi Boetsie,

                        I want to use SSPACE to scaffolds my assembly. I did hybrid assembly using Cerulean with Illumina HiSeq and PacBio. However, I don't know where can I get the standard deviation for the reads. I want to scaffold my assembly using Illumina reads. According to the NGS report that I got from the sequencing company, the insert size for my illumina reads is 500bp and the reads length is 90bp. So the number of my fragment size should be 590bp. Is there any software that I can use to get the standard deviation of the reads?

                        Thank you.

                        Comment


                        • #13
                          standard deviation-libraries SSPACE

                          Hi Boetsie,

                          I want to use SSPACE to scaffolds my assembly. I did hybrid assembly using Cerulean with Illumina HiSeq and PacBio. However, I don't know where can I get the standard deviation for the reads. I want to scaffold my assembly using Illumina reads. According to the NGS report that I got from the sequencing company, the insert size for my illumina reads is 500bp and the reads length is 90bp. So the number of my fragment size should be 590bp. Is there any software that I can use to get the standard deviation of the reads?

                          Thank you.

                          Comment


                          • #14
                            Align the reads to your contigs and run Picard's CollectInsertSizeMetrics: http://broadinstitute.github.io/pica...ertSizeMetrics

                            Comment

                            Latest Articles

                            Collapse

                            • seqadmin
                              Strategies for Sequencing Challenging Samples
                              by seqadmin


                              Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                              03-22-2024, 06:39 AM
                            • seqadmin
                              Techniques and Challenges in Conservation Genomics
                              by seqadmin



                              The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                              Avian Conservation
                              Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                              03-08-2024, 10:41 AM

                            ad_right_rmr

                            Collapse

                            News

                            Collapse

                            Topics Statistics Last Post
                            Started by seqadmin, Yesterday, 06:37 PM
                            0 responses
                            12 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, Yesterday, 06:07 PM
                            0 responses
                            10 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, 03-22-2024, 10:03 AM
                            0 responses
                            51 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, 03-21-2024, 07:32 AM
                            0 responses
                            68 views
                            0 likes
                            Last Post seqadmin  
                            Working...
                            X