Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • velvet

    hi After running velvet assembly I got Estimated Coverage = 25.12000
    Estimated Coverage cutoff = 10.250000
    Final graph has 11106 nodes and n50 of 498, max 13308, total 243547, using 221137/2500000 reads, so I have 3 questions here.

    1) what does this max 13308 ( It is the maximum contig length obtained)
    2)what does this total 243547 refers to?
    3) should I run the velvet again by using the obtained estimated cov cutoff and estimated coverege (for eg here should I run velvet again using cov_cutoff 10
    est_coverege 25)

    can any one answer this???

    thanks

  • #2
    1) Yes, it's the length of the longest contig you got.
    2) It's the sum of the length of all your contigs.
    3) You should probably try out the velvetOptimizer.pl script that is distributed along Velvet. Or you can join the Velvet mailing list and ask there.
    L. Collado Torres, Ph.D. student in Biostatistics.

    Comment


    • #3
      using 221137/2500000 reads
      that is a very low read usage - i would focus on figuring out why so few of your reads are making it into the assembly. Maybe they need to be trimmed further?
      --
      Jeremy Leipzig
      Bioinformatics Programmer
      --
      My blog
      Twitter

      Comment


      • #4
        hi zingster thnx for ur reply ...but my read length is 75 bp, do u think it should be trimmed further.

        Comment


        • #5
          Trim on quality, not length
          Joseph Fass's script does soft trimming - try 10, 15, and 20

          --
          Jeremy Leipzig
          Bioinformatics Programmer
          --
          My blog
          Twitter

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Current Approaches to Protein Sequencing
            by seqadmin


            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
            04-04-2024, 04:25 PM
          • seqadmin
            Strategies for Sequencing Challenging Samples
            by seqadmin


            Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
            03-22-2024, 06:39 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 04-11-2024, 12:08 PM
          0 responses
          24 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 10:19 PM
          0 responses
          25 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 09:21 AM
          0 responses
          22 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-04-2024, 09:00 AM
          0 responses
          52 views
          0 likes
          Last Post seqadmin  
          Working...
          X