Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Velveth and velvetg.

    Hello Everyone,

    This is the first time I am using Velveth and velvetg.
    I have around 5 million single end read, which has 50-300bp long sequences.
    I used below cmd, and it work jusdt fine.
    # velveth auto 31,45,2 -fastq -short -inputfile
    output# it gave me 7 file with kmer length 31,33,35,36,39,41 &43.

    Can anybody please give me suggestion about which kmer length to select, do i need to use long or short read in command?

    How do I execute velvetg cmd?
    What cutoff and min_contig_length to use?

    Thanks in advance!

  • #2
    You need to run velvetg 7 different times to test all the different k-mer lengths

    minimum contig length for most things will be at least 100 possibly more depending on what the intended use of the data is. Generally really short contigs <100 are more trouble than they are worth for most projects, although they might be useful to suggest complexities like duplications or tandem repeats in the assembly.

    basically the command should be
    # velvetg {file name of the 7 folders created in velvet h}

    based on the number of reads it should take only a few minutes even on a desktop for each run.

    Generally it is hard to tell from the information given what kmer is best in advance, generally longer is better but not always. The best kmer is generally the one that puts the most bp in the fewest contigs in the assembly results. The program makes it easy to test multiple kmers.

    Coverage cutoff is really unknown depending on what your source data is, basically it depends on the expected depth of coverage. Probably not necessary to mess with it.

    As for short/long it is a command for velveth and you would have to re-run that with the -long option as -short is only for illumina reads generally <200. Short may work ok but it is unclear on a 300bp read.

    Comment


    • #3
      Originally posted by jebowers View Post
      You need to run velvetg 7 different times to test all the different k-mer lengths

      minimum contig length for most things will be at least 100 possibly more depending on what the intended use of the data is. Generally really short contigs <100 are more trouble than they are worth for most projects, although they might be useful to suggest complexities like duplications or tandem repeats in the assembly.

      basically the command should be
      # velvetg {file name of the 7 folders created in velvet h}

      based on the number of reads it should take only a few minutes even on a desktop for each run.

      Generally it is hard to tell from the information given what kmer is best in advance, generally longer is better but not always. The best kmer is generally the one that puts the most bp in the fewest contigs in the assembly results. The program makes it easy to test multiple kmers.

      Coverage cutoff is really unknown depending on what your source data is, basically it depends on the expected depth of coverage. Probably not necessary to mess with it.

      As for short/long it is a command for velveth and you would have to re-run that with the -long option as -short is only for illumina reads generally <200. Short may work ok but it is unclear on a 300bp read.
      Thanks Jebowers,
      In velvet manual it says it doesn't matter if you select long or short option.

      I really appreciate your feedback!

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Techniques and Challenges in Conservation Genomics
        by seqadmin



        The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

        Avian Conservation
        Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
        03-08-2024, 10:41 AM
      • seqadmin
        The Impact of AI in Genomic Medicine
        by seqadmin



        Artificial intelligence (AI) has evolved from a futuristic vision to a mainstream technology, highlighted by the introduction of tools like OpenAI's ChatGPT and Google's Gemini. In recent years, AI has become increasingly integrated into the field of genomics. This integration has enabled new scientific discoveries while simultaneously raising important ethical questions1. Interviews with two researchers at the center of this intersection provide insightful perspectives into...
        02-26-2024, 02:07 PM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 03-14-2024, 06:13 AM
      0 responses
      32 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 03-08-2024, 08:03 AM
      0 responses
      71 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 03-07-2024, 08:13 AM
      0 responses
      80 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 03-06-2024, 09:51 AM
      0 responses
      68 views
      0 likes
      Last Post seqadmin  
      Working...
      X