Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • velvet assemblies

    Dear all,
    I am doing an assembly of 40 Mb genome with expected coverage of 181x. I am using Illumina reads 76bp length with insert size 200 bp (Sd 20 bp). I have tried velvet for these assemblies and 86-99% of reads were used in this assembly with N50 of 80kb (with k-mer's 21,55,2). But the strange thing is that I am getting only 19 Mb genome after all assemblies. According to the staff whole genome has been covered during the library preparations.

    And secondly, should I turn off the velvet scaffolding before minimus2 assemblies/scaffolding??

    I would appreciate your suggestions.

    Thanks in advance
    Rahul
    Last edited by rahularjun86; 01-31-2012, 02:55 AM.
    Rahul Sharma,
    Ph.D
    Frankfurt am Main, Germany

  • #2
    Assuming the genome is truly 40Mb, the most likely explanation is the presence of repeats.

    Try doing a plot of contig length versus k-mer coverage which will help you visualise the repeat structure of your genome (use the values in stats.txt produced by Velvet).

    Comment


    • #3
      Also is your genome size calculation based on a haploid or diploid genome?

      Comment


      • #4
        Dear Nickloman, thank you so much for your reply. I am plotting the contig length vs k-mer coverage. It is haploid genome.
        Thanks,
        Rahul
        Rahul Sharma,
        Ph.D
        Frankfurt am Main, Germany

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin


          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
          Yesterday, 07:01 AM
        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        39 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        41 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 09:21 AM
        0 responses
        35 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-04-2024, 09:00 AM
        0 responses
        55 views
        0 likes
        Last Post seqadmin  
        Working...
        X