Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Fungal genome de novo assembly and scaffolding

    We have data of yeast (Yarrowia)sequenced using Illumina paired-end. Each read R1 and R2 has ~41 million reads. Read length is 150 and insert size is 300. I have used velvet for de novo assembly. The highest value of N50: 22726 was obtained for Kmer 93. Total contigs are ~1900. Scaffolding was done using Contiguator. Gapfiller was used to fill the gaps. Closest Yarrowia homolog has a length of ~ 20MB. Using our data I got after gapfilling ~17MB. How to fill the gap of 3MB ? Eagerly awaiting your inputs

  • #2
    I suggest you try assembling with Spades; it often yields better continuity than Velvet.

    Comment


    • #3
      Why do you expect the genomes to have the exact same sizes? "Closest" sequenced homolog can still be far far away. I would say you go for more meaningful assembly quality assessment criteria.
      Alternatively, you could just randomly try assemblers until you find one that gives you ~20MB

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Essential Discoveries and Tools in Epitranscriptomics
        by seqadmin


        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
        Yesterday, 07:01 AM
      • seqadmin
        Current Approaches to Protein Sequencing
        by seqadmin


        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
        04-04-2024, 04:25 PM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 04-11-2024, 12:08 PM
      0 responses
      39 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 10:19 PM
      0 responses
      41 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 09:21 AM
      0 responses
      35 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-04-2024, 09:00 AM
      0 responses
      55 views
      0 likes
      Last Post seqadmin  
      Working...
      X