  • de novo assembly vs. reference assembly

    Hi,

    I would like to know if anyone has experience comparing a local de novo assembly with a reference-based assembly and measuring which one is better.

    I have mapped genomic Illumina reads to a reference genome. Since I'm interested in a 1Mb region of one of the chromosomes, I then used a de novo assembler to assemble the reads that mapped to that region. I now have about 6000 contigs ranging in size from 500bp to 30kb, and I would like to:
    1- Visualize their positions relative to the original 1Mb region
    2- Be able to say whether the local de novo assembly is better (or worse) than simply mapping my reads to the reference assembly.

    Many thanks

  • #2
    1- For example, Mauve Contig Mover: http://gel.ahabs.wisc.edu/mauve/
    2- What is your definition of 'better' and 'worse'?
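
One way to put a number on point 1, whatever tool does the alignment: once each contig has been placed on the 1Mb region, the placements are just intervals, and you can ask how much of the region they cover and where the gaps are. This is only a minimal sketch. It assumes the placements have already been extracted as `(contig_name, start, end)` tuples in 0-based, half-open region coordinates; the function names and that input layout are hypothetical, not the output format of Mauve or any other tool.

```python
def merge_intervals(intervals):
    """Merge overlapping or touching half-open (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Extend the previous interval instead of starting a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def region_coverage(placements, region_length):
    """Return (covered_fraction, gaps) for contigs placed on a region.

    placements: iterable of (contig_name, start, end), 0-based half-open,
                in region coordinates (an assumed, hypothetical layout).
    """
    # Keep only placements that overlap the region, clipped to its bounds.
    intervals = [
        (max(0, start), min(region_length, end))
        for _, start, end in placements
        if end > 0 and start < region_length and start < end
    ]
    merged = merge_intervals(intervals)
    covered = sum(end - start for start, end in merged)

    # Walk the merged intervals and record the uncovered stretches.
    gaps = []
    cursor = 0
    for start, end in merged:
        if start > cursor:
            gaps.append((cursor, start))
        cursor = end
    if cursor < region_length:
        gaps.append((cursor, region_length))

    return covered / region_length, gaps
```

For a 1Mb region you would call `region_coverage(placements, 1_000_000)`. A low covered fraction or many gaps would suggest the contigs do not reconstruct the region well, although this says nothing about whether the covered parts are correct.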

    • #3
      This is pretty much what Complete Genomics does. They align to the reference and identify positions where they detect a variant, then do local de novo assembly over the variant. It does seem to increase specificity in particular (by excluding potential false positives that disappear after de novo assembly).

      That said, having compared the two approaches myself, it does not appear to be worth the effort for the relatively long reads you'll get from an Illumina instrument: assembly is computationally expensive, and it doesn't really seem to increase sensitivity that much.

      • #4
        Thanks for the replies. I will look into the Mauve tool.

        By a 'better' local de novo assembly vs. reference assembly, I mean this: consider a region of the genome that shows many SNPs, indels, etc. when reads are mapped to the reference assembly. That might be because it is a hyperpolymorphic region where the reference genome is very different from the sample DNA you are analyzing. In these circumstances I would choose a local de novo assembly.

        Now, the most important question is where to set the threshold above which you consider a region to have "many" variants. That is almost a rhetorical question, I guess...
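
To make the threshold question concrete, one option is to count variants in fixed-size windows and flag the windows at or above a cutoff. The sketch below assumes variant positions are already available as 0-based integers within the region. The function name, the window size and the `min_variants` cutoff are all hypothetical placeholders; choosing the cutoff is exactly the open question, and nothing here answers it.

```python
def flag_dense_windows(positions, region_length, window=1000, min_variants=10):
    """Return (counts, dense) for variants binned into fixed-size windows.

    positions:     0-based variant positions within the region (assumed input).
    window:        window size in bp (placeholder value).
    min_variants:  cutoff for calling a window 'dense' (placeholder value).
    """
    n_windows = (region_length + window - 1) // window  # ceiling division
    counts = [0] * n_windows
    for pos in positions:
        if 0 <= pos < region_length:  # ignore positions outside the region
            counts[pos // window] += 1

    # Report dense windows as half-open intervals, clipped to the region end.
    dense = [
        (i * window, min((i + 1) * window, region_length))
        for i, count in enumerate(counts)
        if count >= min_variants
    ]
    return counts, dense
```

Rerunning this with several cutoffs shows how sensitive the set of "hyperpolymorphic" windows is to the choice, which may be more informative than any single threshold.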
