Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • gumbos
    Junior Member
    • Feb 2011
    • 6

    Targeted Genome Assembly for region poorly represented in reference genome?

    Hello,

    I am not sure if this is the best place to put this, but any help is appreciated.

    I am working on identifying candidate genes for a mutation in a subtelomeric region of Zebrafish. The Zv9 Assembly is not great, and is particularly bad in this region. The region contains an improperly placed clone with WGS contigs surrounding it, which contain genes known to be deleted in one of the mutant alleles but are not causative. The reference in this region is so poor that my best bet is to attempt to re-build it. I am already in contact with the people at Sanger in regards to this, but I also have quite a bit of differential RNAseq data as well as WGS data of both a different mutant allele (that causes the same phenotype but was generated via ENU mutagenesis and so is likely a point mutation).

    Does anyone have any ideas for the best way to assemble this region in a targeted fashion? I have access to a 32GB ram fairly powerful computer, but this obviously is not enough to take the large amount of WGS data I have and assemble with velvet. Even then, I feel that the velvet assembly would only give me at best 5kb contigs that won't be much more effective than what is already available. I have considered trying to align sequences to the region as it is known with bowtie then assembling those reads only, but the reference is so poor that I don't think this will be effective either (multiple genes we know to be deleted are not represented in Zv9 in any fashion, or are only partially represented, or are represented split far apart with opposite strandedness).

    Thanks in advance for any assistance or advice.
  • krobison
    Senior Member
    • Nov 2007
    • 734

    #2
    Yuck! You certainly have an ugly situation.

    I think if I were in your shoes I would try as many different approaches as possible to identify paired end reads where at least one of the pairs can be mapped to the region (existing assembly, mapping to your RNA-Seq data that you think is in the region, etc), then try assembling that with Velvet or another assembler. Then use that assembly to identify additional paired end reads mapping in & repeat. Keep cycling until things don't seem to be getting better.

    Is there a publicly available BAC or cosmid library for zebrafish? I doubt you'll get anything like what you want without some long-range sequence information, and pulling out a big clone for the region could be a bunch of work but is one of the more obvious ways to go about it. Alternatively, have you tried making mate pair libraries for the whole genome?

    Comment

    Latest Articles

    Collapse

    • GATTACAT
      Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
      by GATTACAT
      Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
      Yesterday, 11:43 AM
    • SEQadmin2
      Nine Things a Sample Prep Scientist Thinks About Before Sequencing
      by SEQadmin2


      I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

      Here are nine questions we think about, in roughly the order they matter, before...
      06-18-2026, 07:11 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by SEQadmin2, Today, 11:08 AM
    0 responses
    6 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-30-2026, 05:37 AM
    0 responses
    11 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-26-2026, 11:10 AM
    0 responses
    19 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-17-2026, 06:09 AM
    0 responses
    53 views
    0 reactions
    Last Post SEQadmin2  
    Working...