  • human genome short read data for "Do it yourself genetic testing"

    Hi all!

    I would really like to try a certain program (http://www.cbcb.umd.edu/software/BRCA-diagnostic/) because in some ways it's similar to a project I'm working on. My problem is that I can't find the right data to try it on. It needs the genome of an individual (so not the reference genome) as raw short read data. I'm not really familiar with these things, so I would really appreciate it if someone could tell me where to find DNA sequences that fit this description.
    The program uses the Bowtie short read aligner, so in other words, I guess I need short reads from a human genome that Bowtie can align.
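
    For anyone checking whether a downloaded file really is raw short read data: Bowtie accepts reads in FASTQ format, among others. Below is a minimal Python sketch, assuming a plain, uncompressed FASTQ file with four lines per record. The function names are illustrative only and are not part of Bowtie or the BRCA program. It parses the records and tallies the read lengths.

```python
import io
from collections import Counter


def read_fastq(handle):
    """Yield (name, sequence, quality) tuples from a four-line-per-record FASTQ stream.

    Assumption: sequences are not wrapped across multiple lines.
    """
    while True:
        header = handle.readline()
        if not header:
            return  # end of file
        header = header.rstrip("\r\n")
        if not header:
            continue  # tolerate blank lines between records
        seq = handle.readline().rstrip("\r\n")
        plus = handle.readline().rstrip("\r\n")
        qual = handle.readline().rstrip("\r\n")
        if not header.startswith("@") or not plus.startswith("+"):
            raise ValueError(f"malformed FASTQ record near {header!r}")
        if len(seq) != len(qual):
            raise ValueError(f"sequence/quality length mismatch in {header!r}")
        yield header[1:], seq, qual


def read_length_histogram(handle):
    """Return a Counter mapping read length to number of reads."""
    return Counter(len(seq) for _, seq, _ in read_fastq(handle))


if __name__ == "__main__":
    # Tiny in-memory example standing in for a real file.
    example = "@r1\nACGT\n+\nIIII\n@r2\nACGTAC\n+\nIIIIII\n"
    print(read_length_histogram(io.StringIO(example)))
```

    Passing an open file handle instead of the in-memory example gives a quick view of how long the reads in a dataset are.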

    Yours,
    Attila
    Last edited by attilav; 08-18-2011, 04:13 AM.

  • #2
    You can find tons of datasets in the NCBI Sequence Read Archive (SRA, formerly the Short Read Archive) and the European Nucleotide Archive.

    You can also find already-aligned short read datasets (in BAM format) all over the place. The 1000 Genomes Project is sequencing many individuals at low coverage, so its data might not be good for you, but Watson's genome is available, as are many others (Venter's is available as long reads). Complete Genomics has made a large number of human genomes available on its website. The Personal Genome Project should have files up as well.

    (aside: perhaps there should be a wiki section on repositories of human and other genome alignments)



    • #3
      Unfortunately Bowtie is not optimized for Complete Genomics data. Specifically, Complete Genomics reads have sub-read gaps that Bowtie will interpret as mismatches. The high mismatch frequency will prevent Bowtie from successfully aligning many reads to the reference.
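
      To illustrate why an unmodelled gap inside a read shows up as many mismatches, here is a toy Python sketch. It is an assumption-laden illustration only: the reference string, the single 3-base gap, and the function names are all made up, and it reproduces neither the actual Complete Genomics read structure nor Bowtie's algorithm. It compares a gapless placement of a read against a placement that knows about the gap.

```python
def count_mismatches(read, reference, offset=0):
    """Count mismatches when `read` is laid on `reference` at `offset` with no gaps."""
    window = reference[offset:offset + len(read)]
    if len(window) < len(read):
        raise ValueError("read extends past the end of the reference")
    return sum(a != b for a, b in zip(read, window))


def count_mismatches_with_gaps(subreads, gaps, reference, offset=0):
    """Place sub-reads one after another, skipping gaps[i] reference bases after sub-read i."""
    total = 0
    pos = offset
    for i, sub in enumerate(subreads):
        total += count_mismatches(sub, reference, pos)
        pos += len(sub)
        if i < len(gaps):
            pos += gaps[i]
    return total


# Hypothetical toy data: two 10-base sub-reads separated by a 3-base gap.
REFERENCE = "ACGTTGCAAGCTTAGGCATCGATCCGTA"
LEFT = REFERENCE[0:10]
RIGHT = REFERENCE[13:23]
GAPPED_READ = LEFT + RIGHT  # what the read looks like if the gap is ignored

if __name__ == "__main__":
    print("gapless placement:  ", count_mismatches(GAPPED_READ, REFERENCE))
    print("gap-aware placement:", count_mismatches_with_gaps([LEFT, RIGHT], [3], REFERENCE))
```

      In this toy case the gapless placement matches the first sub-read perfectly but is shifted for everything after the gap, whereas the gap-aware placement has no mismatches.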

      If you are interested in working with SNP genotypes and Complete Genomics data, we strongly recommend using the Complete Genomics-developed snpdiff command in our open source CGA Tools package (http://cgatools.sourceforge.net/). This tool is specifically designed to extract SNP genotypes from Complete Genomics data, and to compare Complete Genomics genotype calls with SNP genotypes generated on other platforms.
      Shaun Cordes, PhD | Customer Support Scientist | Complete Genomics, Inc.
      Toll-free: (855) 267-5358 | Direct: (650) 943-2651
