Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • cliff
    Member
    • Oct 2009
    • 41

    where to download hg19?

    Dear All

    I am wondering where to download hg19 reference files. I need to map my illumina reads to hg19 by using BWA.

    All your help will be appreciated.

    -C
  • thaley
    Junior Member
    • Jul 2010
    • 4

    #2


    I get my references from UCSC.

    Cheers

    Comment

    • cliff
      Member
      • Oct 2009
      • 41

      #3
      Thanks, Thaley, I just found that page two. Here is a question, how to use twoBitToFa to convert hg19.2bit to hg19.fa?

      I just tried

      ./twoBitToFa hg19.2bit hg19.fa

      but it said "Floating point exception"..

      Comment

      • thaley
        Junior Member
        • Jul 2010
        • 4

        #4
        Hmm.. You followed the directions on UCSC for the tool - build the source, etc?

        Honestly, I got my references in .fa format before they started using this 2bit format. Sorry I can't be more help.

        Off hand, I would double check the downloaded file to make sure it's not truncated and be sure the source for 2bit is building successfully.

        ...or if someone knows of an alternate location to get the .fa files, that would be the easiest.

        Comment

        • aleferna
          Senior Member
          • Sep 2009
          • 121

          #5
          Try this one its one file per chromo

          Comment

          • cliff
            Member
            • Oct 2009
            • 41

            #6
            Aleferna

            Thanks, I have that one too. I am thinking of trying

            cat chr*.fa > hg19.fa

            But I am just not sure whether this concatenated hg19.fa is different from the one converted from hg19.2bit...

            Comment

            • aleferna
              Senior Member
              • Sep 2009
              • 121

              #7
              it should be the same, but check if they have the M chromosome and the haploids, that, I'm not sure, you might have to separate those before doing the cat.

              Comment

              • mard
                Member
                • Jan 2010
                • 21

                #8
                I used the 1000 genomes hg19 reference sequence from:

                ftp://ftp.sanger.ac.uk/pub/1000genom...k_v37.fasta.gz

                They already have the haplotype chromosomes removed.

                Comment

                • cliff
                  Member
                  • Oct 2009
                  • 41

                  #9
                  mard:

                  Thanks for your response! Is this 1000 genome hg19 reference sequence different from that one from UCSC? All the files I have been using were downloaded from UCSC and I hope there won't be any discrepancy between those different versions of hg19.

                  Thanks

                  -C

                  Comment

                  • mfischer
                    Junior Member
                    • Mar 2010
                    • 9

                    #10
                    Hi cliff,

                    according to ftp://ftp.1000genomes.ebi.ac.uk/vol1...k_v37.fasta.gz the 1000 genomes hg19 reference was built as follows:

                    10th October 2009

                    Here are the steps used to produce this version of the human reference sequence to be used for the
                    main production project of the 1000 Genomes.

                    1. Download individual chrs from ensembl ftp

                    ftp://ftp.ensembl.org/pub/current_fa...o_sapiens/dna/

                    2. Download the newer version of the MT (NC_012920) from:



                    3. Create a reference with chrs1-22, X, Y, NC_012920 MT, and include the non-chromosomal supercontigs. The new single fasta is posted:

                    ftp://ftp.sanger.ac.uk/pub/1000genom...ect_reference/
                    UCSC states in http://genome.ucsc.edu/cgi-bin/hgGateway:
                    Note on chrM
                    Since the release of the UCSC hg19 assembly, the Homo sapiens mitochondrion sequence (represented as "chrM" in the Genome Browser) has been replaced in GenBank with the record NC_012920. We have not replaced the original sequence, NC_001807, in the hg19 Genome Browser. We plan to use the Revised Cambridge Reference Sequence (rCRS) in the next human assembly release.
                    Besides UCSC's older version of the mitochondrion sequence and in the included haploids, the 1000 genomes reference should be identical to UCSC.

                    Cheers

                    Comment

                    Latest Articles

                    Collapse

                    • SEQadmin2
                      Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                      by SEQadmin2


                      I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                      Here are nine questions we think about, in roughly the order they matter, before...
                      06-18-2026, 07:11 AM
                    • SEQadmin2
                      From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                      by SEQadmin2


                      Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                      The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                      ...
                      06-02-2026, 10:05 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by SEQadmin2, Today, 05:37 AM
                    0 responses
                    5 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-26-2026, 11:10 AM
                    0 responses
                    16 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-17-2026, 06:09 AM
                    0 responses
                    49 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-09-2026, 11:58 AM
                    0 responses
                    109 views
                    0 reactions
                    Last Post SEQadmin2  
                    Working...