Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • laura
    Senior Member
    • Sep 2008
    • 151

    #16
    There are no precalculated AFs per sub population. You can calculate AN and AC numbers for each sub population and use that to work out an AF though

    Comment

    • dilly.desilva
      Junior Member
      • Jan 2011
      • 2

      #17
      Thank you Laura,

      Where would I get the AC and AN for the separate subpopulations of the 628 individuals from? Its not on the merged SNP set is it?

      Comment

      • andrehorta
        Member
        • Jan 2011
        • 14

        #18
        Hi.

        I'm newer whith genome. I need yours help. I was downloaded a sequence_read from 1000 genome project (ftp://ftp-trace.ncbi.nih.gov/1000gen.../data/HG00096/), and i sow two foldres, alignment and sequence_read. Wich this folders has a genome? And what's the diference about fastq, fasta, sra and ers? Wich this is genome?

        Comment

        • laura
          Senior Member
          • Sep 2008
          • 151

          #19
          The sequence_read dir contains the raw sequence reads that have been produced for a particular individual these are in fastq format.
          The alignment dir contains alignment files in bam format which aligns the raw reads to a reference genome (in this case GRCh37).

          There is more information about this data

          1000genomes.org is your first and best source for all of the information you’re looking for. From general topics to more of what you would expect to find here, 1000genomes.org has it all. We hope you find what you are searching for!


          thanks
          Last edited by laura; 01-13-2011, 07:59 AM.

          Comment

          • laura
            Senior Member
            • Sep 2008
            • 151

            #20
            Originally posted by dilly.desilva View Post
            Thank you Laura,

            Where would I get the AC and AN for the separate subpopulations of the 628 individuals from? Its not on the merged SNP set is it?

            I am afraid you will have to calculate that yourself. The population for each sample is described in ftp://ftp.1000genomes.ebi.ac.uk/vol1...0804.ALL.panel

            thanks

            Comment

            • andrehorta
              Member
              • Jan 2011
              • 14

              #21
              Hi Laura!

              Thank you very much with your attention! I need:

              1) Download one genome from 1000 genomes project
              2) I need use BRCA-DIAGNOSTIC or/and BOWTIE (i know how i use them, i follow the tutorial). I need to download other files to use BOWTIE?

              Obs: I have linux, perl and other, the BOWTIE and BRCA-DIAGNOSTIC is run and ok in my computer.

              Thank and sorry.

              Comment

              • andrehorta
                Member
                • Jan 2011
                • 14

                #22
                Originally posted by laura View Post
                The sequence_read dir contains the raw sequence reads that have been produced for a particular individual these are in fastq format.
                The alignment dir contains alignment files in bam format which aligns the raw reads to a reference genome (in this case GRCh37).

                There is more information about this data

                1000genomes.org is your first and best source for all of the information you’re looking for. From general topics to more of what you would expect to find here, 1000genomes.org has it all. We hope you find what you are searching for!


                thanks
                How can i use this data with BOWTIE and BRCA-DIAGNOSTIC? What they will produce?

                Comment

                • laura
                  Senior Member
                  • Sep 2008
                  • 151

                  #23
                  If you want to run alignments you need to download the data from the sequence read directory and align it to the genome.

                  I don't know how the program BRCA-Diagnotic works but it may be that you can just download the bam files from the alignment directory and work with those and then you don't need to run bowtie at all

                  I suspect you are likely to be more interested in the already discovered variants we released in November

                  ftp://ftp.1000genomes.ebi.ac.uk/vol1...lease/2010_11/

                  Comment

                  • andrehorta
                    Member
                    • Jan 2011
                    • 14

                    #24
                    Originally posted by laura View Post
                    If you want to run alignments you need to download the data from the sequence read directory and align it to the genome.

                    I don't know how the program BRCA-Diagnotic works but it may be that you can just download the bam files from the alignment directory and work with those and then you don't need to run bowtie at all

                    I suspect you are likely to be more interested in the already discovered variants we released in November

                    ftp://ftp.1000genomes.ebi.ac.uk/vol1...lease/2010_11/
                    OK, now i used SAMTOOLS to sort snp in HG00096.BAM and it's generated HG00096_snp.sorted.BAM. I need the HG000096.fna, e.g:

                    samtools pileup -cv -f genomes/NC_008253.fna ec_snp.sorted.bam

                    Comment

                    • laura
                      Senior Member
                      • Sep 2008
                      • 151

                      #25
                      Okay I think this is the point it might be a good idea for you to explain what your ultimate aim as I imagine we will be able to give you more help that way

                      In answer to your particular question. These genomes are aligned to the reference genome GRCh37 and you can find the copy we used here ftp://ftp.1000genomes.ebi.ac.uk/vol1...cal/reference/

                      Comment

                      • rstarke
                        Junior Member
                        • Feb 2011
                        • 7

                        #26
                        I'm having trouble finding information on how the high coverage exome data was generated for the 1K Genome main project. Not the targeted exon data that was part of the pilot phase, but the the full exome data that is partially available now. I want to be able to assess how good my alignments are, but need to know the exon capture method to find the intended target regions to do this. I could just use all RefSeq exons, or pick a specific exon capture kit's target list (like Nimblegen 2.1M), but it would be much better to have the real targets.

                        Comment

                        • laura
                          Senior Member
                          • Sep 2008
                          • 151

                          #27
                          The current target set for the 1000genomes exome sequencing can be found ftp://ftp.1000genomes.ebi.ac.uk/vol1...sus_exome_bed/

                          Comment

                          • rstarke
                            Junior Member
                            • Feb 2011
                            • 7

                            #28
                            Thanks! This is just what i needed.

                            Comment

                            Latest Articles

                            Collapse

                            • GATTACAT
                              Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                              by GATTACAT
                              Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
                              07-01-2026, 11:43 AM
                            • SEQadmin2
                              Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                              by SEQadmin2


                              I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                              Here are nine questions we think about, in roughly the order they matter, before...
                              06-18-2026, 07:11 AM

                            ad_right_rmr

                            Collapse

                            News

                            Collapse

                            Topics Statistics Last Post
                            Started by SEQadmin2, 07-02-2026, 11:08 AM
                            0 responses
                            7 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 06-30-2026, 05:37 AM
                            0 responses
                            12 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 06-26-2026, 11:10 AM
                            0 responses
                            20 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 06-17-2026, 06:09 AM
                            0 responses
                            54 views
                            0 reactions
                            Last Post SEQadmin2  
                            Working...