  • gatk python script

    Hi NGS users,
    can anyone tell me where I can find the GATK Python scripts (ConvertTableToAnnotatorRod.py, GenerateTranscriptToInfo.py...)?

    Thanks a lot,
    ME

  • #2
    Yes, I know where you can get them. The problem is that they are not public yet.



    • #3
      Thanks a lot.
      I have to generate a ROD file for depth of coverage across all exons of all genes in a whole-exome BAM file (following the instructions at http://www.broadinstitute.org/gsa/wi...It_Was_Created).
      Any idea how to fix this problem?



      • #4
        So I guess we are trying to do the same analysis.

        I think you don't need ROD files anymore; you should use VCF files instead.

        To calculate the depth of coverage:



        You will need:

        -R /path/to/your/reference.fasta
        -I /path/to/your/bam_file.bam
        -o /path/to/your/output_file

        And the intervals ...

        -L /path/to/your/intervals.interval_list
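
The arguments listed above can be assembled into one command line. Below is a minimal sketch, assuming a GATK 3.x-style invocation where the tool is selected with `-T DepthOfCoverage`. The jar name `GenomeAnalysisTK.jar` and the helper `build_depth_of_coverage_cmd` are illustrative assumptions, not something given in the post. The script only builds and prints the command; it does not run GATK.

```python
import shlex


def build_depth_of_coverage_cmd(reference, bam, output, intervals,
                                gatk_jar="GenomeAnalysisTK.jar"):
    """Return the argument list for a DepthOfCoverage run.

    gatk_jar is an assumed jar name; point it at your own installation.
    """
    return [
        "java", "-jar", gatk_jar,
        "-T", "DepthOfCoverage",   # GATK 3.x-style tool selection (assumption)
        "-R", reference,           # reference FASTA
        "-I", bam,                 # input BAM
        "-o", output,              # output file prefix
        "-L", intervals,           # intervals to restrict the analysis to
    ]


cmd = build_depth_of_coverage_cmd(
    "/path/to/your/reference.fasta",
    "/path/to/your/bam_file.bam",
    "/path/to/your/output_file",
    "/path/to/your/intervals.interval_list",
)

# Print a shell-safe version of the command for inspection.
print(" ".join(shlex.quote(arg) for arg in cmd))
```

Keeping the command as a list means it can be passed directly to `subprocess.run(cmd, check=True)` once the paths point at real files.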



        • #5
          You are right, we are performing the same analysis.
          OK, I'm already using Depth of Coverage v3.
          What do you use as the interval list?
          I have a whole exome. Can I use the BED file?
          How can I select a specific gene to calculate the coverage of each single exon?

          Thanks a lot again!!!!
          :-)



          • #6
            I'm still looking for an efficient way to do this too. I use samtools view and Perl to create a table for each sample (.bam) that holds, for each region (in this case exon coordinates):

            - average coverage in the region
            - percentage of bases covered at least 1x
            - percentage of bases covered at least 5x
            - etc.

            But I'm still searching for a way to store this efficiently while being able to compare it between any samples (whether whole-exome or any other capture design). If you do it for each exon, there are around 600,000 of them to calculate the stats for.
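
The per-region statistics listed above can be sketched as follows. This is only an illustration, not the poster's samtools/Perl pipeline: it assumes the per-base depths for one region have already been extracted into a list of integers, and the function name `region_coverage_stats` and the result keys are hypothetical.

```python
def region_coverage_stats(depths, thresholds=(1, 5)):
    """Summarise per-base depths for a single region (e.g. one exon).

    depths: list of integer read depths, one per base in the region.
    thresholds: minimum depths for which to report the covered percentage.
    """
    n_bases = len(depths)
    if n_bases == 0:
        raise ValueError("region has no bases")

    stats = {"mean_coverage": sum(depths) / n_bases}
    for threshold in thresholds:
        covered = sum(1 for depth in depths if depth >= threshold)
        stats[f"pct_ge_{threshold}"] = 100.0 * covered / n_bases
    return stats


# Toy region of five bases with made-up depths.
example = region_coverage_stats([0, 3, 5, 8, 4])
print(example)
```

Each region yields a small dictionary, so one row per (sample, region) pair can be written to a table and compared across samples regardless of capture design.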



            • #7
              Originally posted by m_elena_bioinfo
              You are right, we are performing the same analysis.
              OK, I'm already using Depth of Coverage v3.
              What do you use as the interval list?
              I have a whole exome. Can I use the BED file?
              How can I select a specific gene to calculate the coverage of each single exon?

              Thanks a lot again!!!!
              :-)
              Yes, sure, you can use the .bed file. You should have a .bed file with all targeted regions (exon regions). In DepthOfCoverage, set the option -L exons.bed.
              If you want exon-level coverage for a specific gene, you can provide an interval list with custom coordinates for the exons of that gene.

              If instead you only want the coverage of all genes, use the option -genelist.
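
One way to build such a custom interval list for a single gene is to filter the exon BED file by name. The sketch below assumes the fourth BED column holds the gene name, which depends on how your BED file was produced, and that the target format is one `chr:start-end` entry per line with 1-based inclusive coordinates. BED starts are 0-based, so the start is shifted by one. The function name `bed_to_intervals` and the toy records are hypothetical.

```python
def bed_to_intervals(bed_lines, gene=None):
    """Convert BED records to 'chr:start-end' strings (1-based, inclusive).

    gene: if given, keep only records whose 4th column equals this name
          (assumes the BED name column carries the gene name).
    """
    intervals = []
    for line in bed_lines:
        line = line.strip()
        # Skip blank lines and BED header/comment lines.
        if not line or line.startswith(("#", "track", "browser")):
            continue
        fields = line.split("\t")
        chrom, start, end = fields[0], int(fields[1]), int(fields[2])
        name = fields[3] if len(fields) > 3 else ""
        if gene is not None and name != gene:
            continue
        # BED is 0-based half-open; shift the start to 1-based inclusive.
        intervals.append(f"{chrom}:{start + 1}-{end}")
    return intervals


# Toy BED records with made-up coordinates and gene names.
bed = [
    "chr1\t100\t200\tGENE_A",
    "chr1\t300\t400\tGENE_B",
    "chr2\t50\t150\tGENE_A",
]
selected = bed_to_intervals(bed, gene="GENE_A")
print("\n".join(selected))
```

The printed lines can be saved to a file and passed with `-L`; check the documentation of your GATK version for the interval file extensions it accepts.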



              • #8
                Originally posted by Seq84
                If you want exon-level coverage for a specific gene, you can provide an interval list with custom coordinates for the exons of that gene.
                Hi,

                is it also possible to use a GRanges object from the R package GenomicRanges as an interval list of exons/genes?

                Thanks

                A.
