Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • gatk python script

    Hi NGS users,
    anyone could say me where i can find the python script of GATK (ConvertTableToAnnotatorRod.py, GenerateTranscriptToInfo.py...)?

    Thanks a lot,
    ME

  • #2
    Yes I know where you can get, the problem is that it's not public yet

    Comment


    • #3
      thanks a lot.
      I have to generate rod file for depth coverage in all exon of all genes in a whole-exome bam file (according to http://www.broadinstitute.org/gsa/wi...It_Was_Created instruction).
      Any idea about fix this problem?

      Comment


      • #4
        So I guess we are trying to do the same analysis

        I think, you don't need rod files anymore, you should use VCF files instead

        To calculate the depth of coverage:



        You will need:

        -R /path/to/your/reference.fasta
        -I /path/to/your/bam_file.bam
        -o /path/to/your/output_file

        And the intervals ...

        -L /path/to/your/intervals.interval_list

        Comment


        • #5
          You are right, we are performing the same analysis.
          Ok, I'm already using the depth coverage v3.
          As interval list what do you use?
          I have a whole exome. Can I use the bed file?
          How can i select the specific gene for calculate the coverage in each single exon?

          Thanks a lot again!!!!
          :-)

          Comment


          • #6
            I'm still looking at an efficient way to do this too. I use samtools view and perl to create a table for each sample (.bam) with for each region (so in this case exon coordinates):

            - average coverage in region
            - percentage bases covered by at least 1
            - percentage bases covered by at least 5
            - etc

            But still searching for a way to store this efficiently while being able to compare it between any sample (whether whole-exome or any other capture design). If you do it for each exon, there's around 600.000 of them to calculate the stats for..

            Comment


            • #7
              Originally posted by m_elena_bioinfo View Post
              You are right, we are performing the same analysis.
              Ok, I'm already using the depth coverage v3.
              As interval list what do you use?
              I have a whole exome. Can I use the bed file?
              How can i select the specific gene for calculate the coverage in each single exon?

              Thanks a lot again!!!!
              :-)
              Yes sure you can use the .bed file. You should have a .bed file with all targeted regions (exons regions). Set in depthofcoverage the option -L exons.bed.
              If you want to have a specific coverage (of exons) for each gene you can provide an interval list with custom coordinates of the exons of your gene.

              Instead if you want only the coverage of all genes use the option -genelist.

              Comment


              • #8
                Originally posted by Seq84 View Post
                If you want to have a specific coverage (of exons) for each gene you can provide an interval list with custom coordinates of the exons of your gene.
                Hi,

                is it also possible to use the GRanges object from the R package GenomicRanges as an interval list of exons/gernes?

                Thanks

                A.

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Advancing Precision Medicine for Rare Diseases in Children
                  by seqadmin




                  Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                  12-16-2024, 07:57 AM
                • seqadmin
                  Recent Advances in Sequencing Technologies
                  by seqadmin



                  Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                  Long-Read Sequencing
                  Long-read sequencing has seen remarkable advancements,...
                  12-02-2024, 01:49 PM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, 12-17-2024, 10:28 AM
                0 responses
                26 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 12-13-2024, 08:24 AM
                0 responses
                42 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 12-12-2024, 07:41 AM
                0 responses
                28 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 12-11-2024, 07:45 AM
                0 responses
                42 views
                0 likes
                Last Post seqadmin  
                Working...
                X