Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • GATK - DepthOfCoverage

    Hi

    I try to use DepthOfCovareage. My command line is

    java -jar GenomeAnalysisTK.jar
    -T DepthOfCoverage
    -L 5:125921513-125923713
    -o test
    -I input.bam
    -R hg18.fa

    For the firsts lines I obtain these results :
    Locus Total_Depth Average_Depth_sample Depth_for_N/A
    5:125921513 253 253.00 253
    5:125921514 285 285.00 285
    5:125921515 292 292.00 292
    5:125921516 310 310.00 310
    5:125921517 319 319.00 319
    5:125921518 320 320.00 320
    (...)

    but when I'm looking for the reads aligned at the position 125921513 (for example ... first line of my output file from GATK) via this command line :

    samtools view input.bam | cut -f 3-4 | grep 125921513 | wc -l
    I obtain 7214 data (and all of them are within chromosome 5!)

    When I do the same position 125921514, I obtain 41 reads.

    I'd like to understand the difference between my interpretation of the bam file and the interpretation done by GATK. Which one is good? If it's GATK what about my command line output? What does GATK use?

    THanks a lot for your time

  • #2
    Are you sure nothing is wrong with your files prior to running DepthOfCoverage? I feel it's unlikely to have 253x coverage for one base and then have it jump to 285x coverage for the next base, although I don't know how the data was generated.

    Comment


    • #3
      Hi Heisman.
      It seems my command line was wrong. Someone told me that "
      "Your command is finding all records in the SAM file with a start position of 125921513. (The depth at that location is a combination of the reads that start there, and those that start prior to that location, yet span it).

      In addition, there may be a number of pieces of information about each record such that it does not count as 'depth' at a particular site. For instance, unmapped reads can still have a start location (from the SAM spec: "Bit 0x4 [of the flag field] is the only reliable place to tell whether the segment is unmapped. If 0x4 is set, no
      assumptions can be made about RNAME, POS, CIGAR, MAPQ..."). Similarly, the records could be marked as optical duplicates, or as having failed vendor quality checks; in both cases they should not be counted as a part of coverage. "
      so it make sense I think...
      Thanks for your time!

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Current Approaches to Protein Sequencing
        by seqadmin


        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
        04-04-2024, 04:25 PM
      • seqadmin
        Strategies for Sequencing Challenging Samples
        by seqadmin


        Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
        03-22-2024, 06:39 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 04-11-2024, 12:08 PM
      0 responses
      18 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 10:19 PM
      0 responses
      22 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 09:21 AM
      0 responses
      17 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-04-2024, 09:00 AM
      0 responses
      48 views
      0 likes
      Last Post seqadmin  
      Working...
      X