Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • raman91
    Junior Member
    • May 2016
    • 6

    No coverage or very low coverage in the Complete Genomics data

    Hi,

    Has anyone worked on Complete Genomics data? Most of the exons in the data have no coverage or very low coverage (5-10 reads only) even though the reads are sequenced at 40X coverage. Can anyone explain why is it so?

    Thanks

    Regards
  • Gorgon_VZ
    Member
    • Oct 2016
    • 10

    #2
    Hi! Is it really the exons only, that show reduced coverage and the rest of the genome shows 40x? Have you checked the mean of all exons or for one particular gene only? What kind of genome and data are you working with? In general I would say it's kind of normal to see differences in coverage over the genome, but I am not aware of a exon specific bias. However, if you look on a particular gene or even a gene-family, the coverage might be reduced because of homology regions and multimapped reads with low mapping quality that are filtered out. Maybe it is worth to have a look on the percentage of reads you are able to map against your reference. A high number of discarded sequences could be a hind of such an effect.

    Comment

    • raman91
      Junior Member
      • May 2016
      • 6

      #3
      I’m sorry, I didn’t explain the issue properly. I’m working on CG WGS data with avg read depth of 40x. The QC metrics looks good for all parameters and the alignment rate is 97.43%. However, I had used cgaTools to convert tsv files provided by CG to BAM. When I visualize these BAM files on IGV, I see minimal coverage at all exons and major parts of introns for all genes (See example image attached). The trend is normally scant coverage hills at junction of intron and exon. I believe something went wrong in my conversion step. Can anyone please suggest a solution for this?
      Or is there any other visualization tool specific for CG data that I should be using?”

      The IGV snapshots is attached herewith.

      Thanks in advance
      Attached Files
      Last edited by raman91; 01-09-2018, 01:48 AM.

      Comment

      • Gorgon_VZ
        Member
        • Oct 2016
        • 10

        #4
        Hi! It could be an issue due to differing genome builds. Ich guess igv is using hg19 and reads are mapped to hg38.

        Comment

        • Gorgon_VZ
          Member
          • Oct 2016
          • 10

          #5
          You should be able to Check this by zoom in to nucleotide Level in igv. If the reads Do Not Match the reference this would be a Hind. I am Not a 100% Sure but i believe igv only uses the coordinates and cigar of the bam and does Not care about matching nucleotides. So maybe it is just a slippage of coordinates between hg19 and 38.

          Comment

          • raman91
            Junior Member
            • May 2016
            • 6

            #6
            Thanks for your reply. I am pretty sure that the reads are mapped to hg19 build only. I checked the IGV browser too and the read sequence match to the reference sequence.

            Comment

            Latest Articles

            Collapse

            • SEQadmin2
              Nine Things a Sample Prep Scientist Thinks About Before Sequencing
              by SEQadmin2


              I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


              Here are nine questions we think about, in roughly the order they matter, before...
              06-18-2026, 07:11 AM
            • SEQadmin2
              From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
              by SEQadmin2


              Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


              The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
              ...
              06-02-2026, 10:05 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by SEQadmin2, 06-17-2026, 06:09 AM
            0 responses
            26 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-09-2026, 11:58 AM
            0 responses
            43 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-05-2026, 10:09 AM
            0 responses
            48 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-04-2026, 08:59 AM
            0 responses
            49 views
            0 reactions
            Last Post SEQadmin2  
            Working...