Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • No coverage or very low coverage in the Complete Genomics data

    Hi,

    Has anyone worked on Complete Genomics data? Most of the exons in the data have no coverage or very low coverage (5-10 reads only) even though the reads are sequenced at 40X coverage. Can anyone explain why is it so?

    Thanks

    Regards

  • #2
    Hi! Is it really the exons only, that show reduced coverage and the rest of the genome shows 40x? Have you checked the mean of all exons or for one particular gene only? What kind of genome and data are you working with? In general I would say it's kind of normal to see differences in coverage over the genome, but I am not aware of a exon specific bias. However, if you look on a particular gene or even a gene-family, the coverage might be reduced because of homology regions and multimapped reads with low mapping quality that are filtered out. Maybe it is worth to have a look on the percentage of reads you are able to map against your reference. A high number of discarded sequences could be a hind of such an effect.

    Comment


    • #3
      I’m sorry, I didn’t explain the issue properly. I’m working on CG WGS data with avg read depth of 40x. The QC metrics looks good for all parameters and the alignment rate is 97.43%. However, I had used cgaTools to convert tsv files provided by CG to BAM. When I visualize these BAM files on IGV, I see minimal coverage at all exons and major parts of introns for all genes (See example image attached). The trend is normally scant coverage hills at junction of intron and exon. I believe something went wrong in my conversion step. Can anyone please suggest a solution for this?
      Or is there any other visualization tool specific for CG data that I should be using?”

      The IGV snapshots is attached herewith.

      Thanks in advance
      Attached Files
      Last edited by raman91; 01-09-2018, 01:48 AM.

      Comment


      • #4
        Hi! It could be an issue due to differing genome builds. Ich guess igv is using hg19 and reads are mapped to hg38.

        Comment


        • #5
          You should be able to Check this by zoom in to nucleotide Level in igv. If the reads Do Not Match the reference this would be a Hind. I am Not a 100% Sure but i believe igv only uses the coordinates and cigar of the bam and does Not care about matching nucleotides. So maybe it is just a slippage of coordinates between hg19 and 38.

          Comment


          • #6
            Thanks for your reply. I am pretty sure that the reads are mapped to hg19 build only. I checked the IGV browser too and the read sequence match to the reference sequence.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Strategies for Sequencing Challenging Samples
              by seqadmin


              Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
              03-22-2024, 06:39 AM
            • seqadmin
              Techniques and Challenges in Conservation Genomics
              by seqadmin



              The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

              Avian Conservation
              Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
              03-08-2024, 10:41 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, Yesterday, 06:37 PM
            0 responses
            8 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, Yesterday, 06:07 PM
            0 responses
            8 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 03-22-2024, 10:03 AM
            0 responses
            49 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 03-21-2024, 07:32 AM
            0 responses
            66 views
            0 likes
            Last Post seqadmin  
            Working...
            X