Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • More problems with Tophat (v1.3.2)

    I ran tophat version 1.3.2 to align my deep sequencing data to my reference genome via the following command

    tophat --solexa1.3-quals –p 6 –o /control/8905X1/tophat /rice_index/rice_index /8905X1/8905X1.txt

    It generated a file called accepted_hits.bam.
    I converted the file to a bed file using the bamToBed tool.
    Then split the file up based on chromosomes. Here are the results:
    331723028 May 11 09:26 accepted_hits_bed_Chr1
    119325730 May 11 09:26 accepted_hits_bed_Chr10
    123994839 May 11 09:26 accepted_hits_bed_Chr11
    121898474 May 11 09:27 accepted_hits_bed_Chr12
    416137943 May 11 09:27 accepted_hits_bed_Chr2
    334052893 May 11 09:27 accepted_hits_bed_Chr3
    161529836 May 11 09:27 accepted_hits_bed_Chr4
    347228298 May 11 09:28 accepted_hits_bed_Chr5
    189465060 May 11 09:28 accepted_hits_bed_Chr6
    200695493 May 11 09:28 accepted_hits_bed_Chr7
    171194241 May 11 09:28 accepted_hits_bed_Chr8
    759976461 May 11 09:29 accepted_hits_bed_Chr9
    2184791 May 11 09:29 accepted_hits_bed_ChrSy
    539606 May 11 09:29 accepted_hits_bed_ChrUn

    Tophat overloaded chromosome 9 which is one of the smaller chromosomes.

    Unless this can be resolved, I recommend not using Tophat.

  • #2
    To all,
    I discovered my problem with my RNA-seq data. It looks like there was some rRNA contamination in my sample which accounted for 5% of the total reads in the sample. The rRNA genes are located on chromosome 9. There are also some regions on chromosome 2 that show high homology to the rRNA genes on chromosome 9. This is the cause of the reads overloading chromosomes 9 and 2.

    The latest version of Tophat works fine.

    If other people are having problems with reads overloading a chromosome, check to see if there is rRNA contamination.
    Thanks all.

    Comment


    • #3
      With any RNA-seq data, you're almost always going to get rRNA contamination. It makes subsequent mapping quicker if you filter out the rRNA reads first (i.e. prior to any other mapping that is done).

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Essential Discoveries and Tools in Epitranscriptomics
        by seqadmin


        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
        Yesterday, 07:01 AM
      • seqadmin
        Current Approaches to Protein Sequencing
        by seqadmin


        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
        04-04-2024, 04:25 PM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 04-11-2024, 12:08 PM
      0 responses
      41 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 10:19 PM
      0 responses
      41 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 09:21 AM
      0 responses
      38 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-04-2024, 09:00 AM
      0 responses
      55 views
      0 likes
      Last Post seqadmin  
      Working...
      X