Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • hg19 genome reference for short read mapping

    Dear all,

    I am wondering does anyone use only base genome sequences:chrs1:22, X, Y when does the reads alignment? By excluding chrM, chrUn... etc., it will surely have some effect on the alignment results, but would this be a serious problem?

    I actually excluded all these sequences apart from base sequences during my alignment with bowtie/bwa, because we thought we were not very interested in the binding on the mitochondrion sequences, and somehow we could get more alignable reads, but recently we found some of the most duplicated reads in our data have been perfectly mapped to mitochondrion sequences, but with one or two mismatches to the base sequences, which make us to think about whether it is necessary for us to include the chrM into our reference genome. It would be really appreciated if you could shed some light on it! Thank you very much in advance!

    Yuan

  • #2
    It is useful to include chrM so that you know which of your reads are multiply mapped. This could help you assess the reliability of the mapping. Besides, ChrM is quite small so it should not take long to do the mapping.
    SpliceMap: De novo detection of splice junctions from RNA-seq
    Download SpliceMap Comment here

    Comment


    • #3
      Hello. I have the same issue here. I am not sure where to find the ChrM reference genome to add to my tophat alignment. I am looking here http://genome.ucsc.edu/cgi-bin/hgGateway

      Comment


      • #4
        ChrM in the reference genome

        Originally posted by yh253 View Post
        Dear all,

        I actually excluded all these sequences apart from base sequences during my alignment with bowtie/bwa, because we thought we were not very interested in the binding on the mitochondrion sequences, and somehow we could get more alignable reads, but recently we found some of the most duplicated reads in our data have been perfectly mapped to mitochondrion sequences, but with one or two mismatches to the base sequences,

        ***********
        which make us to think about whether it is necessary for us to include the chrM into our reference genome. It would be really appreciated if you could shed some light on it! Thank you very much in advance!
        ***********
        Yuan
        Yes I am interested in the same issue. And what I have found with the hg19 reference is that it already contains the chrM.fa within the chromosome directory of the hg19. which means that when you run the alignment, if something should be aligned to chrM, it will.

        I think that you may not need to add the chrM reference if it is already in the chromosome directory of hg19 (assuming you are studying homo sapiens).

        ?? is this true?

        Comment


        • #5
          Originally posted by arcolombo698 View Post
          Yes I am interested in the same issue. And what I have found with the hg19 reference is that it already contains the chrM.fa within the chromosome directory of the hg19. which means that when you run the alignment, if something should be aligned to chrM, it will.

          I think that you may not need to add the chrM reference if it is already in the chromosome directory of hg19 (assuming you are studying homo sapiens).

          ?? is this true?
          I'm not sure about this, but a short ' grep ">" your-reference-seq.fa ' command should tell you what's in your reference.

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Essential Discoveries and Tools in Epitranscriptomics
            by seqadmin




            The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
            04-22-2024, 07:01 AM
          • seqadmin
            Current Approaches to Protein Sequencing
            by seqadmin


            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
            04-04-2024, 04:25 PM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, Yesterday, 08:47 AM
          0 responses
          14 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-11-2024, 12:08 PM
          0 responses
          60 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 10:19 PM
          0 responses
          60 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 09:21 AM
          0 responses
          54 views
          0 likes
          Last Post seqadmin  
          Working...
          X