Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • yh253
    Member
    • Jul 2009
    • 16

    hg19 genome reference for short read mapping

    Dear all,

    I am wondering does anyone use only base genome sequences:chrs1:22, X, Y when does the reads alignment? By excluding chrM, chrUn... etc., it will surely have some effect on the alignment results, but would this be a serious problem?

    I actually excluded all these sequences apart from base sequences during my alignment with bowtie/bwa, because we thought we were not very interested in the binding on the mitochondrion sequences, and somehow we could get more alignable reads, but recently we found some of the most duplicated reads in our data have been perfectly mapped to mitochondrion sequences, but with one or two mismatches to the base sequences, which make us to think about whether it is necessary for us to include the chrM into our reference genome. It would be really appreciated if you could shed some light on it! Thank you very much in advance!

    Yuan
  • john_mu
    Member
    • May 2010
    • 88

    #2
    It is useful to include chrM so that you know which of your reads are multiply mapped. This could help you assess the reliability of the mapping. Besides, ChrM is quite small so it should not take long to do the mapping.
    SpliceMap: De novo detection of splice junctions from RNA-seq
    Download SpliceMap Comment here

    Comment

    • arcolombo698
      Senior Member
      • Nov 2013
      • 142

      #3
      Hello. I have the same issue here. I am not sure where to find the ChrM reference genome to add to my tophat alignment. I am looking here http://genome.ucsc.edu/cgi-bin/hgGateway

      Comment

      • arcolombo698
        Senior Member
        • Nov 2013
        • 142

        #4
        ChrM in the reference genome

        Originally posted by yh253 View Post
        Dear all,

        I actually excluded all these sequences apart from base sequences during my alignment with bowtie/bwa, because we thought we were not very interested in the binding on the mitochondrion sequences, and somehow we could get more alignable reads, but recently we found some of the most duplicated reads in our data have been perfectly mapped to mitochondrion sequences, but with one or two mismatches to the base sequences,

        ***********
        which make us to think about whether it is necessary for us to include the chrM into our reference genome. It would be really appreciated if you could shed some light on it! Thank you very much in advance!
        ***********
        Yuan
        Yes I am interested in the same issue. And what I have found with the hg19 reference is that it already contains the chrM.fa within the chromosome directory of the hg19. which means that when you run the alignment, if something should be aligned to chrM, it will.

        I think that you may not need to add the chrM reference if it is already in the chromosome directory of hg19 (assuming you are studying homo sapiens).

        ?? is this true?

        Comment

        • yueluo
          Member
          • Aug 2013
          • 82

          #5
          Originally posted by arcolombo698 View Post
          Yes I am interested in the same issue. And what I have found with the hg19 reference is that it already contains the chrM.fa within the chromosome directory of the hg19. which means that when you run the alignment, if something should be aligned to chrM, it will.

          I think that you may not need to add the chrM reference if it is already in the chromosome directory of hg19 (assuming you are studying homo sapiens).

          ?? is this true?
          I'm not sure about this, but a short ' grep ">" your-reference-seq.fa ' command should tell you what's in your reference.

          Comment

          Latest Articles

          Collapse

          • SEQadmin2
            Nine Things a Sample Prep Scientist Thinks About Before Sequencing
            by SEQadmin2


            I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

            Here are nine questions we think about, in roughly the order they matter, before...
            06-18-2026, 07:11 AM
          • SEQadmin2
            From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
            by SEQadmin2


            Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


            The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
            ...
            06-02-2026, 10:05 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, 06-26-2026, 11:10 AM
          0 responses
          16 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-17-2026, 06:09 AM
          0 responses
          49 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-09-2026, 11:58 AM
          0 responses
          108 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-05-2026, 10:09 AM
          0 responses
          125 views
          0 reactions
          Last Post SEQadmin2  
          Working...