Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • sitapriyamoorthi
    Junior Member
    • Mar 2017
    • 4

    RNAseq alignment -misaligned on the genome!

    Hello ,

    My colleague did the mapping of publicly available human RNAseq data. However when I view this alignment on IGV the reads map next to the genes and not above it? Why would this be happening? Any suggestions?
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    Can you clarify what do you mean by that? Perhaps attach a screenshot to explain.

    Comment

    • Bukowski
      Senior Member
      • Jan 2010
      • 388

      #3
      You're looking at the wrong genome build/annotation in IGV almost certainly.

      Comment

      • r.rosati
        Member
        • Aug 2015
        • 95

        #4
        I agree with Bukowski, if you're visualizing the data against hg19, try switching to GRCh38 - or vice-versa. Your collaborator aligned the data against a genome build that's different from the one you're using on IGV.

        Comment

        • wdecoster
          Member
          • Oct 2015
          • 97

          #5
          Classic mistake, but you'll make this one only once

          Comment

          • sitapriyamoorthi
            Junior Member
            • Mar 2017
            • 4

            #6
            Nope correct genome build

            Nope we are looking at it with the same genome build that he used to align same problem.

            Comment

            • r.rosati
              Member
              • Aug 2015
              • 95

              #7
              Curious... can you provide a screenshot?

              Comment

              • sitapriyamoorthi
                Junior Member
                • Mar 2017
                • 4

                #8
                Here are the screen shots

                So this is the screen shot for ACTB gene and as you can see the reads show up ~1kb downstrem
                Attached Files
                Last edited by sitapriyamoorthi; 04-01-2017, 10:45 AM.

                Comment

                • r.rosati
                  Member
                  • Aug 2015
                  • 95

                  #9
                  I have half an hypothesis. See if you agree.
                  By the (blurry) coordinates it seems like you're using hg19, is this correct? If so, then ACTB is within the window chr7:5,566,700-5,570,300. I'll use these numbers.

                  First thing, if your IGV is set to show mismatches in color, then you will agree with me that the build used for alignment and the build you're using on IGV don't match, because the nucleotides are all in color = mismatched. On the other side, if you've set IGV to show all nucleotides in color regardless of match, then disregard this comment.

                  You do see some reads about 1kb downstream, but you will agree with me that the exon structure totally doesn't match that of ACTB. You see a long exon, then three short ones, then one last long exon. This doesn't match ACTB.

                  However, 22kb downstream you have the FSCN1 gene, whose exon structure really seems to match the transcriptome reads. I'm not affirming those reads totally absolutely belong to FSCN1, but...

                  Click image for larger version

Name:	Untitled.jpg
Views:	1
Size:	77.6 KB
ID:	305246

                  If this is the case, and you say that you see these reads 1kb downstream to ACTB, then maybe your reads are actually shifted 60kb upstream.
                  And if so, then you should see transcriptome reads matching the ACTB exon structure within about the region: chr7:5,500,000-5,518,000.

                  The hole in this hypothesis is: I don't know which genome build would have ACTB shifted 60kb upstream compared to hg19. It's not GRCh38 nor hg18. Could it be an earlier version of hg19?

                  In the end, I would talk to your collaborator and be 100% sure about the genome build he's used to align the data. Although the mapping doesn't match hg19 the data is likely not wrong, but it's best to realign it to GRCh38 or hg19.
                  Last edited by r.rosati; 04-04-2017, 01:19 PM.

                  Comment

                  • r.rosati
                    Member
                    • Aug 2015
                    • 95

                    #10
                    ...Any news?

                    Comment

                    • sitapriyamoorthi
                      Junior Member
                      • Mar 2017
                      • 4

                      #11
                      Dear All,
                      I thought I had replied to this thread but I hadn't. R.Rosati, thank you for you painstaking response. I asked my collaborator again to check the genome build he had used. Being a yeast person he did not realize how different genome annotations were version to version, infact he had used GRCh38 and not hg 19. Problem resolved! Thank you all for your prompt responses it helped me query the problem with my collaborator more confidently!

                      Comment

                      Latest Articles

                      Collapse

                      • SEQadmin2
                        Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                        by SEQadmin2


                        I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                        Here are nine questions we think about, in roughly the order they matter, before...
                        06-18-2026, 07:11 AM
                      • SEQadmin2
                        From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                        by SEQadmin2


                        Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                        The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                        ...
                        06-02-2026, 10:05 AM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by SEQadmin2, Today, 05:37 AM
                      0 responses
                      5 views
                      0 reactions
                      Last Post SEQadmin2  
                      Started by SEQadmin2, 06-26-2026, 11:10 AM
                      0 responses
                      16 views
                      0 reactions
                      Last Post SEQadmin2  
                      Started by SEQadmin2, 06-17-2026, 06:09 AM
                      0 responses
                      49 views
                      0 reactions
                      Last Post SEQadmin2  
                      Started by SEQadmin2, 06-09-2026, 11:58 AM
                      0 responses
                      109 views
                      0 reactions
                      Last Post SEQadmin2  
                      Working...