Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How should a strand-specific experiment look like in IGV?

    I have a paired-end, strand-specific dataset and used Trinity to assemble the contigs. I then mapped the reads back to the contigs to calculate transcript abundance and viewed the resulting bam file in IGV (don't have a reference genome).

    Is there a way that I can tell whether my dataset is really strand-specific just by looking at the picture in IGV? If I had single-end sequences I should expect to have reads only in one strand (i.e. pointing to the right). Am I right?

    But since I have paired-end reads, what the IGV picture should look like?

  • #2
    Hi,

    It depends on the library details, but you should could try "color by first-in-pair" from the popup menu. This will color by the strand of the template, rather than the read, at least for libraries I am familiar with. Also, igvtools has options to compute coverage by pair strand, although this isn't available in igv itself yet. Better strand support is on our radar.

    Jim

    Comment


    • #3
      Thanks Jim!

      I selected "first-of-pair" as you told me but can't find out what the colors mean (and couldn't find it either in IGV's manual).

      Can you please explain what the colors (red, blue and grey) mean?

      Comment


      • #4
        red = neg strand (first read of the pair is no the neg strand)
        blue = pos strand
        grey = pair information not available, so "first of pair" is undefined

        Comment


        • #5
          Thanks for the info Jim!

          So if most of the pairs in a contig contain reads that are like this:

          Code:
          =====>----------<=====
          (red)           (blue)
          then this means that sequencing was strand-specific.

          Am I right?

          What if both of them are red or blue?

          Comment


          • #6
            No, even in a non-strand-specific library, the two reads of a pair should point towards each other (or away from each other if it was a library-prep protocol with circularization). You should check whether it is predominantly the first read that point in the sense direction of the gene it aligns to.

            Comment


            • #7
              Originally posted by Jim Robinson View Post
              red = neg strand (first read of the pair is no the neg strand)
              blue = pos strand
              grey = pair information not available, so "first of pair" is undefined
              If I have Illumina directional RNA-seq data, should the colours be the opposite or the same as the strand of the RefSeq gene, when colouring by the first read of the pair ? My colours are opposite to the RefSeq gene, but when isoforms are assembled by Cufflinks with --library-type fr-secondstrand, the isoforms are, as expected, on the same strand as the gene.

              Comment


              • #8
                So is directional RNA-seq the same as strand-specific RNA-seq? any clarification will be greatly appreciated.

                Thanx

                Comment


                • #9
                  According to this, directional and strand-specific are the same.

                  Comment


                  • #10
                    Thanx panos-ed..

                    Comment


                    • #11
                      Hi,

                      I am trying to align and count the reads of paired-end strand-specific RNA-seq data using Hisat and htseq-count.
                      This is how my IGV look like. https://www.dropbox.com/s/snpapn051b...ness.jpeg?dl=0
                      (Color alignment by "first-of-pair strand") (view as pairs).
                      I am confused now. Why all the read pairs are red for NFE2L2? and clur for HNRNPA3?
                      I know that my data is strand specific, and that this color is showing the first pair read. So why both pairs have the same color?
                      Also can anyone tell me what options I should use for Hisat (or Tophat2) and htseq-count?

                      Thanks,
                      Last edited by ArghavanAri; 09-14-2015, 07:18 AM.

                      Comment


                      • #12
                        The two genes are on different strands, you can see by the little chevrons in the annotation track at bottom, so that is expected.

                        Comment


                        • #13
                          As others have mentioned, you need to consider the direction of mapping as well as the read number. For a standard Illumina strand-specific sample preparation, the reverse read is the first read:
                          • Reverse mapping, First read -- consistent with a gene model reading left to right on IGV
                          • Reverse mapping, Second read -- consistent with a gene model reading right to left on IGV
                          • Forward mapping, First read -- consistent with a gene model reading right to left on IGV
                          • Forward mapping, second read -- consistent with a gene model reading left to right on IGV


                          Just in case you weren't aware, Tablet (since v1.14.10.20) has a concordant / directional colouring mode that colours sequences appropriately.
                          Last edited by gringer; 09-14-2015, 07:31 PM.

                          Comment


                          • #14
                            Thanks for the information.
                            But I still cannot understand the difference between fr-firststrand & fr-secondstrand in Tophat2 (or RF & FR in Hisat).
                            Can anyone help me with that?

                            Comment


                            • #15
                              Originally posted by ArghavanAri View Post
                              Thanks for the information.
                              But I still cannot understand the difference between fr-firststrand & fr-secondstrand in Tophat2 (or RF & FR in Hisat).
                              Can anyone help me with that?
                              I am fairly sure fr-firststrand means read one (aka forward read or fr) corresponds to the reference genome sequence (firststrand). fr-secondstrand means read one corresponds to the reverse complement (secondstrand) of the reference genome. FR and RF would be a similar concept - but they are just making a shorthand where if the read one or forward read corresponds to the reference strand it maps to the left of the reverse read on a linear representation of the genome (FR) and if it corresponds to the reverse complement of the reference it maps to the right of the reverse (RF).

                              I hope that makes sense -- you might need to go back to DNA/genome structure and how Illumina works to really understand the above.

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Current Approaches to Protein Sequencing
                                by seqadmin


                                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                                04-04-2024, 04:25 PM
                              • seqadmin
                                Strategies for Sequencing Challenging Samples
                                by seqadmin


                                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                                03-22-2024, 06:39 AM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, 04-11-2024, 12:08 PM
                              0 responses
                              27 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-10-2024, 10:19 PM
                              0 responses
                              31 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-10-2024, 09:21 AM
                              0 responses
                              27 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-04-2024, 09:00 AM
                              0 responses
                              52 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X