SEQanswers

08-08-2012, 02:32 PM   #1
panos_ed
Member
 
Location: Geneva, Switzerland

Join Date: May 2010
Posts: 11
What should a strand-specific experiment look like in IGV?

I have a paired-end, strand-specific dataset and used Trinity to assemble the contigs. I then mapped the reads back to the contigs to calculate transcript abundance and viewed the resulting BAM file in IGV (I don't have a reference genome).

Is there a way to tell whether my dataset is really strand-specific just by looking at the picture in IGV? If I had single-end reads, I would expect to see reads on only one strand (i.e. all pointing to the right). Am I right?

But since I have paired-end reads, what should the IGV picture look like?

08-09-2012, 04:50 AM   #2
Jim Robinson
Member

Location: Boston, MA

Join Date: May 2009
Posts: 75

Hi,

It depends on the library details, but you could try "color by first-of-pair strand" from the popup menu. This will color by the strand of the template, rather than the read, at least for libraries I am familiar with. Also, igvtools has options to compute coverage by pair strand, although this isn't available in IGV itself yet. Better strand support is on our radar.

Jim

08-09-2012, 07:30 AM   #3
panos_ed
Member

Location: Geneva, Switzerland

Join Date: May 2010
Posts: 11

Thanks Jim!

I selected "first-of-pair" as you told me, but I can't work out what the colors mean (and couldn't find it in IGV's manual either).

Can you please explain what the colors (red, blue and grey) mean?

08-09-2012, 08:02 AM   #4
Jim Robinson
Member

Location: Boston, MA

Join Date: May 2009
Posts: 75

red = neg strand (first read of the pair is on the neg strand)
blue = pos strand
grey = pair information not available, so "first of pair" is undefined
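
For anyone wanting to reproduce this colouring outside IGV, here is a minimal Python sketch of how a first-of-pair strand could be derived from the SAM flag of each alignment. The bit values come from the SAM specification; the function name `first_of_pair_strand` is hypothetical, and the logic is an assumption about what the colour mode represents, not IGV's actual code. It returns '+' or '-' rather than a colour on purpose.

```python
# SAM flag bits, as defined in the SAM specification.
PAIRED = 0x1
READ_REVERSE = 0x10
MATE_REVERSE = 0x20
FIRST_IN_PAIR = 0x40
SECOND_IN_PAIR = 0x80


def first_of_pair_strand(flag):
    """Return '+', '-' or None (undefined) for one alignment's SAM flag."""
    if not flag & PAIRED:
        return None  # no pair information, so "first of pair" is undefined
    if flag & FIRST_IN_PAIR:
        # This alignment is read 1: use its own strand.
        reverse = bool(flag & READ_REVERSE)
    elif flag & SECOND_IN_PAIR:
        # This alignment is read 2: use the strand of its mate (read 1).
        reverse = bool(flag & MATE_REVERSE)
    else:
        return None
    return "-" if reverse else "+"
```

With this logic both mates of a normally oriented pair get the same value: flags 99 and 147 both give '+', and flags 83 and 163 both give '-'.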

08-09-2012, 03:25 PM   #5
panos_ed
Member

Location: Geneva, Switzerland

Join Date: May 2010
Posts: 11

Thanks for the info Jim!

So if most of the pairs in a contig contain reads that are like this:

Code:
=====>----------<=====
(red)           (blue)
then this means that sequencing was strand-specific.

Am I right?

What if both of them are red or blue?

08-09-2012, 11:46 PM   #6
Simon Anders
Senior Member

Location: Heidelberg, Germany

Join Date: Feb 2010
Posts: 994

No, even in a non-strand-specific library, the two reads of a pair should point towards each other (or away from each other if it was a library-prep protocol with circularization). You should check whether it is predominantly the first read that points in the sense direction of the gene it aligns to.
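
A rough way to put a number on that check is sketched below in Python. It assumes you already have, for each alignment, its SAM flag and the strand of the gene it overlaps; how you obtain that pairing (annotation, predicted ORFs on the contigs, etc.) is left open, and the function name `sense_fraction` is made up for illustration.

```python
def sense_fraction(records):
    """Fraction of read-1 alignments on the same strand as their gene.

    records: iterable of (sam_flag, gene_strand), gene_strand being '+' or '-'.
    Returns None when no read-1 alignments are present.
    """
    sense = antisense = 0
    for flag, gene_strand in records:
        # Skip alignments that are unpaired (0x1 unset) or not read 1 (0x40 unset).
        if not flag & 0x1 or not flag & 0x40:
            continue
        read_strand = "-" if flag & 0x10 else "+"  # 0x10 = read is reverse-complemented
        if read_strand == gene_strand:
            sense += 1
        else:
            antisense += 1
    total = sense + antisense
    return sense / total if total else None
```

A value close to 0 or close to 1 would suggest a strand-specific library (which of the two depends on the protocol), while a value near 0.5 would suggest an unstranded one.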

06-19-2013, 12:00 AM   #7
Dario1984
Senior Member

Location: Sydney, Australia

Join Date: Jun 2011
Posts: 166

Quote:
Originally Posted by Jim Robinson
red = neg strand (first read of the pair is on the neg strand)
blue = pos strand
grey = pair information not available, so "first of pair" is undefined
If I have Illumina directional RNA-seq data, should the colours be the opposite of or the same as the strand of the RefSeq gene when colouring by the first read of the pair? My colours are opposite to the RefSeq gene, but when isoforms are assembled by Cufflinks with --library-type fr-secondstrand, the isoforms are, as expected, on the same strand as the gene.

02-10-2014, 01:58 AM   #8
stanikay
Junior Member

Location: Pretoria

Join Date: Oct 2013
Posts: 4

So is directional RNA-seq the same as strand-specific RNA-seq? Any clarification will be greatly appreciated.

Thanx

02-10-2014, 02:29 AM   #9
panos_ed
Member

Location: Geneva, Switzerland

Join Date: May 2010
Posts: 11

According to this, directional and strand-specific are the same.

02-10-2014, 02:53 AM   #10
stanikay
Junior Member

Location: Pretoria

Join Date: Oct 2013
Posts: 4

Thanx panos-ed..

09-14-2015, 07:49 AM   #11
ArghavanAri
Junior Member

Location: Sweden

Join Date: Apr 2014
Posts: 2

Hi,

I am trying to align and count the reads of paired-end strand-specific RNA-seq data using Hisat and htseq-count.
This is what my IGV view looks like: https://www.dropbox.com/s/snpapn051b...ness.jpeg?dl=0
(Color alignment by "first-of-pair strand") (view as pairs).
I am confused now. Why are all the read pairs red for NFE2L2 and blue for HNRNPA3?
I know that my data is strand-specific, and that this color reflects the first read of the pair. So why do both reads of a pair have the same color?
Also can anyone tell me what options I should use for Hisat (or Tophat2) and htseq-count?

Thanks,

Last edited by ArghavanAri; 09-14-2015 at 08:18 AM.

09-14-2015, 11:10 AM   #12
cliffbeall
Senior Member

Location: Ohio

Join Date: Jan 2010
Posts: 144

The two genes are on different strands, as you can see from the little chevrons in the annotation track at the bottom, so that is expected.

09-14-2015, 07:54 PM   #13
gringer
David Eccles (gringer)

Location: Wellington, New Zealand

Join Date: May 2011
Posts: 838

As others have mentioned, you need to consider the direction of mapping as well as the read number. For a standard Illumina strand-specific sample preparation, the reverse read is the first read:
  • Reverse mapping, First read -- consistent with a gene model reading left to right on IGV
  • Reverse mapping, Second read -- consistent with a gene model reading right to left on IGV
  • Forward mapping, First read -- consistent with a gene model reading right to left on IGV
  • Forward mapping, Second read -- consistent with a gene model reading left to right on IGV
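
The four cases above collapse into a single comparison, sketched here in Python. The function name `implied_gene_direction` is hypothetical, and the rule only holds under the assumption stated in the post (a standard Illumina strand-specific preparation where the first read is the reverse read); other protocols would flip it.

```python
def implied_gene_direction(mapped_reverse, is_first_read):
    """Gene model direction implied by one read, following the four listed cases.

    mapped_reverse: True if the read maps to the reverse strand.
    is_first_read:  True for read 1 of the pair, False for read 2.
    """
    # Reverse + first and forward + second both imply left to right;
    # the two mixed combinations imply right to left.
    if mapped_reverse == is_first_read:
        return "left-to-right"
    return "right-to-left"
```

For example, `implied_gene_direction(True, True)` covers the "Reverse mapping, First read" case and returns "left-to-right".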

Just in case you weren't aware, Tablet (since v1.14.10.20) has a concordant / directional colouring mode that colours sequences appropriately.

Last edited by gringer; 09-14-2015 at 08:31 PM.

09-17-2015, 05:43 AM   #14
ArghavanAri
Junior Member

Location: Sweden

Join Date: Apr 2014
Posts: 2

Thanks for the information.
But I still cannot understand the difference between fr-firststrand & fr-secondstrand in Tophat2 (or RF & FR in Hisat).
Can anyone help me with that?

09-17-2015, 06:36 AM   #15
cliffbeall
Senior Member

Location: Ohio

Join Date: Jan 2010
Posts: 144

Quote:
Originally Posted by ArghavanAri
Thanks for the information.
But I still cannot understand the difference between fr-firststrand & fr-secondstrand in Tophat2 (or RF & FR in Hisat).
Can anyone help me with that?
I am fairly sure fr-firststrand means read one (aka forward read or fr) corresponds to the reference genome sequence (firststrand). fr-secondstrand means read one corresponds to the reverse complement (secondstrand) of the reference genome. FR and RF would be a similar concept - but they are just making a shorthand where if the read one or forward read corresponds to the reference strand it maps to the left of the reverse read on a linear representation of the genome (FR) and if it corresponds to the reverse complement of the reference it maps to the right of the reverse (RF).

I hope that makes sense -- you might need to go back to DNA/genome structure and how Illumina works to really understand the above.

09-18-2015, 06:41 AM   #16
kmcarr
Senior Member

Location: USA, Midwest

Join Date: May 2008
Posts: 1,178

Quote:
Originally Posted by cliffbeall
I am fairly sure fr-firststrand means read one (aka forward read or fr) corresponds to the reference genome sequence (firststrand). fr-secondstrand means read one corresponds to the reverse complement (secondstrand) of the reference genome. FR and RF would be a similar concept - but they are just making a shorthand where if the read one or forward read corresponds to the reference strand it maps to the left of the reverse read on a linear representation of the genome (FR) and if it corresponds to the reverse complement of the reference it maps to the right of the reverse (RF).

I hope that makes sense -- you might need to go back to DNA/genome structure and how Illumina works to really understand the above.
I'm afraid that this is incorrect.

First, 'fr' does not mean 'forward read'. It means that the two reads are arranged in the FR (Forward/Reverse) orientation, as opposed to the RF (Reverse/Forward) orientation. FR means that the reads of a pair are oriented with their 3' ends pointing toward each other when aligned to the reference, and RF means they are pointing away from each other.

Code:
FR:
-------------------------------------------------------------------------
   ----------------->                              <-------------------

RF:
-------------------------------------------------------------------------
   <-----------------                               ------------------->
Firststrand means that read #1 matches the first strand of cDNA generated during conversion of the single-stranded mRNA to ds-cDNA. In other words, R1 is the reverse complement of the mRNA. Note that all of these reference landmarks are relative to the transcript, not the genome. For Illumina strand-specific, paired-end mRNA-Seq reads, the orientation of the reads relative to their original RNA is:

Code:
  mRNA 5'-----------------------------------------------------------AAAAAAAAAAAA
                   R2 ------------>                <------------ R1
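
As a rough cheat sheet for the library-type question, the Python sketch below collects the option spellings that, as far as I know, correspond to each other across TopHat2, HISAT and htseq-count. The dictionary name and layout are made up for illustration, and the flags should be checked against each tool's manual for the installed version before use.

```python
# Hypothetical lookup table: library type -> equivalent option per tool.
# An empty string means the tool needs no strandness option for that case.
STRAND_OPTIONS = {
    # Read 1 is the reverse complement of the mRNA (e.g. dUTP-type libraries).
    "fr-firststrand": {
        "tophat2": "--library-type fr-firststrand",
        "hisat": "--rna-strandness RF",
        "htseq-count": "--stranded=reverse",
    },
    # Read 1 has the same sequence as the mRNA.
    "fr-secondstrand": {
        "tophat2": "--library-type fr-secondstrand",
        "hisat": "--rna-strandness FR",
        "htseq-count": "--stranded=yes",
    },
    # No strand information is preserved.
    "fr-unstranded": {
        "tophat2": "--library-type fr-unstranded",
        "hisat": "",
        "htseq-count": "--stranded=no",
    },
}


def options_for(library_type, tool):
    """Look up the option string for one library type and tool."""
    return STRAND_OPTIONS[library_type][tool]
```

For the read layout drawn above (R1 antisense to the mRNA), `options_for("fr-firststrand", "htseq-count")` gives `--stranded=reverse`.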

02-09-2017, 10:14 AM   #17
sowmyai
Member

Location: America

Join Date: Jan 2010
Posts: 27

Looks like Jim Robinson had it backwards for the color legends. Actually

red=positive,
blue=negative

He has clarified here:
https://groups.google.com/forum/#!to...lp/YiVzwnUuOZM

04-06-2017, 10:48 AM   #18
Camg
Member

Location: Vancouver

Join Date: Jan 2011
Posts: 21

Quote:
Originally Posted by sowmyai
Looks like Jim Robinson had it backwards for the color legends. Actually

red=positive,
blue=negative

He has clarified here:
https://groups.google.com/forum/#!to...lp/YiVzwnUuOZM
I always thought that blue=positive and red=negative. See screen shots of my data in IGV.

I used Illumina dUTP-type stranded library prep and fr-firststrand with Tophat2. I find several places that say dUTP libraries should use fr-firststrand with Tophat, for example: http://onetipperday.sterding.com/201...pe-to-use.html
https://www.biostars.org/p/64250/

Is anyone able to clear this up?

Thanks
Attached Images
File Type: png sense.antisense1.png (59.5 KB, 23 views)
File Type: png sense.antisense.png (60.0 KB, 20 views)

04-07-2017, 05:28 AM   #19
gringer
David Eccles (gringer)

Location: Wellington, New Zealand

Join Date: May 2011
Posts: 838

Tablet has a "read concordance" mode for viewing strand direction on reads. Both pairs of reads that are consistent with a mapped direction following the reference sequence are coloured green (customisable), and reads that are consistent with the reverse complement of the reference sequence are coloured red.

Tags
igv, strand-specific

All times are GMT -8.