
**simonandrews** · 03-07-2013, 12:43 AM

The method I used for this was to create a test based on binomial statistics (there is a video tutorial of our method; the tool has been updated since it was recorded but works in the same way). Because strand-specific libraries aren't completely clean, I started by working out the global level of antisense transcription to get a measure of the level of antisense noise (assuming that true antisense transcription would be a small proportion of all observed antisense reads).

For the test I then looked at the number of reads which mapped to a given region (I was using genes as my test regions). I then used the global antisense level to calculate an expected number of antisense reads given the total read count and then used a binomial test to see if the observed number of antisense reads was greater than this.
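As a rough illustration of the test described above, here is a minimal sketch in Python. It is not SeqMonk's implementation: the function names, the input layout (a dict of per-gene sense and antisense read counts) and the choice to estimate the global antisense rate from the summed gene counts are all assumptions made for the example. The p-value is the one-sided binomial tail P(X >= observed antisense reads).

```python
import math

def binom_upper_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p), summed in log space for stability."""
    if k <= 0:
        return 1.0
    if k > n or p <= 0.0:
        return 0.0
    if p >= 1.0:
        return 1.0
    log_p, log_q = math.log(p), math.log1p(-p)
    terms = [
        math.lgamma(n + 1) - math.lgamma(i + 1) - math.lgamma(n - i + 1)
        + i * log_p + (n - i) * log_q
        for i in range(k, n + 1)
    ]
    m = max(terms)
    return min(1.0, math.exp(m) * sum(math.exp(t - m) for t in terms))

def antisense_pvalues(counts):
    """counts: {gene: (sense_reads, antisense_reads)} (hypothetical layout).

    Returns (global_rate, {gene: one-sided p-value}).
    """
    total_sense = sum(s for s, _ in counts.values())
    total_anti = sum(a for _, a in counts.values())
    # Assumption: the global antisense level is estimated from all gene reads.
    global_rate = total_anti / (total_sense + total_anti)
    pvals = {
        gene: binom_upper_tail(anti, sense + anti, global_rate)
        for gene, (sense, anti) in counts.items()
    }
    return global_rate, pvals

# Toy counts, invented purely for illustration.
toy = {"geneA": (980, 20), "geneB": (990, 10), "geneC": (700, 300)}
rate, pvals = antisense_pvalues(toy)
```

With many genes the p-values would need multiple-testing correction. If SciPy is available, `scipy.stats.binomtest(k, n, p, alternative="greater")` should give the same one-sided test without the hand-rolled tail sum.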

The test works pretty well, but there are some things you'll want to be aware of. The biggest problem is that in a surprisingly large number of cases we found predicted antisense transcription which occurred because the 3' UTRs of genes on opposite strands of the chromosome overlapped. This isn't an incorrect result as such but it might not be what you're looking for. We also found quite a few cases where we saw a very tightly packed column of antisense reads in a very small area. These could be small transcripts, but we suspect a lot of them will be mapping artefacts, so we might add in a filter to measure the physical extent of antisense transcription (proportion of the gene covered or something similar), rather than just the number of reads.
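The "physical extent" filter mentioned above could look something like the following sketch. It is only an assumption about how such a filter might work, not something implemented in SeqMonk: it merges the antisense read intervals that fall within a gene and reports the fraction of the gene they cover, so a tight column of reads scores low however many reads it contains. Coordinates are taken to be 0-based, half-open, and `covered_fraction` is a hypothetical name.

```python
def covered_fraction(gene_start, gene_end, reads):
    """Fraction of [gene_start, gene_end) covered by at least one read.

    reads: iterable of (start, end) half-open intervals on the same chromosome.
    """
    gene_len = gene_end - gene_start
    if gene_len <= 0:
        raise ValueError("gene_end must be greater than gene_start")
    # Clip each read to the gene, then merge overlapping intervals.
    clipped = sorted((max(s, gene_start), min(e, gene_end)) for s, e in reads)
    covered = 0
    cur_start = cur_end = None
    for s, e in clipped:
        if s >= e:
            continue  # read lies entirely outside the gene
        if cur_end is None or s > cur_end:
            if cur_end is not None:
                covered += cur_end - cur_start
            cur_start, cur_end = s, e
        else:
            cur_end = max(cur_end, e)
    if cur_end is not None:
        covered += cur_end - cur_start
    return covered / gene_len

# 50 reads stacked on the same 50 bp of a 1 kb gene: many reads, little extent.
stacked = [(500, 550)] * 50
spread = [(0, 100), (50, 150), (900, 1100)]
```

A gene could then be required to pass both the binomial test and some minimum covered fraction; where to set that cutoff would have to be tuned on real data.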

**Nicolas Nalpas** · 10-24-2013, 08:47 AM

Hi everyone,

I am also trying to identify antisense transcripts and quantify them.
My data are coming from cattle macrophages and have been prepared as paired-end strand-specific RNA-seq (using the ScriptSeq v2 kit). I have used STAR for the alignment and featureCounts for gene count summarisation (with option -s 1 to get sense gene counts and -s 2 to get antisense gene counts).

Based on my analyses, I obtain a very high correlation between sense and antisense counts per gene (see picture), which is not really what I was expecting (hoping) to see. Can people share advice on this?
Can people share their experience of the pipeline they use to identify antisense transcripts (I'll probably give SeqMonk a try)? Also, in mammals, what is the expected amount of antisense transcription? About 10% of my reads map to a gene but on the opposite strand.
Also, my RNA-seq library preparation includes poly(A)+ purification; do people generally see antisense transcription after such a step?

Thanks a lot, regards,
Nicolas

Attached Files

CPM_sense_antisense.1.png (39.6 KB, 71 views)

**puggie** · 10-24-2013, 10:20 AM

I think there is a technical factor to take into consideration. Although the libraries are strand-specific (ScriptSeq), I believe there is some contamination (DNA, wrong direction, etc.). At least in my data I get perhaps 2-3% of reads mapping in the "wrong direction"; some of this may be biological, some technical. Try mapping to your genome and inspecting the distribution of forward and reverse reads at genes. If a gene is on the negative strand and you get a 50/50 split of forward to reverse reads, then it could also be a mapping problem. One colleague had this issue, so take it as simple (trivial) advice.
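One way to do that inspection from count tables rather than by eye is sketched below. It assumes you already have per-gene sense and antisense counts as two dicts (for example from two counting runs); the function names and the 0.4 to 0.6 window for "roughly 50/50" are hypothetical choices for illustration.

```python
def antisense_fractions(sense, antisense):
    """Per-gene antisense fraction; genes with no reads at all are skipped.

    sense, antisense: {gene: read_count} (hypothetical layout).
    """
    fractions = {}
    for gene in set(sense) | set(antisense):
        s = sense.get(gene, 0)
        a = antisense.get(gene, 0)
        if s + a > 0:
            fractions[gene] = a / (s + a)
    return fractions

def roughly_unstranded(fractions, low=0.4, high=0.6):
    """Genes whose reads split close to 50/50 between the two strands."""
    return sorted(g for g, f in fractions.items() if low <= f <= high)

# Toy counts, invented for illustration.
sense = {"g1": 97, "g2": 50, "g3": 0}
anti = {"g1": 3, "g2": 50, "g3": 0}
fracs = antisense_fractions(sense, anti)
flagged = roughly_unstranded(fracs)
```

If most genes sit near 0.5, the library or the strandedness setting is the likely culprit; if only a handful do, they are worth looking at individually in a genome browser.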

**Nicolas Nalpas** · 10-25-2013, 02:21 AM

Thanks for your answer, Puggie. I am aware that I may (must) have some contamination (even using a strand-specific protocol), but since I do not have a spike-in in my samples, it is impossible to determine the level of contamination.

The SeqMonk software (as explained by Simon Andrews) may be the way to go, in order to statistically remove such contamination to identify my putative true antisense. Actually does anyone know if SeqMonk software can identify antisense if provided with paired-end data?
Thanks

**simonandrews** · 10-25-2013, 02:32 AM

There is an antisense analysis pipeline in SeqMonk which looks at the global level of antisense (i.e. how unclean your strand specificity is) and then does a binomial test on each individual gene to see if the proportion of antisense within that gene is incompatible with the global level.

It seems to work pretty well but the results tend to be contaminated by biological artefacts, particularly extended 3' UTRs which run over the adjacent gene. There's a video tutorial of this which shows the basic process.

**Nicolas Nalpas** · 10-25-2013, 03:47 AM

Thanks for your answer Simon, I input my data into SeqMonk and it seems to work all right (I did follow the video instructions).
Also, was I correct to input my BAM files (containing paired-end reads) without ticking the "split splice reads" option? My understanding from reading the manual is that selecting this option will make the software treat my reads as single-end reads, which is not really what I want when looking at antisense. Am I right?
As for the extended 3' UTR contamination, I was planning to just exclude those antisense hits that are too close to another gene's 3' UTR; hopefully that should work.
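A possible shape for that exclusion step is sketched here. It is a guess at the filter rather than a tested pipeline: the `Gene` record, the 1 kb default window and the use of the annotated gene end as a stand-in for the 3' UTR are all assumptions, and coordinates are taken as 0-based, half-open.

```python
from typing import NamedTuple

class Gene(NamedTuple):
    name: str
    chrom: str
    start: int   # 0-based, half-open
    end: int
    strand: str  # "+" or "-"

def three_prime_end(gene):
    """Genomic position of the gene's 3' end (assumed proxy for its 3' UTR)."""
    return gene.end if gene.strand == "+" else gene.start

def near_opposite_3prime(gene, genes, window=1000):
    """True if an opposite-strand gene's 3' end lies within `window` bp of `gene`."""
    for other in genes:
        if (other.name == gene.name or other.chrom != gene.chrom
                or other.strand == gene.strand):
            continue
        pos = three_prime_end(other)
        if gene.start - window <= pos <= gene.end + window:
            return True
    return False

# Toy annotation, invented for illustration.
a = Gene("a", "chr1", 1000, 5000, "+")
b = Gene("b", "chr1", 5200, 9000, "-")    # 3' end 200 bp downstream of a
c = Gene("c", "chr1", 20000, 25000, "-")  # far away
```

Antisense candidates for which `near_opposite_3prime` returns True would then be dropped or at least flagged. A linear scan like this is fine for a toy list but would want an interval index for a full annotation.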
Thanks a million for the help,

**arcolombo698** · 12-12-2013, 02:44 PM

When do you use library type -firststrand or secondstrand? Does this relate to the sense or antisense strand used in sequencing?

RNA-seq and sense/antisense expression differences