anyone's doing RNA-seq with paired-end reads now? how to use pairing information is a challenge. any good ideas?
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Yes. We've done paired end RNAseq - partly because of mixed sample type (some DNAseq etc) flow cells.
The paired end info is useful for novel gene expression and splice junction discovery.
For simple RPKM etc we just treated each read as single end and aligned to genome+splice junctions seperately - this is likely to be suboptimal although easy approach. I'd be very interested in people's comments...
david
-
Rna-pet
Paired-end ditags (PET) is an interesting strategy. It is very economical way of finding splice junction variants. Of course it doesn't offer the same resolution as full paired end RNA-Seq. I believe the team at GIS have migrated to Illumina and SOLiD in particular for RNA-PET now.
Comment
-
Hi all, We have had success with PE RNAseq (illumina).
Some of the main pitfalls that we found:
-when you make your cDNA, if you use random hexamers (best results for us), your first strand synthesis reaction will tend to produce sequences that start/end in GC rich sequences. We have found that these tend to be non-accurate annealling products, and you can improve your mapping efficiency by stripping the first/last few bases.
- TopHAT (see bowtie/ben langmeads posts) has also been very useful (and extremely fast for mapping!!
- Creating 'spliced' reference genomes is very useful for retaining the benefits of PE mapping, without having to introduce huge 'insert' sizes...
- Avoid PET tags - the extra ligation steps, if you are not careful, can introduce a high percentage of 'false' splice events... With the illumina PE strategy there is no need for this...
Has anyone had any luck with barcoding RNAseq samples?
Ieuan
Comment
-
Originally posted by ieuanclay View PostHi all, We have had success with PE RNAseq (illumina).
Some of the main pitfalls that we found:
-when you make your cDNA, if you use random hexamers (best results for us), your first strand synthesis reaction will tend to produce sequences that start/end in GC rich sequences. We have found that these tend to be non-accurate annealling products, and you can improve your mapping efficiency by stripping the first/last few bases.
- TopHAT (see bowtie/ben langmeads posts) has also been very useful (and extremely fast for mapping!!
- Creating 'spliced' reference genomes is very useful for retaining the benefits of PE mapping, without having to introduce huge 'insert' sizes...
- Avoid PET tags - the extra ligation steps, if you are not careful, can introduce a high percentage of 'false' splice events... With the illumina PE strategy there is no need for this...
Has anyone had any luck with barcoding RNAseq samples?
Ieuan
Hi Ieuan,
It is interesting to see PE RNAseq.
I still ahve some doubts regarding mapping PE reads. Because ELAND can support eland_rna analysis for SR(single read) only.
Any tool to map tags with splice juction or can you please explain about the mapping part of PE RNA seq tags?
Thanks.
Comment
-
Check the forums here for TopHat if you want to know about mapping RNAseq!
The actual way that you map is very dependent on the source of your RNA, so i would need to know more about your experiments! My main experience is with nascent RNA, so we PE map againstt a genomic reference. Spliced RNA can be mapped against spliced genomes (again see TopHat). Or a combination of both can be used, i.e. mapping against the genomic reference using a sensible PE insert limit, to find nascent RNA, then mapping the reads that don't map to the reference genome against a spliced genome. This way you can flag potentially spliced and unspliced transcripts.
There really are too many ways of proceeding, you just have to pick a processing pipeline that makes sense for your material/experiment and that you can justify when it comes to publishing.
Hope this helps!?
Comment
Latest Articles
Collapse
-
by seqadmin
Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...-
Channel: Articles
12-16-2024, 07:57 AM -
-
by seqadmin
Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.
Long-Read Sequencing
Long-read sequencing has seen remarkable advancements,...-
Channel: Articles
12-02-2024, 01:49 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 12-17-2024, 10:28 AM
|
0 responses
39 views
0 likes
|
Last Post
by seqadmin
12-17-2024, 10:28 AM
|
||
Started by seqadmin, 12-13-2024, 08:24 AM
|
0 responses
52 views
0 likes
|
Last Post
by seqadmin
12-13-2024, 08:24 AM
|
||
Started by seqadmin, 12-12-2024, 07:41 AM
|
0 responses
38 views
0 likes
|
Last Post
by seqadmin
12-12-2024, 07:41 AM
|
||
Started by seqadmin, 12-11-2024, 07:45 AM
|
0 responses
46 views
0 likes
|
Last Post
by seqadmin
12-11-2024, 07:45 AM
|
Comment