![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Stranded BedGraph/BigWig for paired-end reads | id0 | Bioinformatics | 3 | 06-22-2017 02:13 PM |
mRNA fragmentation for 100bp paired-end reads | amdic2 | Illumina/Solexa | 7 | 09-01-2013 04:54 AM |
Mapping paired-end stranded RNA-seq data | hubin.keio | Bioinformatics | 3 | 03-28-2013 02:10 PM |
Can Cuffdiff treat paired-end and single-end reads at the same time? | zun | RNA Sequencing | 3 | 06-12-2012 05:37 PM |
![]() |
|
Thread Tools |
![]() |
#1 |
Member
Location: FL Join Date: Sep 2012
Posts: 13
|
![]()
I'm using the TruSeq Stranded Total RNA library preparation kit to generate libraries for 2x150 bp paired-end sequencing. Illumina's fragmentation protocol recommends 94°C for 8 minutes for intact, high quality RNA to generate insert lengths of 120-210 bp. If I am going to run a 2x150 bp sequencing run, I am concerned that 1) smaller fragments (< 150 bp) will preferentially bind and sequence, and 2) the read length may be overkill for this fragment population size and therefore be a waste of data. Illumina provides alternate fragmentation times for intact RNA, but I am unsure as to which is most efficient for a 2x150 bp (long RNA-seq) run. Can anyone provide feedback and/or experience with long RNA-seq?
![]() |
![]() |
![]() |
![]() |
#2 |
Jafar Jabbari
Location: Melbourne Join Date: Jan 2013
Posts: 1,235
|
![]()
Sequencing TruSeq RNA library with 2x150 will read into adapters. To maximise useful sequencing output you should use other kits such as NEBNext Ultra II RNAseq which allows preparing libraries with inserts up to 500 bp.
|
![]() |
![]() |
![]() |
#3 |
Member
Location: FL Join Date: Sep 2012
Posts: 13
|
![]()
Thanks, nucacidhunter, I'm also concerned with reading into the adapter or through the adapter into the flow cell and crashing the run; however, we have generated libraries using the "0 minute" fragmentation time (instead of a 94°C incubation, incubate at 65°C for 5 minutes, followed by a 4°C hold) with library traces ranging from 300-800 bp (~450 bp inserts) on a TapeStation -- no size selection was performed. My concern is accurate qPCR quantification and downstream cluster generation.
At this point, we will be using the TruSeq Stranded Total RNA kit, but I'm wondering if anyone has experience with altering fragmentation times using this kit to accommodate a 2x150 bp run on a NextSeq 500. Thanks. |
![]() |
![]() |
![]() |
#4 |
Jafar Jabbari
Location: Melbourne Join Date: Jan 2013
Posts: 1,235
|
![]()
If you can generate libraries in desired size with TruSeq (anecdotally some people have not been successful to obtain above 250 bp) then qPCR should not be an issue. You just need to take the average library size in 100-950 bp (the upper size depends on your qPCR setting for extension time) from BA or TapeStation to calculate molar concentration. This is common practice with TruSeq Nano DNA kits and Nextera that results in libraries with fragment sizes spanning above 1kb.
Edit:Reading into adapters will not crash the run but will give overall bad Q scores which should be fine after trimming adapters. Last edited by nucacidhunter; 09-19-2017 at 05:59 PM. |
![]() |
![]() |
![]() |
#5 |
Member
Location: FL Join Date: Sep 2012
Posts: 13
|
![]()
Thanks for the input. I'm using the KAPA library quantification kit for qPCR.
I will try altering fragmentation time to 2 min at 94°C and doing a double-sided SPRI size selection for 350-500 bp fragments. I am trying to decide on an appropriate Ampure XP bead ratio to use; would 0.55X/0.15X be sufficient? My thinking is to target insert sizes no less than 300 bp for a 2x150 bp run (Illumina recommends up to 550 bp inserts on the NextSeq). |
![]() |
![]() |
![]() |
#6 |
Jafar Jabbari
Location: Melbourne Join Date: Jan 2013
Posts: 1,235
|
![]()
If you have annealing and extension set for 45s then 950bp will be upper limit of amplification.
0.55x/0.15x should select your target range but the final library insert will depend on the size selection point (probably best after second strand synthesis). Keep in mind that size of fragmented RNA is not the only factor affecting the insert size and library insert size will be shorter than fragmented RNA length. |
![]() |
![]() |
![]() |
#7 |
Member
Location: FL Join Date: Sep 2012
Posts: 13
|
![]()
Yes, annealing/extension is set to 45 sec.
I was under the assumption that final library insert size was dependent on fragmentation time and post-ligation size selection. Perhaps that is my misunderstanding. I have never performed size selection after second strand synthesis. Ultimately, my overall goal is to make sure that I don’t waste valuable sequencing by decreasing the overlap between read 1 and read 2 for a 2 x 150 bp run. Based on Table 20. Library Insert Fragmentation Time on page 117 of the TruSeq Stranded Total RNA protocol, a 2 minute fragmentation time should generate an average library size of 410 bp or a 290 bp insert (410 bp – 120 bp to remove adapters = 290 bp insert size). Sequencing 150 bp in both directions would then allow for only a 10 bp overlap, increasing available template for sequencing and sensitivity to detect positional information for splice variant analysis or novel isoform discovery. Am I thinking about this correctly? What would you suggest? (Of course, the expected library size ranges are highly dependent on sample quality and accurate size selection, among other things.) |
![]() |
![]() |
![]() |
#8 |
Jafar Jabbari
Location: Melbourne Join Date: Jan 2013
Posts: 1,235
|
![]()
According to Illumina table the longest insert even without fragmentation is 200bp (130-350 range). To enrich for larger fragments you can clean up with low bead ratio (not double size selection) to remove smaller fragments (left size selection). This can be done after ligation or PCR (standard clean up is 0.9-1x bead). To cut off at 300 bp insert, ligated fragments shorter than 420 bp should be removed. This will result in lower yield and if the aim is transcript assembly or detecting isoform this is OK but would affect differential expression analysis.
For larger insert libraries a suitable kit that is able to generate larger cDNA fragments should be used. Edit: You can also do gel based size selection. Last edited by nucacidhunter; 09-20-2017 at 07:43 PM. |
![]() |
![]() |
![]() |
#9 |
Member
Location: FL Join Date: Sep 2012
Posts: 13
|
![]()
Given that the TruSeq kit is only capable of producing up to 200 bp inserts, and I intend to do a 2 x150 bp run, overlap is obviously unavoidable. I do not want to introduce bias in the data analysis by removing smaller fragments OR not fragmenting and possibly missing intact, mature mRNAs (>1 Kb). Unfortunately, I don't have the option of switching to a different kit to generate larger fragments. The PI's goal is to "get as much data as possible from a 2 x 150 bp run", and I'm trying to figure out the best protocol conditions for the TruSeq kit to optimize data output.
The 0 minute fragmentation (see image) seems to introduce some fragmentation, but, again, I'm concerned about size selection and sequencing a good representation of the biology. Perhaps I should just stick with some form of fragmentation to ensure coverage of the entire population in library conversion and perform a right size selection to remove any larger fragments (> 670 bp ligated fragments) that would not cluster efficiently on the NextSeq. What do you think? Thanks very much for your insight! |
![]() |
![]() |
![]() |
#10 |
Jafar Jabbari
Location: Melbourne Join Date: Jan 2013
Posts: 1,235
|
![]()
Library insert size is affected by RNA fragment size (user controllable) and random primers that can prime any region along RNA fragments and a fragment will be primed from multiple locations. That is the reason for relatively small inserts even without fragmentation. Some kits use less random primer so library insert size can be controlled by fragmentation time.
You do not need to remove larger fragments because they will cluster less efficiently and they do not affect sequencing results. |
![]() |
![]() |
![]() |
#11 |
Member
Location: FL Join Date: Sep 2012
Posts: 13
|
![]()
Should some form of fragmentation be performed then? Does no fragmentation introduce bias by not capturing the larger mRNAs in the sample?
|
![]() |
![]() |
![]() |
#12 |
Member
Location: Illinois Join Date: Oct 2014
Posts: 42
|
![]()
Take a look at Kapa's RNA Seq documentation (https://www.kapabiosystems.com/produ...mrna-seq-kits/). They have very nice alternate fragmentation conditions worked out. They use the same chemistry as Illumina to fragment the RNA, so now worries there. I'd consider using the 85 degree temperature, as opposed to 94. I also agree with nucacidhunter that a lower bead cleanup ratio (0.7X-0.8X) post-PCR will give you what you're looking for.
|
![]() |
![]() |
![]() |
#13 |
Member
Location: FL Join Date: Sep 2012
Posts: 13
|
![]()
Thanks, jteeee2. I will definitely try this method with a lower SPRI clean-up ratio, as suggested by nucacidhunter. KAPA has always been a reliable NGS resource.
|
![]() |
![]() |
![]() |
#14 | |
Jafar Jabbari
Location: Melbourne Join Date: Jan 2013
Posts: 1,235
|
![]() Quote:
To be sure you can try preparing two libraries from the same input RNA and compare the gene ranks and if correlation is high then you can process rest of your samples. If you are going to compare large insert libraries with standard ones, the possible variation caused by insert size need to be accounted and I would suggest to consult your bioinformatician for your experiment design. |
|
![]() |
![]() |
![]() |
#15 |
Member
Location: FL Join Date: Sep 2012
Posts: 13
|
![]()
Yes, I plan to prepare all samples using the same library prep methods.
Thanks very much for your feedback and helpful suggestions. |
![]() |
![]() |
![]() |
#16 |
Junior Member
Location: Nashville Join Date: Apr 2018
Posts: 2
|
![]()
I recommend to read article at NCBI on this topic and look at the research on ResearchGate for completeness of information. In addition, you can learn more about the processes associated with inorganic chemistry on this site and find answers to your questions. I also saw interesting information about Illumina fragmentation protocol in this thread.
Last edited by ebbert; 04-17-2018 at 01:12 AM. Reason: edit |
![]() |
![]() |
![]() |
Tags |
fragmentation, illumina, nextseq500, rna-seq, truseq |
Thread Tools | |
|
|