**Magdoll** · 03-01-2016, 02:44 PM

I would like some clarification on what you mean by "not producing a RoI".

The Iso-Seq classify steps are:

--- using the CCS algorithm (which is generic and used for many things in addition to Iso-Seq) to generate RoI reads (in the future, they may be called CCS reads again, sorry for all the naming changes!)

--- look at the RoI reads to identify 5' and 3' cDNA primers on the ends. It then "classifies" those RoI reads into full-length (has both 5' and 3' primer and polyA tail), and non-full-length (missing at least one of the criteria).
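To make the full-length rule above concrete, here is a minimal sketch in Python. It is not the pbtranscript implementation; the `PrimerHits` type and `classify_roi` function are hypothetical names used only for illustration, and the three flags are assumed to have been computed by an upstream primer/polyA detection step.

```python
from dataclasses import dataclass


@dataclass
class PrimerHits:
    """Hypothetical summary of what was detected on one RoI read."""
    has_5p_primer: bool
    has_3p_primer: bool
    has_polya_tail: bool


def classify_roi(hits: PrimerHits) -> str:
    """Apply the rule described above: all three features means full-length."""
    if hits.has_5p_primer and hits.has_3p_primer and hits.has_polya_tail:
        return "full-length"
    # Missing at least one of the three criteria.
    return "non-full-length"
```

Note that a ZMW with no RoI read at all never reaches this step, which is why cases (a) and (b) below are different situations.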

When you say "no RoI", do you mean:
(a) there was no RoI/CCS read for that ZMW.
or
(b) it was not full-length

Also, are all the libraries the same size? What is the avg. transcript length in these libraries?

I'm not entirely sure how I would explain what you observe (since I've not seen this myself). A while back I did a survey of number of passes vs. RoI full-length detection; the results differ from what you see and are closer to what I'd expect:

https://github.com/PacificBiosciences/cDNA_primer/wiki/Comparison-of-Reads-Of-Insert-parameters-and-full-length-detection

Also for reference, here is a tutorial on using classify. It explains the parameters in detail:

https://github.com/PacificBiosciences/cDNA_primer/wiki/RS_IsoSeq-%28v2.3%29-Tutorial-%231.-Getting-full-length-reads

And another wiki to explain what to expect from classify output:

https://github.com/PacificBiosciences/cDNA_primer/wiki/RS_IsoSeq-%28v2.3%29-Interpreting-Classify-and-Cluster-Output

**cklopp** · 03-03-2016, 12:34 AM

"no RoI" means (a) there was no RoI/CCS read for that ZMW.

I've simply compared the ZMW names in the initial subreads file with the names in the RoI file.
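A comparison like this can be sketched in a few lines of Python. This is only an illustration, not the script that was actually used. It assumes subread names look like `movie/holeNumber/start_end` and RoI names look like `movie/holeNumber/ccs`, so the first two slash-separated fields identify the ZMW. The function names are hypothetical.

```python
from collections import Counter


def zmw_id(read_name):
    """Return 'movie/holeNumber' from a read name (assumed naming scheme)."""
    fields = read_name.split()[0].split("/")
    return "/".join(fields[:2])


def subread_counts_without_roi(subread_names, roi_names):
    """Map each ZMW that has subreads but no RoI read to its subread count."""
    roi_zmws = {zmw_id(name) for name in roi_names}
    counts = Counter(zmw_id(name) for name in subread_names)
    return {zmw: n for zmw, n in counts.items() if zmw not in roi_zmws}


if __name__ == "__main__":
    subreads = ["m1/10/0_500", "m1/10/550_1000", "m1/11/0_800"]
    roi = ["m1/10/ccs"]
    print(subread_counts_without_roi(subreads, roi))
```

The subread count per ZMW is only a rough proxy for the number of passes, since the first and last subreads are often partial.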

The libraries are of three sizes (1-2kb, 2-3kb, 3-6kb). The average lengths are respectively 2kb, 2.5kb and 3.2kb.

**ndelaney** · 03-03-2016, 11:44 AM

Your reads are likely being filtered out by one of the filtering criteria (these can be set as options to the command).

If using CCS2, you should see a report such as ccs_report.csv that gives a breakdown of which reads were filtered and why. If using a more recent version of CCS1, the program will print a report after it finishes running that indicates the yield loss due to the various filters. If you can post either of these results here, I can give more guidance.

Why doesn't pbtranscript.py classify call reads of inserts for films with 2 to 5 rds
