Transcript length/frequncy plot
I've created a few different assemblies, and I'd like to make a dotplot of the number of transcripts at a certain length. I'm using R or python. Right now I've got to the point where my data looks like this (each individual transcript is listed):
Line number, Data set, Length
"42","b",1258
"43","b",517
"44","b",529
"45","b",593
"46","b",1075
"47","b",772
I want to count the number of times a certain length transcript occurs so I can plot length vs. frequency and show which assemblies are generating longer transcripts. I can do this with R once I have the frequencies, but I can't figure out how to do that. Ultimately, I'd like to turn that file into something like:
Length, Quantity
1258, 38
517, 79
Etc...any recommendations on how to transform this?
