I have 454 assembled contigs, Isotigs and singletons. The core facility that we used has done this for us, using Newbler. Do I still need to get rid of any adapter sequences or can I use these data directly for other analyses? In addition, how can I combine Isotigs, contigs and singltons?
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
-
The Singletons will still have the adaptor sequences. The contigs/isotigs mostly should not ( if at the time of running Newbler, a database of the adaptor sequences was specified ).
You can combine the singletons, contigs and the largest isotig from each isogroup and run it through one more assembly software ( Reference assembly if you have a reference genome ).
-
Originally posted by Khanjan View PostThe Singletons will still have the adaptor sequences.
We at Purdue Genomics do provide a Singleton.tfa file to our customers. The reads in this file have been trimmed using the data in 454TrimStatus.txt
Comment
-
Originally posted by westerman View PostI am not sure that follows. Singletons, per se, are not part of the Newbler output. The files 454Isotigs.fna and 454AllContigs.fna, sure, they exist. And the 454ReadStatus.tfa file tells where the reads went to. But there is no, as far as I know, Newbler generated singleton file.
We at Purdue Genomics do provide a Singleton.tfa file to our customers. The reads in this file have been trimmed using the data in 454TrimStatus.txt
ids of those which are singletons and then extract those singletons from the original sff files ( using sfffile/sffinfo )
Since the Singletons were not included in the Assembly, they wont be trimmed and will contain the adaptor.
Comment
-
Originally posted by Khanjan View PostNewbler does not generate the a file containing the singletons as it does for Isotigs and Contigs.
Maybe the person who generated it just took reads marked 'singleton' from the 454ReadStatus file.
Or maybe the person who generated the file took the reads marked 'singleton' plus the information from the 454TrimStatus file in order to create a singleton file that is trimmed.
sfffile has the '-t' option (File containing accno/trim line information) so it is quite easy to do the trimming. We do this routinely.
But all in all you can not assume that the singleton is untrimmed. Nor trimmed.
Comment
-
Originally posted by westerman View PostThat was my point. You can not say where the singleton file came from.
Maybe the person who generated it just took reads marked 'singleton' from the 454ReadStatus file.
Or maybe the person who generated the file took the reads marked 'singleton' plus the information from the 454TrimStatus file in order to create a singleton file that is trimmed.
sfffile has the '-t' option (File containing accno/trim line information) so it is quite easy to do the trimming. We do this routinely.
But all in all you can not assume that the singleton is untrimmed. Nor trimmed.
My sincere apologies
Comment
Latest Articles
Collapse
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Today, 08:47 AM
|
0 responses
11 views
0 likes
|
Last Post
by seqadmin
Today, 08:47 AM
|
||
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
60 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
59 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|
||
Started by seqadmin, 04-10-2024, 09:21 AM
|
0 responses
54 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 09:21 AM
|
Comment