I have 454 assembled contigs, Isotigs and singletons. The core facility that we used has done this for us, using Newbler. Do I still need to get rid of any adapter sequences or can I use these data directly for other analyses? In addition, how can I combine Isotigs, contigs and singltons?
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
-
The Singletons will still have the adaptor sequences. The contigs/isotigs mostly should not ( if at the time of running Newbler, a database of the adaptor sequences was specified ).
You can combine the singletons, contigs and the largest isotig from each isogroup and run it through one more assembly software ( Reference assembly if you have a reference genome ).
-
Originally posted by Khanjan View PostThe Singletons will still have the adaptor sequences.
We at Purdue Genomics do provide a Singleton.tfa file to our customers. The reads in this file have been trimmed using the data in 454TrimStatus.txt
Comment
-
Originally posted by westerman View PostI am not sure that follows. Singletons, per se, are not part of the Newbler output. The files 454Isotigs.fna and 454AllContigs.fna, sure, they exist. And the 454ReadStatus.tfa file tells where the reads went to. But there is no, as far as I know, Newbler generated singleton file.
We at Purdue Genomics do provide a Singleton.tfa file to our customers. The reads in this file have been trimmed using the data in 454TrimStatus.txt
ids of those which are singletons and then extract those singletons from the original sff files ( using sfffile/sffinfo )
Since the Singletons were not included in the Assembly, they wont be trimmed and will contain the adaptor.
Comment
-
Originally posted by Khanjan View PostNewbler does not generate the a file containing the singletons as it does for Isotigs and Contigs.
Maybe the person who generated it just took reads marked 'singleton' from the 454ReadStatus file.
Or maybe the person who generated the file took the reads marked 'singleton' plus the information from the 454TrimStatus file in order to create a singleton file that is trimmed.
sfffile has the '-t' option (File containing accno/trim line information) so it is quite easy to do the trimming. We do this routinely.
But all in all you can not assume that the singleton is untrimmed. Nor trimmed.
Comment
-
Originally posted by westerman View PostThat was my point. You can not say where the singleton file came from.
Maybe the person who generated it just took reads marked 'singleton' from the 454ReadStatus file.
Or maybe the person who generated the file took the reads marked 'singleton' plus the information from the 454TrimStatus file in order to create a singleton file that is trimmed.
sfffile has the '-t' option (File containing accno/trim line information) so it is quite easy to do the trimming. We do this routinely.
But all in all you can not assume that the singleton is untrimmed. Nor trimmed.
My sincere apologies
Comment
Latest Articles
Collapse
-
by seqadmin
Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...-
Channel: Articles
12-16-2024, 07:57 AM -
-
by seqadmin
Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.
Long-Read Sequencing
Long-read sequencing has seen remarkable advancements,...-
Channel: Articles
12-02-2024, 01:49 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 12-17-2024, 10:28 AM
|
0 responses
23 views
0 likes
|
Last Post
by seqadmin
12-17-2024, 10:28 AM
|
||
Started by seqadmin, 12-13-2024, 08:24 AM
|
0 responses
42 views
0 likes
|
Last Post
by seqadmin
12-13-2024, 08:24 AM
|
||
Started by seqadmin, 12-12-2024, 07:41 AM
|
0 responses
28 views
0 likes
|
Last Post
by seqadmin
12-12-2024, 07:41 AM
|
||
Started by seqadmin, 12-11-2024, 07:45 AM
|
0 responses
42 views
0 likes
|
Last Post
by seqadmin
12-11-2024, 07:45 AM
|
Comment