**maubp** · 04-04-2012, 02:51 AM

Are you trying to use the Roche off-instrument application suite to do the demultiplexing? If you program and want to work directly with the SFF files, there are other options, e.g. Biopython and BioHaskell.

**jimmybee** · 04-04-2012, 12:59 PM

Yeah, I'm using the command-line tools in a couple of scripts. I'm not so worried about the tools at the moment, just the process. I just wanted some feedback and advice on what problems others have run into and how they got around them.

I've always wanted to look into Biopython but never found the right project. I might spend a bit of time with it on this one.

**kmcarr** · 04-04-2012, 02:20 PM

Originally posted by jimmybee

...but we are having difficulties assigning the reverse read, compared to the forward read, which we can demultiplex nicely

Jimmybee,

I'm a bit confused as to what you mean by reverse read and forward read. The Roche/454 sequencer is only capable of generating a single, unidirectional read for any given DNA molecule. There are no 'forward' and 'reverse' reads in 454 sequencing. For both of your MID tags to be useful, you would have to make sure that the PCR product generated by your fusion primers is short enough that the entire length of the amplicon can be sequenced on the GS-FLX (Titanium, + or whatever). This would result in barcode A at the 5' end of your read and barcode B at the 3' end.

Since dual indexing is not supported by Roche/454, their tools won't help with the second index. You will have to do the sorting in two stages. Assuming your amplicon is sized such that the reads span its entire length, through the second barcode and into the keytag and primer B, the 454 runProcessor should have recognized the keytag-primer at the 3' end and trimmed it, leaving your barcode as the 3' end of your reads. You could use the SFF tools to sort based on the 5' barcode, then feed the resulting FASTA files into one of the various barcode sorting tools available (no specific recommendation, check the SeqAnswers Software Wiki) to subdivide each further. In my experience, most barcode sorting scripts expect the tag to be at the 5' end of the read, which means you would need to reverse-complement the reads after the first step.
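If you end up scripting it yourself, the two-stage idea could look roughly like the sketch below. It is plain Python with no SFF parsing, and it assumes the reads are already trimmed and loaded as (name, sequence) pairs. The barcode sequences, sample names, and function names are all made up for illustration, and matching is exact (no mismatches allowed), which real 454 data may not tolerate well.

```python
# Sketch only: every barcode sequence and name here is hypothetical.
COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def split_by_5prime_tag(reads, tags):
    """Bin (name, seq) reads by an exact tag match at the 5' end.

    The matched tag is stripped from the read. Reads matching no tag
    go into the "unassigned" bin unchanged.
    """
    bins = {}
    for name, seq in reads:
        for tag_name, tag in tags.items():
            if seq.startswith(tag):
                bins.setdefault(tag_name, []).append((name, seq[len(tag):]))
                break
        else:
            bins.setdefault("unassigned", []).append((name, seq))
    return bins


def demultiplex_dual(reads, tags_a, tags_b):
    """Two-stage sort: 5' barcode first, then the 3' barcode.

    tags_b must be given in primer orientation (as synthesized), because
    stage two reverse-complements each read so that barcode B ends up at
    the 5' end. Reads in the result are left in that flipped orientation.
    """
    result = {}
    for a_name, a_reads in split_by_5prime_tag(reads, tags_a).items():
        if a_name == "unassigned":
            result[("unassigned", None)] = a_reads
            continue
        flipped = [(n, reverse_complement(s)) for n, s in a_reads]
        for b_name, b_reads in split_by_5prime_tag(flipped, tags_b).items():
            result[(a_name, b_name)] = b_reads
    return result


if __name__ == "__main__":
    tags_a = {"A1": "AACCG", "A2": "TTGGC"}
    tags_b = {"B1": "GATTC", "B2": "CTAAG"}
    reads = [
        ("r1", "AACCG" + "TTTTACGGA" + reverse_complement("GATTC")),
        ("r2", "TTGGC" + "GGGAAAC" + reverse_complement("CTAAG")),
    ]
    for key, binned in demultiplex_dual(reads, tags_a, tags_b).items():
        print(key, binned)
```

Because the second stage flips the reads, anything coming out of `demultiplex_dual` is the reverse complement of the original insert; run `reverse_complement` on it again if you need the original orientation.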

**jimmybee** · 04-04-2012, 03:22 PM

Originally posted by kmcarr

Jimmybee,

I'm a bit confused as to what you mean by reverse read and forward read. The Roche/454 sequencer is only capable of generating a single, unidirectional read for any given DNA molecule. There are no 'forward' and 'reverse' reads in 454 sequencing. For both of your MID tags to be useful, you would have to make sure that the PCR product generated by your fusion primers is short enough that the entire length of the amplicon can be sequenced on the GS-FLX (Titanium, + or whatever). This would result in barcode A at the 5' end of your read and barcode B at the 3' end.

This is fine; we have only a 215 bp fragment, so we're definitely catching the 3' barcode. Sorry, I didn't exactly mean reads (more the forward and reverse directions).

Originally posted by kmcarr

Since dual indexing is not supported by Roche/454, their tools won't help with the second index. You will have to do the sorting in two stages. Assuming your amplicon is sized such that the reads span its entire length, through the second barcode and into the keytag and primer B, the 454 runProcessor should have recognized the keytag-primer at the 3' end and trimmed it, leaving your barcode as the 3' end of your reads. You could use the SFF tools to sort based on the 5' barcode, then feed the resulting FASTA files into one of the various barcode sorting tools available (no specific recommendation, check the SeqAnswers Software Wiki) to subdivide each further. In my experience, most barcode sorting scripts expect the tag to be at the 5' end of the read, which means you would need to reverse-complement the reads after the first step.

Thanks for the advice. I think I'll use either BioPerl or Biopython to create something solid now, as we expect to be doing similar projects in the future. I've heard that the Roche tools are no help in this sort of setup.

Template-specific bidirectional demultiplexing of sff files from 454
