Go Back   SEQanswers > Bioinformatics > Bioinformatics

Similar Threads
Thread Thread Starter Forum Replies Last Post
FASTX Toolkit barcode splitter issue jdanderson Bioinformatics 36 01-31-2016 07:09 PM
Illumina1.8 Paired-End Barcode Splitting? pbatzel Bioinformatics 2 10-25-2011 03:08 PM
Bowtie output from paired end reads godzilla07 Bioinformatics 0 01-06-2011 12:36 PM
BOth single and paired end reads in a file!! adgen Illumina/Solexa 0 06-30-2010 11:28 AM
Paired-end & shotgun reads nickloman 454 Pyrosequencing 4 03-11-2010 01:22 AM

Thread Tools
Old 08-09-2011, 09:02 AM   #1
Junior Member
Location: Austria, Vienna

Join Date: Jul 2011
Posts: 4
Default Barcode splitter - for paired end reads & specifiy prefix in output file

I'm new to SEQanswers, and more or less new to NGS data (since Juli).
I'm looking for a barcode splitter that
1.) can handle Illumina paired end reads (keeps them as paired end after splitting)
2.) lets you specifiy a prefix for the output files (makes it easier for usage in a pipeline)

I know barcode splitters like Novobarcode that can handle case 1.) or fastx barcode splitter that offers case 2.) Does anyone know a barcode splitter that combines both features?

Thanks in advance for any hint, really appreciate it!

spoonman is offline   Reply With Quote
Old 08-09-2011, 11:06 AM   #2
Senior Member
Location: Cambridge, MA

Join Date: Mar 2009
Posts: 141

I don't know a tool that combines both, but a work-around pipeline might be to:
1) split read 1 file with fastx_barcode_splitter
2) construct lists of reads from each library
3) extract those reads from the original read 2 file using cdbyank or Galaxy tools (
greigite is offline   Reply With Quote
Old 08-10-2011, 04:54 PM   #3
Location: seattle

Join Date: Mar 2010
Posts: 14

Originally Posted by greigite View Post
I don't know a tool that combines both, but a work-around pipeline might be to:
1) split read 1 file with fastx_barcode_splitter
In my experience fastx_barcode_splitter cannot be used to split illumina barcode output. Maybe our sequencing guys have a non-standard setup, but when we get illumina barcode results they are split into two files; one with the sequences and another file with just the barcodes. Is this normal?

fastx_barcode_splitter seems to assume that the barcode is at the 3' end of each read, which in our case is not true. We have to match up lines from one file with lines from the other to split the data by barcode.

I ended up writing a perl script (loosely based on that handled the case of barcodes in a separate file from the main sequencing results.
cswarth is offline   Reply With Quote

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

All times are GMT -8. The time now is 07:24 AM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2021, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO