**Frank D** · 01-24-2012, 03:26 PM

Hi Metz,
Try the fastx_barcode_splitter.pl from the FASTX toolkit:
http://hannonlab.cshl.edu/fastx_toolkit/
You can specify the number of mismatches, and for a one-off analysis it is reasonably quick.

**Metz** · 01-24-2012, 06:00 PM

Thanks for the response, Frank, but I've already tried fastx. Unless I'm mistaken, it requires that the barcode be part of the read, not in the read name. I've thought about just writing a script to put the barcode back into the read, but that also requires adding 'mock' quality scores for those bases and making other changes to the read name. My Perl is several years out of practice, and I'm trying to avoid reinventing the wheel.

**GenoMax** · 01-25-2012, 04:18 AM

I assume you are looking to parse reads from the "Undetermined" reads file, which would have the reads with more than 1 mismatch.

Rather than putting the tags back into the reads, it would be more efficient to enumerate all the "tags" that are in your "undetermined tags" file and decide which ones you want to keep or extract.
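A minimal sketch of that enumeration in Python, in case it helps. It assumes the standard Casava 1.8 header layout, where the index sequence is the last colon-separated field of the read name (`@instrument:run:flowcell:lane:tile:x:y read:filtered:control:index`), and plain four-line FASTQ records. The function names and the example reads are made up for illustration.

```python
import io
from collections import Counter


def barcode_from_header(header):
    """Return the index sequence from a Casava 1.8 FASTQ header line.

    Assumes the index is the last colon-separated field of the header.
    """
    return header.rstrip().rsplit(":", 1)[-1]


def count_barcodes(handle):
    """Count index sequences over an open FASTQ handle.

    Assumes four lines per record, so every fourth line is a header.
    """
    counts = Counter()
    for line_number, line in enumerate(handle):
        if line_number % 4 == 0 and line.strip():
            counts[barcode_from_header(line)] += 1
    return counts


# Made-up records standing in for an "Undetermined" FASTQ file.
example = io.StringIO(
    "@M1:1:FC:1:1:10:20 1:N:0:ACGTAC\nAAAA\n+\nIIII\n"
    "@M1:1:FC:1:1:11:21 1:N:0:ACGTAC\nCCCC\n+\nIIII\n"
    "@M1:1:FC:1:1:12:22 1:N:0:TTGCAA\nGGGG\n+\nIIII\n"
)

# List barcodes from most to least abundant.
for barcode, n in count_barcodes(example).most_common():
    print(barcode, n)
```

For a real file you would pass an open file handle (or a `gzip.open(..., "rt")` handle for compressed output) instead of the `io.StringIO` stand-in.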

**Metz** · 01-25-2012, 06:44 AM

Do you know of a program that can do something along those lines?

**HESmith** · 01-25-2012, 07:30 AM

**Metz** · 01-26-2012, 06:06 AM

Thanks for the code. It works for giving me a list of barcodes and their counts. I can definitively tell which barcode the most abundant ones belong to. However, I'm not sure how to proceed from here. I just don't understand enough Perl to move forward quickly. If there isn't another solution, though, I guess that is the way to go. I just can't believe that nobody else has had this problem in the last year.

Demultiplexing Casava 1.8 reads

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News