Hi, folks. I've started working with some old SOLiD single-ended RNA-seq reads, from 2010 and 2011. I'm using novoalignCS and have the quality files, csfasta files, and an additional fasta-like file with "BC" in the filename. Here's an example of the filenames:
S1001938CIS_7/primary.20100713155754122/reads/
The F3.csfasta and F3_QV.qual files are as expected, and work fine with novoalignCS.
The BC.csfasta files have data as follows and I'm completely mystified as to what they are:
# Wed Jul 7 11:02:39 2010 /share/apps/corona/bin/filter_fasta.pl --output=/data/results/solid0015/solid0015_20100706_S1001938CIS_BC/bcSample1/results.F1B1/primary.20100707172116543 --name=solid0015_20100706_S1001938CIS_BC_bcSample1 --tag=BC --minlength=5 --mincalls=25 --prefix=G /data/results/solid0015/solid0015_20100706_S1001938CIS_BC/bcSample1/jobs/postPrimerSetPrimary.1505/rawseq
# Cwd: /home/pipeline
# Title: solid0015_20100706_S1001938CIS_BC_bcSample1
# Library:S1001938CIS_7:00313
>1_223_2_BC 0
G00313
>1_238_37_BC 0
G00313
>1_240_14_BC 0
G00313
Anyone know? Should I be using these BC files in some way? They have extremely little information content.
S1001938CIS_7/primary.20100713155754122/reads/
solid0015_20100706_S1001938CIS_BC_bcSample1_F3.csfasta
solid0015_20100706_S1001938CIS_BC_bcSample1_F3.stats
solid0015_20100706_S1001938CIS_BC_bcSample1_F3_QV.qual
S1001938CIS_7/primary.20100707172116543/reads/solid0015_20100706_S1001938CIS_BC_bcSample1_F3.stats
solid0015_20100706_S1001938CIS_BC_bcSample1_F3_QV.qual
solid0015_20100706_S1001938CIS_BC_bcSample1_BC.csfasta
The F3.csfasta and F3_QV.qual files are as expected, and work fine with novoalignCS.
The BC.csfasta files have data as follows and I'm completely mystified as to what they are:
# Wed Jul 7 11:02:39 2010 /share/apps/corona/bin/filter_fasta.pl --output=/data/results/solid0015/solid0015_20100706_S1001938CIS_BC/bcSample1/results.F1B1/primary.20100707172116543 --name=solid0015_20100706_S1001938CIS_BC_bcSample1 --tag=BC --minlength=5 --mincalls=25 --prefix=G /data/results/solid0015/solid0015_20100706_S1001938CIS_BC/bcSample1/jobs/postPrimerSetPrimary.1505/rawseq
# Cwd: /home/pipeline
# Title: solid0015_20100706_S1001938CIS_BC_bcSample1
# Library:S1001938CIS_7:00313
>1_223_2_BC 0
G00313
>1_238_37_BC 0
G00313
>1_240_14_BC 0
G00313
Anyone know? Should I be using these BC files in some way? They have extremely little information content.
Comment