  • samhokin
    Member
    • Nov 2013
    • 20

    What are these BC files for? Should I use them in alignment?

    Hi, folks. I've started working with some old SOLiD single-end RNA-seq reads from 2010 and 2011. I'm using novoalignCS and have the quality files, the csfasta files, and an additional fasta-like file with "BC" in the filename. Here's an example of the filenames:

    S1001938CIS_7/primary.20100713155754122/reads/
    solid0015_20100706_S1001938CIS_BC_bcSample1_F3.csfasta
    solid0015_20100706_S1001938CIS_BC_bcSample1_F3.stats
    solid0015_20100706_S1001938CIS_BC_bcSample1_F3_QV.qual
    S1001938CIS_7/primary.20100707172116543/reads/
    solid0015_20100706_S1001938CIS_BC_bcSample1_BC.csfasta

    The F3.csfasta and F3_QV.qual files are as expected, and work fine with novoalignCS.

    The BC.csfasta files contain data like the following, and I'm completely mystified as to what they are:

    # Wed Jul 7 11:02:39 2010 /share/apps/corona/bin/filter_fasta.pl --output=/data/results/solid0015/solid0015_20100706_S1001938CIS_BC/bcSample1/results.F1B1/primary.20100707172116543 --name=solid0015_20100706_S1001938CIS_BC_bcSample1 --tag=BC --minlength=5 --mincalls=25 --prefix=G /data/results/solid0015/solid0015_20100706_S1001938CIS_BC/bcSample1/jobs/postPrimerSetPrimary.1505/rawseq
    # Cwd: /home/pipeline
    # Title: solid0015_20100706_S1001938CIS_BC_bcSample1
    # Library:S1001938CIS_7:00313
    >1_223_2_BC 0
    G00313
    >1_238_37_BC 0
    G00313
    >1_240_14_BC 0
    G00313

    Anyone know? Should I be using these BC files in some way? They have extremely little information content.
    Sam Hokin
    Computational Scientist, Carnegie and NCGR
  • colindaven
    Senior Member
    • Oct 2008
    • 417

    #2
    Well, the main thing is your csfasta and qual files work fine.

    Could BC be barcode? The sequences seem to be short, as barcodes are, and the file appears to be in csfasta format.

    Not sure I ever saw these in my days of SOLiD adventures, which ended in 2012 (thank god).
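
    One quick way to check the barcode guess is to tally the distinct sequences in a BC.csfasta file. The sketch below is only an illustration, not part of any SOLiD or novoalignCS tooling: the function name tally_csfasta is made up here, and the sample text is just the three records quoted in the first post. It assumes the layout shown there: "#" comment lines, ">" header lines, and one sequence line per read.

```python
from collections import Counter
from typing import Iterable


def tally_csfasta(lines: Iterable[str]) -> Counter:
    """Count how often each sequence string occurs in csfasta-style text.

    Hypothetical helper: skips blank lines, '#' comment lines and
    '>' header lines, and counts every remaining line as one read.
    """
    counts: Counter = Counter()
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("#") or line.startswith(">"):
            continue
        counts[line] += 1
    return counts


# The three records quoted in the first post, used as sample input.
sample = """\
# Cwd: /home/pipeline
# Title: solid0015_20100706_S1001938CIS_BC_bcSample1
# Library:S1001938CIS_7:00313
>1_223_2_BC 0
G00313
>1_238_37_BC 0
G00313
>1_240_14_BC 0
G00313
"""

counts = tally_csfasta(sample.splitlines())
print(counts.most_common(5))  # [('G00313', 3)]
```

    For a real file, pass an open file handle instead of sample.splitlines(). If only one or a handful of distinct strings come back, that would fit the idea that the file holds per-read barcode calls rather than anything to align. In the quoted excerpt, the digits 00313 also match the last field of the "# Library:" header line.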


    • samhokin
      Member
      • Nov 2013
      • 20

      #3
      Hey, the data turned out OK and was useful! But yeah, I had to dust off the hard drive the reads were on; it had been sitting on a shelf for eight years or so.
      Sam Hokin
      Computational Scientist, Carnegie and NCGR


      • awhite95
        Banned
        • Feb 2021
        • 2

        #4
        Wow, that is some really good work you've been doing there. I was closely associated with some projects on Encodeproject on RNA sequence measures.
