Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • What are these BC files for? Should I use them in alignment?

    Hi, folks. I've started working with some old SOLiD single-ended RNA-seq reads, from 2010 and 2011. I'm using novoalignCS and have the quality files, csfasta files, and an additional fasta-like file with "BC" in the filename. Here's an example of the filenames:

    S1001938CIS_7/primary.20100713155754122/reads/
    solid0015_20100706_S1001938CIS_BC_bcSample1_F3.csfasta
    solid0015_20100706_S1001938CIS_BC_bcSample1_F3.stats
    solid0015_20100706_S1001938CIS_BC_bcSample1_F3_QV.qual
    S1001938CIS_7/primary.20100707172116543/reads/
    solid0015_20100706_S1001938CIS_BC_bcSample1_BC.csfasta

    The F3.csfasta and F3_QV.qual files are as expected, and work fine with novoalignCS.

    The BC.csfasta files have data as follows and I'm completely mystified as to what they are:

    # Wed Jul 7 11:02:39 2010 /share/apps/corona/bin/filter_fasta.pl --output=/data/results/solid0015/solid0015_20100706_S1001938CIS_BC/bcSample1/results.F1B1/primary.20100707172116543 --name=solid0015_20100706_S1001938CIS_BC_bcSample1 --tag=BC --minlength=5 --mincalls=25 --prefix=G /data/results/solid0015/solid0015_20100706_S1001938CIS_BC/bcSample1/jobs/postPrimerSetPrimary.1505/rawseq
    # Cwd: /home/pipeline
    # Title: solid0015_20100706_S1001938CIS_BC_bcSample1
    # Library:S1001938CIS_7:00313
    >1_223_2_BC 0
    G00313
    >1_238_37_BC 0
    G00313
    >1_240_14_BC 0
    G00313

    Anyone know? Should I be using these BC files in some way? They have extremely little information content.
    Sam Hokin
    Computational Scientist, Carnegie and NCGR

  • #2
    Well, the main thing is your csfasta and qual files work fine.

    Could BC be barcode ? They seemt to be short, as in barcodes, and the format seems to be the csfasta format.

    Not sure I ever saw these in my days of SOLiD adventures, which ended in 2012 (thank god).

    Comment


    • #3
      Hey, the data turned out OK and was useful! But yeah, I had to dust off the hard drive the reads were on, it'd been sitting on a shelf for eight years or so.
      Sam Hokin
      Computational Scientist, Carnegie and NCGR

      Comment


      • #4
        wow that is some really good work that you've been doing there. I had been closely associated with some projects on Encodeproject on RNA sequence measures.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Techniques and Challenges in Conservation Genomics
          by seqadmin



          The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

          Avian Conservation
          Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
          03-08-2024, 10:41 AM
        • seqadmin
          The Impact of AI in Genomic Medicine
          by seqadmin



          Artificial intelligence (AI) has evolved from a futuristic vision to a mainstream technology, highlighted by the introduction of tools like OpenAI's ChatGPT and Google's Gemini. In recent years, AI has become increasingly integrated into the field of genomics. This integration has enabled new scientific discoveries while simultaneously raising important ethical questions1. Interviews with two researchers at the center of this intersection provide insightful perspectives into...
          02-26-2024, 02:07 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 03-14-2024, 06:13 AM
        0 responses
        33 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 03-08-2024, 08:03 AM
        0 responses
        72 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 03-07-2024, 08:13 AM
        0 responses
        80 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 03-06-2024, 09:51 AM
        0 responses
        68 views
        0 likes
        Last Post seqadmin  
        Working...
        X