Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • What are these BC files for? Should I use them in alignment?

    Hi, folks. I've started working with some old SOLiD single-ended RNA-seq reads, from 2010 and 2011. I'm using novoalignCS and have the quality files, csfasta files, and an additional fasta-like file with "BC" in the filename. Here's an example of the filenames:

    S1001938CIS_7/primary.20100713155754122/reads/
    solid0015_20100706_S1001938CIS_BC_bcSample1_F3.csfasta
    solid0015_20100706_S1001938CIS_BC_bcSample1_F3.stats
    solid0015_20100706_S1001938CIS_BC_bcSample1_F3_QV.qual
    S1001938CIS_7/primary.20100707172116543/reads/
    solid0015_20100706_S1001938CIS_BC_bcSample1_BC.csfasta

    The F3.csfasta and F3_QV.qual files are as expected, and work fine with novoalignCS.

    The BC.csfasta files have data as follows and I'm completely mystified as to what they are:

    # Wed Jul 7 11:02:39 2010 /share/apps/corona/bin/filter_fasta.pl --output=/data/results/solid0015/solid0015_20100706_S1001938CIS_BC/bcSample1/results.F1B1/primary.20100707172116543 --name=solid0015_20100706_S1001938CIS_BC_bcSample1 --tag=BC --minlength=5 --mincalls=25 --prefix=G /data/results/solid0015/solid0015_20100706_S1001938CIS_BC/bcSample1/jobs/postPrimerSetPrimary.1505/rawseq
    # Cwd: /home/pipeline
    # Title: solid0015_20100706_S1001938CIS_BC_bcSample1
    # Library:S1001938CIS_7:00313
    >1_223_2_BC 0
    G00313
    >1_238_37_BC 0
    G00313
    >1_240_14_BC 0
    G00313

    Anyone know? Should I be using these BC files in some way? They have extremely little information content.
    Sam Hokin
    Computational Scientist, Carnegie and NCGR

  • #2
    Well, the main thing is your csfasta and qual files work fine.

    Could BC be barcode ? They seemt to be short, as in barcodes, and the format seems to be the csfasta format.

    Not sure I ever saw these in my days of SOLiD adventures, which ended in 2012 (thank god).

    Comment


    • #3
      Hey, the data turned out OK and was useful! But yeah, I had to dust off the hard drive the reads were on, it'd been sitting on a shelf for eight years or so.
      Sam Hokin
      Computational Scientist, Carnegie and NCGR

      Comment


      • #4
        wow that is some really good work that you've been doing there. I had been closely associated with some projects on Encodeproject on RNA sequence measures.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin


          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
          Yesterday, 07:01 AM
        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        55 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        51 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 09:21 AM
        0 responses
        45 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-04-2024, 09:00 AM
        0 responses
        55 views
        0 likes
        Last Post seqadmin  
        Working...
        X