Thread | Thread Starter | Forum | Replies | Last Post |
Program to view multiple alignment files at once | teaelleceecee | Bioinformatics | 1 | 02-11-2019 12:23 PM |
STAR alignment with multiple fq files | graceqy | Bioinformatics | 2 | 04-16-2016 10:32 AM |
No BWA alignment output files! | thh32 | Bioinformatics | 4 | 09-09-2014 07:04 AM |
export scaffold files from SAM/BAM alignment | dnajuice | Illumina/Solexa | 0 | 08-25-2012 02:17 PM |
How to do CASAVA alignment by using fastq files | weasteam | Bioinformatics | 2 | 01-03-2012 12:18 PM |
#1 |
Member
Location: Santa Fe, NM Join Date: Nov 2013
Posts: 20
|
![]()
Hi, folks. I've started working with some old SOLiD single-ended RNA-seq reads, from 2010 and 2011. I'm using novoalignCS and have the quality files, csfasta files, and an additional fasta-like file with "BC" in the filename. Here's an example of the filenames:
    S1001938CIS_7/primary.20100713155754122/reads/ solid0015_20100706_S1001938CIS_BC_bcSample1_F3.csfasta
    S1001938CIS_7/primary.20100707172116543/reads/ solid0015_20100706_S1001938CIS_BC_bcSample1_BC.csfasta

The F3.csfasta and F3_QV.qual files are as expected, and work fine with novoalignCS. The BC.csfasta files have data as follows and I'm completely mystified as to what they are:

    # Wed Jul 7 11:02:39 2010 /share/apps/corona/bin/filter_fasta.pl --output=/data/results/solid0015/solid0015_20100706_S1001938CIS_BC/bcSample1/results.F1B1/primary.20100707172116543 --name=solid0015_20100706_S1001938CIS_BC_bcSample1 --tag=BC --minlength=5 --mincalls=25 --prefix=G /data/results/solid0015/solid0015_20100706_S1001938CIS_BC/bcSample1/jobs/postPrimerSetPrimary.1505/rawseq
    # Cwd: /home/pipeline
    # Title: solid0015_20100706_S1001938CIS_BC_bcSample1
    # Library:S1001938CIS_7:00313
    >1_223_2_BC 0 G00313
    >1_238_37_BC 0 G00313
    >1_240_14_BC 0 G00313

Anyone know? Should I be using these BC files in some way? They have extremely little information content.
__________________
Sam Hokin Computational Scientist, Carnegie and NCGR |
#2 |
Senior Member
Location: Germany Join Date: Oct 2008
Posts: 415
|
![]()
Well, the main thing is your csfasta and qual files work fine.
Could BC be barcode? They seem to be short, as in barcodes, and the format seems to be the csfasta format. Not sure I ever saw these in my days of SOLiD adventures, which ended in 2012 (thank god). |
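One quick way to check the barcode idea would be to tally the distinct sequences in a BC.csfasta file. Below is a minimal Python sketch, not a tested tool. It assumes the usual csfasta layout ("#" comment lines, then a ">" read-name line followed by one sequence line per read). The function name `tally_csfasta_sequences` and the sample records are made up for illustration.

```python
from collections import Counter
import io


def tally_csfasta_sequences(handle):
    """Count how often each sequence line occurs in a csfasta-style stream.

    Assumption: '#' lines are comments, '>' lines are read names, and every
    other non-blank line is a sequence.
    """
    counts = Counter()
    for line in handle:
        line = line.strip()
        # Skip blank lines, '#' comment lines, and '>' read-name lines.
        if not line or line.startswith("#") or line.startswith(">"):
            continue
        counts[line] += 1
    return counts


# Made-up sample in the assumed layout; swap in open("your_BC.csfasta")
# to run it on a real file.
sample = io.StringIO(
    "# Title: example\n"
    ">1_223_2_BC\n"
    "G00313\n"
    ">1_238_37_BC\n"
    "G00313\n"
    ">1_240_14_BC\n"
    "G00312\n"
)

counts = tally_csfasta_sequences(sample)
for seq, n in counts.most_common():
    print(seq, n)
```

If only a handful of distinct sequences turn up, that would at least be consistent with a barcode. It may also be worth noting that the 00313 in the records posted above matches the end of the "# Library:" header line.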
#3 | |
Junior Member
Location: Washington Join Date: Mar 2020
Posts: 6
|
![]() Quote:
#4 |
Member
Location: Santa Fe, NM Join Date: Nov 2013
Posts: 20
|
![]()
Hey, the data turned out OK and was useful! But yeah, I had to dust off the hard drive the reads were on, it'd been sitting on a shelf for eight years or so.
![]()
__________________
Sam Hokin Computational Scientist, Carnegie and NCGR |
#5 |
Junior Member
Location: United States Join Date: Feb 2021
Posts: 2
|
![]()
Wow, that is some really good work you've been doing there. I was closely involved with some Encodeproject projects on RNA sequence measures.
__________________
"Anyone who has never made a mistake, has never learnt anything new" - Albert Einstein I work as a consultant and research analyst at eduhelphub |
Tags |
solid data analysis |