Does anybody have a tested program to convert Illumina CASAVA 1.8 qual scores (Phred+33) to the previous version Illumina 1.5+ (Phred+64)?
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
What about something that I can run in the command lene?
Most programs should work with Phred+33 (e.g. append '-Q 33' to the fastx command line).
Also, Biopython can read files as one format and write them as another:
Here's a quick python conversion script, derived from an example on that page:
Code:#!/usr/bin/python from Bio import SeqIO SeqIO.convert("input.fastq", "fastq-sanger", "output.fastq", "fastq-illumina")
Comment
-
Originally posted by gringer View PostWhy do you need a Phred+64 offset and/or the command line?
Originally posted by gringer View PostMost programs should work with Phred+33 (e.g. append '-Q 33' to the fastx command line).
Also, Biopython can read files as one format and write them as another:
Here's a quick python conversion script, derived from an example on that page:
Code:#!/usr/bin/python from Bio import SeqIO SeqIO.convert("input.fastq", "fastq-sanger", "output.fastq", "fastq-illumina")
Comment
-
Originally posted by lpn View PostBecause tophat doesn't seem to handle well Phred+33 (CASAVA 1.8), but works with Phred+64 (CASAVA 1.5+).
There's also the Bowtie "--phred33-quals" option, which I guess you could add to tophat's bowtie call to force this:
Code:nano $(which tophat)
Comment
-
As gringer said if you DO NOT specify a qual flag it will work fine. You are not the only person with HiSeq data which finally encodes the quality values in the standard sanger format for which nearly all programs expect by by default. The flag, for most today, is just for processing legacy datasets.
Comment
-
Originally posted by gringer View PostThis is interesting and doesn't match my experience with tophat on recent Illumina runs. Do you have any "--solexa1.3-quals" options on your command line? Removing that should stop bowtie from using Phred+64, and go back to the default Phred+33.
There's also the Bowtie "--phred33-quals" option, which I guess you could add to tophat's bowtie call to force this:
Code:nano $(which tophat)
Last edited by lpn; 03-03-2012, 07:53 AM.
Comment
-
Originally posted by lpn View PostThat works, but subsequent analysis produces strange results.Last edited by gringer; 03-03-2012, 11:57 AM.
Comment
Latest Articles
Collapse
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Today, 11:49 AM
|
0 responses
12 views
0 likes
|
Last Post
by seqadmin
Today, 11:49 AM
|
||
Started by seqadmin, Yesterday, 08:47 AM
|
0 responses
16 views
0 likes
|
Last Post
by seqadmin
Yesterday, 08:47 AM
|
||
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
61 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
60 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|
Comment