Seqanswers Leaderboard Ad

**maubp** · 08-22-2012, 12:36 PM

It might be valid, but appears to be using line wrapping, which is not recommended. Some tools assume 4 lines per record (for speed) and line wrapping breaks them. Other tools have more robust (but slightly slower) FASTQ parsers which will cope.

You could try running this line wrapped FASTQ through EMBOSS seqret or Biopython or something similar which will accept line wrapped FASTQ input, but produce typical 4 line per record unwrapped FASTQ as output.

**SES** · 08-23-2012, 05:37 AM

I took a look at main.c in tagdust and saw this at line 52:

Code:

	if(!param->linewrap){
		linewrap = 0;
	}else{
		linewrap = 1;
	}

which suggested there was an option to control this behavior. Sure enough, after you compile the program you can see that there is an option to print the sequences on one line (the -s option, specifically).

Code:

$ ./tagdust
TagDust version 1.13, Copyright (C) 2009 Timo Lassmann <[email protected]>

Usage: tagdust [options]  lib.fa read1.fa read2.fa ...
	

	Options:
	-f, -fdr	False discovery rate (default: 0.01)
	-o <file>	print clean tags to file.
	-a <file>	print artifactual tags to file.
	-trim5 <X>	trim 'X' residues from the start of all reads.
	-trim3 <X>	trim 'X' residues from the end of all reads.
	-fasta		output format is fasta.
	-s, -singleline	sequences are written in a single line.
	-q, -quiet	quite mode

	Identifies tags as artifactual sequences if they match to library sequences.
	Library sequences must be in fasta; tag sequences in either fasta or fastq format.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 30 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 32 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 52 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

tagdust output is not fastq!

Comment

Comment

Latest Articles

ad_right_rmr

News