Dear All,
We are a small research group working on compressing fastq file information. We have read through the format specifications (as described in http://nar.oxfordjournals.org/content/38/6/1767) and have the following questions:
1. How significant is each field in a fastq file? We know that the sequence data is important. What about quality data? Do we need to accurately reproduce it?
2. Title line: What aspects of title line are important? What does the title line typically signify. What aspects of title should be retained and what can be dropped? We want to know, what information in the title line is most important to those working with the fastq file.
3. Is the order of reads as stored in a fastq file important or can the reads be reordered to make a new fastq file. Note that the new fastq file will contain the same but reordered information.
Thank you all very much in advance for any inputs that we can get from you.
Best Regards
Ajit
We are a small research group working on compressing fastq file information. We have read through the format specifications (as described in http://nar.oxfordjournals.org/content/38/6/1767) and have the following questions:
1. How significant is each field in a fastq file? We know that the sequence data is important. What about quality data? Do we need to accurately reproduce it?
2. Title line: What aspects of title line are important? What does the title line typically signify. What aspects of title should be retained and what can be dropped? We want to know, what information in the title line is most important to those working with the fastq file.
3. Is the order of reads as stored in a fastq file important or can the reads be reordered to make a new fastq file. Note that the new fastq file will contain the same but reordered information.
Thank you all very much in advance for any inputs that we can get from you.
Best Regards
Ajit
Comment