![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
COSMIC vcf file contains "N" & "." characters in ALT column | vipul jain | Bioinformatics | 1 | 09-03-2015 01:16 AM |
glimmer .predict file: "from" larger than "to" on + strand | shany | Bioinformatics | 0 | 05-27-2014 03:21 PM |
fastq file with "." in sequence line | zinky | Bioinformatics | 3 | 01-01-2013 02:34 PM |
Relatively large proportion of "LOWDATA", "FAIL" of FPKM_status running cufflink | ruben6um | Bioinformatics | 3 | 10-12-2011 01:39 AM |
The position file formats ".clocs" and "_pos.txt"? Ist there any difference? | elgor | Illumina/Solexa | 0 | 06-27-2011 08:55 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Senior Member
Location: Oxford, Ohio Join Date: Mar 2012
Posts: 253
|
![]()
Hello,
I hesitate to use the following terms: "split", "partition", "trim" - because they all have special connotations. What I'd like to do is to take a large FASTQ file of PE reads and cut the file into two approximately equal halves. However, I want to do it in such a manner that a given file does not have only one half of a pair of a PE read group. In other words, I want to insure that when the file is halved, or split, that no PE reads are separated. For example, if I have a file with 11 PE reads, I do NOT want 5 reads in one file and 6 reads in another. Will FASTQ Splitter work for what I want? - Andor |
![]() |
![]() |
![]() |
#2 |
Senior Member
Location: East Coast USA Join Date: Feb 2008
Posts: 7,080
|
![]()
How would you split a PE dataset otherwise? I assume your PE reads are in two separate files (R1/R2) and the split would have to split both files?
|
![]() |
![]() |
![]() |
#3 |
Senior Member
Location: Oxford, Ohio Join Date: Mar 2012
Posts: 253
|
![]() |
![]() |
![]() |
![]() |
#4 |
Senior Member
Location: East Coast USA Join Date: Feb 2008
Posts: 7,080
|
![]()
So your paired-end reads are interleaved? Why not use "split -n 2" to divide the original into to two parts. If you have an odd number of fastq records then using an explicit "split -l ((n+1)/2*4)" may be better (n = number of fastq records).
Last edited by GenoMax; 08-07-2017 at 09:28 AM. |
![]() |
![]() |
![]() |
Tags |
fastq splitter, partition, pe reads, split |
Thread Tools | |
|
|