Hi all, just starting to analyse NGS data, beginning to realise I should have been here asking for help from the beginning!
I have a number of fastq files (produced by concatenating both directions of a paired-end Illumina run), from a variety of PCRs from different samples. I'd like to be able to compare the files, to see how many sequences in one file are shared with say one or more other files.
Can anyone think of a sensible way of going about this? Bioinformatics is really not my strong suit, and the only ways I can think of would probably never run on my machine.
I have a number of fastq files (produced by concatenating both directions of a paired-end Illumina run), from a variety of PCRs from different samples. I'd like to be able to compare the files, to see how many sequences in one file are shared with say one or more other files.
Can anyone think of a sensible way of going about this? Bioinformatics is really not my strong suit, and the only ways I can think of would probably never run on my machine.
Comment