Hi everyone,
I'm looking for a library in C which would allow me to :
- Extract the first x k-mers of a FASTA file (put them in a buffer)
- Make some treatments on these k-mers
- Extract the next x k-mers of a FASTA file (put them in a buffer)
- Make some treatments on these k-mers
- And so on, until the end of the FASTA file.
Basically, the goal is to read a FASTA file by chunks of x k-mers, I don't want to load a complete sequence in memory (to extract the k-mers after) especially if this one is very long.
I could do this code by myself but I'm pretty sure this was already implemented somewhere so if you have a clue where I can find this, it would be very nice to tell me.
Thank you a lot for your help.
Best regards.
I'm looking for a library in C which would allow me to :
- Extract the first x k-mers of a FASTA file (put them in a buffer)
- Make some treatments on these k-mers
- Extract the next x k-mers of a FASTA file (put them in a buffer)
- Make some treatments on these k-mers
- And so on, until the end of the FASTA file.
Basically, the goal is to read a FASTA file by chunks of x k-mers, I don't want to load a complete sequence in memory (to extract the k-mers after) especially if this one is very long.
I could do this code by myself but I'm pretty sure this was already implemented somewhere so if you have a clue where I can find this, it would be very nice to tell me.
Thank you a lot for your help.
Best regards.
Comment