A quick question here. I am constructing a reference transcriptome for DE out of 16 transcriptomes (8 control + 8 treatment) and my strategy is as follows:
1. create a database with transcriptome 1 (random)
2. blast transcriptome 2 to 1
3. start creating the reference transcriptome = transcriptome 1 + unmatched transcripts from transcriptome 2
4. create a database with the reference transcriptome
5. blast transcriptome 3 to ref transcr.
6. as in point 3 and so on
At the end I will have a reference transcriptome with all the transcripts present in the 16 transcriptomes represented once.
My questions is to use or not to use -task megablast.
If I use megablast then the reference transcriptome is almost double in size and there might be in my view some redundancy?
What are your thoughts...
Thank you
1. create a database with transcriptome 1 (random)
2. blast transcriptome 2 to 1
3. start creating the reference transcriptome = transcriptome 1 + unmatched transcripts from transcriptome 2
4. create a database with the reference transcriptome
5. blast transcriptome 3 to ref transcr.
6. as in point 3 and so on
At the end I will have a reference transcriptome with all the transcripts present in the 16 transcriptomes represented once.
My questions is to use or not to use -task megablast.
If I use megablast then the reference transcriptome is almost double in size and there might be in my view some redundancy?
What are your thoughts...
Thank you
Comment