Hi all,
I am looking for a faster method to map annotation I obtained from RAST to a range of target genome positions.
The method I am currently using is by filtering the start and end position in the excel file to my target range (for example if I am finding annotated CDS that locates between 49050 and 54500 bp, I filter the genome location of the annotation file to "larger or equal to 49050" and "less than or equal to 54500" and copy the related result from there.
This method is extremely time-consuming and now I have a set of data with about 500 ranges so I am looking for suggestions on any method that I can use to speed up this process? I know very limited awk and R but is familiar with usage of Linux OS.
Thank you in advance for any help
I am looking for a faster method to map annotation I obtained from RAST to a range of target genome positions.
The method I am currently using is by filtering the start and end position in the excel file to my target range (for example if I am finding annotated CDS that locates between 49050 and 54500 bp, I filter the genome location of the annotation file to "larger or equal to 49050" and "less than or equal to 54500" and copy the related result from there.
This method is extremely time-consuming and now I have a set of data with about 500 ranges so I am looking for suggestions on any method that I can use to speed up this process? I know very limited awk and R but is familiar with usage of Linux OS.
Thank you in advance for any help
Comment