SEQanswers

SEQanswers (http://seqanswers.com/forums/index.php)
-   Illumina/Solexa (http://seqanswers.com/forums/forumdisplay.php?f=6)
-   -   How to separate eukaryotic sequence reads from my metagenome data? (http://seqanswers.com/forums/showthread.php?t=75651)

SaiPrabhas 04-24-2017 09:50 AM

How to separate eukaryotic sequence reads from my metagenome data?
 
Hello!

I have Illumina Hiseq data from leafhopper Empoasca fabae. It has two symbionts, Sulcia muelleri, and Nasuia deltocephalinicola. In my project, we sequenced head and bacteriome regions of 4 different leafhopper specimens. We want to know if they are any differences in bacterial populations between these specimens and also want to see if they are any new bacteria present in the head samples of leafhopper. As the genome of the leafhopper is not sequenced yet, I cannot map the reads to a reference genome and filter out the eukaryotic reads so that I can analyze the bacterial population. How can I solve this problem? Thank you very much.

Regards,
Sai

GenoMax 04-24-2017 10:04 AM

Want to make you aware of BBSplit from BBMap.

Following is not completely thought through so take with a grain of salt.

There seem to be too many unknowns in this design to be able to use BBsplit directly. You could bbsplit all samples against each other to see if you can get as much of the leafhopper out as you can but it is going to take out some of the common bacteria as well.

Is there a related genome (to leafhopper) available? That would be the second best bet.

WhatsOEver 04-25-2017 06:26 AM

1) bacteria often have a higher GC-content than eukaryots, so this might be a starting point
2) You could filter your dataset for the known symbionts (which genomes are available, at least acc to a 5s google research) and try an assembly with the rest (depending on what this Hiseq data is: WGS/RNA-Seq?)
3) If you could get a collection of genomes of known insect symbionts, try merging all these and map against this set
4) Contact the people belonging to the abstract "Hybrid De Novo Assembly of the Leafhopper Genome with SMRT Sequencing Method" I also found via google


All times are GMT -8. The time now is 05:56 PM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2021, vBulletin Solutions, Inc.