i have a fasta file containing all the virus genome which will affect cassava. i have transcript file also in fasta. in transcript file each transcript have 100 nucleotide length. I want to know whether these transcripts are inside the virus genom. any one help me pls?
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
One relatively straight-forward way to attack this problem would be to make a combined fasta file for the viral genomes and cassava and then map the transcripts/transcript fragments against it. You can then get an idea how likely each fragment is to come from the host or virus. I assume that tophat or something like that would be an appropriate aligner, since presumably you have a mixture of host (i.e. spliced) and viral (I assume transcripts would be single exon, but I've only made viruses, not studied them) reads and not doing so might bias things.
I should note that I've never done what you're trying to do, but if no one replies with a better idea then this is enough to get you started.
-
doubt regarding tophat
Originally posted by dpryan View PostOne relatively straight-forward way to attack this problem would be to make a combined fasta file for the viral genomes and cassava and then map the transcripts/transcript fragments against it. You can then get an idea how likely each fragment is to come from the host or virus. I assume that tophat or something like that would be an appropriate aligner, since presumably you have a mixture of host (i.e. spliced) and viral (I assume transcripts would be single exon, but I've only made viruses, not studied them) reads and not doing so might bias things.
I should note that I've never done what you're trying to do, but if no one replies with a better idea then this is enough to get you started.
Comment
-
Originally posted by vijesh View Postcan we put the two files as input in tophat.means that genome file in fasta and also the transcript file?
Code:cat Cassava.fa Viruses.fa > CombinedGenome.fa bowtie2-build CombinedGenome.fa Combined tophat -G Cassava.gtf Combined reads.fa
It would be good to compare the results with and without using the GTF annotation, off-hand I'm not entirely sure how or if that might bias things.
Comment
Latest Articles
Collapse
-
by seqadmin
Spatial biology is an exciting field that encompasses a wide range of techniques and technologies aimed at mapping the organization and interactions of various biomolecules in their native environments. As this area of research progresses, new tools and methodologies are being introduced, accompanied by efforts to establish benchmarking standards and drive technological innovation.
3D Genomics
While spatial biology often involves studying proteins and RNAs in their...-
Channel: Articles
Yesterday, 07:30 PM -
-
by seqadmin
Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...-
Channel: Articles
12-16-2024, 07:57 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 12-30-2024, 01:35 PM
|
0 responses
21 views
0 likes
|
Last Post
by seqadmin
12-30-2024, 01:35 PM
|
||
Started by seqadmin, 12-17-2024, 10:28 AM
|
0 responses
41 views
0 likes
|
Last Post
by seqadmin
12-17-2024, 10:28 AM
|
||
Started by seqadmin, 12-13-2024, 08:24 AM
|
0 responses
55 views
0 likes
|
Last Post
by seqadmin
12-13-2024, 08:24 AM
|
||
Started by seqadmin, 12-12-2024, 07:41 AM
|
0 responses
40 views
0 likes
|
Last Post
by seqadmin
12-12-2024, 07:41 AM
|
Comment