
**sarvidsson** · 02-18-2015, 04:06 AM

Could you post a few lines from the FASTA file you produced?

**dena.dinesh** · 02-18-2015, 04:11 AM

Originally posted by sarvidsson:

Could you post a few lines from the FASTA file you produced?

Hi,
I have added a few lines of my FASTA file. Please take a look.

Best
dena

**sarvidsson** · 02-18-2015, 04:19 AM

Posting the final file (after all replacements, cleanup etc.) would be easier to debug (e.g. as an attachment).

Make sure that the first line ("fasta_seq") is removed and that there is a linebreak between the ID and the sequence (difficult to tell whether this is the case). Additionally, some tools expect fixed-length sequence lines - you can use the "fold" command line utility to fix that.

**dpryan** · 02-18-2015, 04:36 AM

You don't want to use write.table(). Well, you can, but then you'd need sep="\n" and quote=F. A better method would be the write.fasta() function from the seqinr package.
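For reference, the write.table() route mentioned above could look roughly like the sketch below. The sequences and IDs are made-up example data, and the output goes to a temporary file; row.names=FALSE and col.names=FALSE are added here so that no stray header line (such as "fasta_seq") or row numbers end up in the file.

```r
# Hypothetical example data: two short sequences keyed by ID.
seqs <- c(seq1 = "ATGGCCATTGTAATGGGCCGC", seq2 = "ATGAAATAG")

# One row per record: the header (">" + ID) and the sequence itself.
fasta_df <- data.frame(
  header = paste0(">", names(seqs)),
  sequence = unname(seqs),
  stringsAsFactors = FALSE
)

# sep = "\n" puts the header and the sequence on separate lines;
# quote = FALSE stops R from wrapping the fields in double quotes.
out_file <- tempfile(fileext = ".fasta")
write.table(fasta_df, file = out_file, sep = "\n", quote = FALSE,
            row.names = FALSE, col.names = FALSE)

fasta_lines <- readLines(out_file)
```

This only works cleanly when each sequence is a single string, and it does no line wrapping, so write.fasta() is still the less fragile choice.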

**dena.dinesh** · 02-18-2015, 06:43 AM

Hi Ryan,

Thanks for your comment. I tried write.fasta() on the file, but it prints only the first sequence, with all characters on a single line. It is not printing the other sequences. I think the file must be in a different format. I have attached the file for your reference. Kindly guide me.

Attached Files

fasta_sequences.txt (13.2 KB, 26 views)

**dena.dinesh** · 02-18-2015, 06:44 AM

Originally posted by sarvidsson:

Could you post a few lines from the FASTA file you produced?

I have attached the file that was generated by the R command above for your reference.

Attached Files

fasta_sequences.txt (13.2 KB, 25 views)

**sarvidsson** · 02-18-2015, 06:51 AM

That file should be OK (it is proper FASTA). As I previously said, some tools like to have folded sequence lines (just run the Unix command fold on it).

I ran your file on the frameDP web resource from INRA (https://iant.toulouse.inra.fr/FrameDP/), and that worked fine.

**dpryan** · 02-18-2015, 06:56 AM

The problem is that you mucked up the output of read.fasta. Your code should be something like:

Code:

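library(seqinr)
# Read the IDs as a character vector (assumes one ID per line and no header row)
ids = as.character(read.delim("path/to/ids/file.txt", header=F)[[1]])
dd = read.fasta("path/to/transcriptome.fasta", seqtype="DNA", as.string=T)
# Keep only the sequences whose names appear in the ID list
dd = dd[names(dd) %in% ids]
# dir and name are assumed to be defined earlier (output directory and file name)
write.fasta(dd, names(dd), file.out=paste(dir,"/",name,sep=""))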

There's no need to muck around with prepending ">" to the names.

**dena.dinesh** · 02-19-2015, 01:40 AM

Thanks Ryan. It worked, but when I set nbchar=70 it doesn't seem to have any effect; the entire sequence is still printed on a single line. Thanks once again for your help.
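If nbchar has no effect, one workaround is to wrap the sequence lines yourself in base R, which does the same job as the Unix fold command suggested earlier in the thread. A possible cause, not verified in this thread, is that each sequence was read with as.string=T and is therefore a single string rather than a vector of characters. The sketch below is only an illustration: wrap_sequence() and write_wrapped_fasta() are hypothetical helper names, not seqinr functions, and the demo sequence is made up.

```r
# Split one sequence string into chunks of at most `width` characters.
wrap_sequence <- function(sequence, width = 70) {
  sequence <- as.character(sequence)
  n <- nchar(sequence)
  if (n == 0) return(character(0))
  starts <- seq(1, n, by = width)
  substring(sequence, starts, pmin(starts + width - 1, n))
}

# Write sequences (one string each) as FASTA with wrapped sequence lines.
write_wrapped_fasta <- function(sequences, ids, path, width = 70) {
  con <- file(path, open = "w")
  on.exit(close(con))
  for (i in seq_along(sequences)) {
    writeLines(paste0(">", ids[i]), con)
    writeLines(wrap_sequence(sequences[[i]], width), con)
  }
  invisible(path)
}

# Made-up demo: one 160-base sequence wrapped at 70 characters per line.
demo <- list(seq1 = strrep("ACGT", 40))
out_file <- tempfile(fileext = ".fasta")
write_wrapped_fasta(demo, names(demo), out_file, width = 70)
wrapped <- readLines(out_file)
```

With the list returned by read.fasta(..., as.string=T), the call would be along the lines of write_wrapped_fasta(dd, names(dd), "out.fasta", width = 70), where "out.fasta" is a placeholder path.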

**dena.dinesh** · 02-19-2015, 01:41 AM

Originally posted by sarvidsson:

That file should be OK (it is proper FASTA). As I previously said, some tools like to have folded sequence lines (just run the Unix command fold on it).

I ran your file on the frameDP web resource from INRA (https://iant.toulouse.inra.fr/FrameDP/), and that worked fine.

Thank you very much. It worked.


Unable to find ORF for fasta file
