SEQanswers

Old 03-24-2010, 04:56 PM   #1
btully
Junior Member
 
Location: Los Angeles

Join Date: Mar 2010
Posts: 1
Default 454ContigGraph.txt

The 454ContigGraph.txt file output by the GS assembler is exactly the kind of information I am looking for. However, I could not tell from the Roche manual how to visualize the results. Any suggestions?
Old 03-24-2010, 11:18 PM   #2
linikujp
Junior Member
 
Location: JAPAN

Join Date: May 2009
Posts: 6
Smile

There is a detailed description in the version 2.3 manual:
Genome Sequencer FLX System Software Manual Part C: GS De Novo Assembler – GS Reference Mapper – SFF Tools
1.15.1.12 454ContigGraph.txt

After you run a cDNA assembly project, look at the actual file and read the manual. It is clear and easy to understand.
Old 03-25-2010, 02:00 AM   #3
seqseq
Junior Member
 
Location: Spain

Join Date: Oct 2009
Posts: 7
Default

You are referring to this manual file, right?
http://xyala.cap.ed.ac.uk/Gene_Pool/...ls_Oct2009.pdf

There is a description of the "first section" of 454ContigGraph.txt, which contains the average depth of the alignment per contig. But for the "second section", which contains the connections between contigs, they only describe the lines starting with "C", not the ones starting with "I" or with "F". Does anyone know where I can find a description of these parts of the "second section"?

And does anyone know a way to visualize (parts of) this graph structure?

Thanks
Old 03-25-2010, 02:41 AM   #4
linikujp
Junior Member
 
Location: JAPAN

Join Date: May 2009
Posts: 6
Post

Yes. It is that manual.

In my file, the lines start with "S" and "I" instead of "F" and "I".

Here is an example:
S 1 3694 618:+;561:-;560:+;82:-;711:-;500:+

I 164 AGAGGCTTAgggtttttCATCCAATCaaacAGTGCCAGACCACGGTTACACAACAGAACCGATGGCCTCAGCCGGTAACAGGATGGATACCAacagggtgg
I 502 GC 301:875-3'..855-3';6:970-5'..855-3';2867:971-5'..855-3';5:972-5'..855-3'

I couldn't find the explanation in the manual. I think a programmer from 454 could explain this.
Old 03-25-2010, 02:44 AM   #5
linikujp
Junior Member
 
Location: JAPAN

Join Date: May 2009
Posts: 6
Default

Quote:
Originally Posted by seqseq View Post
And does anyone know a way to visualize (parts of) this graph structure?

Thanks
You may develop a program to visualize this.
I am thinking of doing this... who else is interested?
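
As a starting point, here is a minimal Python sketch that turns "S" rows into Graphviz DOT text. It assumes each "S" row lists the contigs of one isotig in order, as in the example row posted above, and it simply draws an edge between consecutive contigs. The function name s_rows_to_dot is made up for this sketch, and orientation (+/-) is ignored.

```python
def s_rows_to_dot(lines):
    """Build Graphviz DOT text from 'S' rows (assumed: ordered contig paths)."""
    edges = set()
    for line in lines:
        fields = line.split()
        # Only handle rows shaped like: S <id> <length> <contig:strand;...>
        if len(fields) != 4 or fields[0] != "S":
            continue
        path = [item.split(":")[0] for item in fields[3].split(";")]
        # Link each contig to the next one along the path.
        for a, b in zip(path, path[1:]):
            edges.add((a, b))
    out = ["digraph contigs {"]
    for a, b in sorted(edges):
        out.append(f'    "{a}" -> "{b}";')
    out.append("}")
    return "\n".join(out)


dot = s_rows_to_dot(["S 1 3694 618:+;561:-;560:+;82:-;711:-;500:+"])
print(dot)
```

The printed text can be saved to a file and rendered with any Graphviz-compatible viewer.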
Old 03-29-2010, 07:04 AM   #6
dschika
Member
 
Location: Zurich

Join Date: Mar 2010
Posts: 56
Default

I don't know if this is correct, but this is what I suppose from looking at my 454ContigGraph file:

S isotig-id isotig-length contigs_in_isotig(with direction +/-)

I contig_id consensus_sequence coverage-depth:contigs_surrounding_the_contig (???) (leading zeros in the contig_id are discarded: 1 = contig00001)

As I'm very unsure about the last entry in the "I-rows" I still don't have a clue why this entry is missing sometimes...
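
To make the guess about the "S" rows concrete, here is a minimal Python sketch that splits such a row into the three fields supposed above. The layout is an assumption taken from this post, not from the manual, and the names parse_s_line and contig_name are hypothetical helpers made up for illustration.

```python
def contig_name(contig_id):
    """Restore the leading zeros, e.g. 1 -> 'contig00001' (as noted above)."""
    return f"contig{contig_id:05d}"


def parse_s_line(line):
    """Parse 'S <isotig-id> <isotig-length> <contig:strand;...>' (assumed layout)."""
    tag, isotig_id, length, path = line.split()
    if tag != "S":
        raise ValueError("not an S row")
    contigs = []
    for item in path.split(";"):
        contig_id, strand = item.split(":")
        contigs.append((int(contig_id), strand))
    return int(isotig_id), int(length), contigs


isotig_id, length, contigs = parse_s_line(
    "S 1 3694 618:+;561:-;560:+;82:-;711:-;500:+"
)
print(isotig_id, length, [contig_name(c) + s for c, s in contigs])
```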

Last edited by dschika; 03-29-2010 at 07:13 AM.
Old 03-29-2010, 11:20 PM   #7
flxlex
Moderator
 
Location: Oslo, Norway

Join Date: Nov 2008
Posts: 415
Default

I got the following explanation:

I: reads 'flowing through' the contig, i.e. reads that start in a neighboring contig, flow through the contig, and end in another neighbor again. In the example:

I 502 GC 301:875-3'..855-3';6:970-5'..855-3';2867:971-5'..855-3';5:972-5'..855-3'

301 reads start in contig 875, flow out of its 3' end, into the 5' end of the contig you are looking at (502), out of it again at the 3' end, and into the 3' end of contig 855. The '..' kind of represents the contig in question from 5' to 3'.

In this case, the large number of reads flowing through (301, 6, 2867 and 5, respectively) and the short sequence ('GC') point to a tandem repeat/microsat. All reads go into contig 855, which perhaps is high depth and short also?
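
For anyone who wants to pull these numbers out of the file, here is a rough Python sketch of reading the last field of an "I" row. It assumes the layout described in this post (read count, then source contig and end, '..', then destination contig and end); parse_i_line is a hypothetical helper, not part of any Roche tool.

```python
import re

# One flow-through entry, e.g. 301:875-3'..855-3'
FLOW = re.compile(r"(\d+):(\d+)-([35])'\.\.(\d+)-([35])'")


def parse_i_line(line):
    """Return (contig_id, sequence, flows) for an 'I' row (assumed layout)."""
    fields = line.split()
    if len(fields) < 3 or fields[0] != "I":
        raise ValueError("not an I row")
    flows = []
    # The last field is optional; some rows have no flow-through reads listed.
    if len(fields) > 3:
        for item in fields[3].split(";"):
            m = FLOW.fullmatch(item)
            if m is None:
                raise ValueError(f"unexpected entry: {item}")
            reads, src, src_end, dst, dst_end = m.groups()
            flows.append((int(reads), int(src), src_end + "'", int(dst), dst_end + "'"))
    return int(fields[1]), fields[2], flows


row = "I 502 GC 301:875-3'..855-3';6:970-5'..855-3';2867:971-5'..855-3';5:972-5'..855-3'"
contig_id, sequence, flows = parse_i_line(row)
print(contig_id, sequence, sum(f[0] for f in flows), "reads flowing through")
```

Rows without the last field (like the "I 164 ..." example earlier in the thread) come back with an empty list.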
Old 04-13-2010, 01:35 AM   #8
flxlex
Moderator
 
Location: Oslo, Norway

Join Date: Nov 2008
Posts: 415
Default

I just posted an entry on the 454ContigGraph.txt file on my blog about newbler:

http://contig.wordpress.com/2010/04/...raph-txt-file/

Cheers,

flxlex
Old 04-13-2010, 07:45 PM   #9
linikujp
Junior Member
 
Location: JAPAN

Join Date: May 2009
Posts: 6
Thumbs up

Quote:
Originally Posted by flxlex View Post
I just posted an entry on the 454ContigGraph.txt file on my blog about newbler:

http://contig.wordpress.com/2010/04/...raph-txt-file/

Cheers,

flxlex
Hi, that's a good one. Thank you!
Old 04-14-2010, 06:49 AM   #10
lh3
Senior Member
 
Location: Boston

Join Date: Feb 2008
Posts: 693
Default

Thanks. Quite a nice blog on technical details.
Old 06-28-2010, 10:00 AM   #11
henry.gibbons
Junior Member
 
Location: Aberdeen Proving Ground, MD

Join Date: Jun 2009
Posts: 1
Default Visualizing 454contiggraph.txt files

Quote:
Originally Posted by linikujp View Post
You may develop a program to visualize this.
I am thinking of doing this... who else is interested?
I would be very interested in this. You see displays of this at AGBT/other meetings but I have not seen any tools to display this very useful information. I'm not a programmer, so this would be someone else's bag but would love the tool!
Old 12-29-2010, 11:00 AM   #12
dsenalik
Carrot Scientist
 
Location: Madison WI USA

Join Date: Nov 2009
Posts: 42
Default

Quote:
Originally Posted by henry.gibbons View Post
I would be very interested in this. You see displays of this at AGBT/other meetings but I have not seen any tools to display this very useful information. I'm not a programmer, so this would be someone else's bag but would love the tool!
For anyone finding this old thread, I wrote a program to do just this.

I posted it at this address:

http://www.vcru.wisc.edu/simonlab/sd...html#contignet