SEQanswers

Old 12-11-2011, 09:26 AM   #1
marb
Junior Member
 
Location: Poland

Join Date: Dec 2011
Posts: 7
Cufflinks doesn't recognize read type

Hello.
I used Cufflinks to process the output of TopHat.
My data come from an Illumina sequencer and are paired-end.

I ran the following command:
Code:
/path/cufflinks tophat_out/accepted_hits.bam -o cuff_out

Unfortunately, Cufflinks reports the data as single-end with a read length of 0!

Code:
[13:40:01] Inspecting reads and determining fragment length distribution.
Processed 204293 loci.
Map Properties:
       Total Map Mass: 5928337.99
       Read Type: 0bp single-end
       Fragment Length Distribution: Truncated Gaussian (default)
                     Default Mean: 200
                  Default Std Dev: 80
[13:52:49] Assembling transcripts and estimating abundances.
Processed 204293 loci.
Is it necessary to pass some additional option to run Cufflinks correctly on paired-end data?
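
One way to check whether the alignments themselves are flagged as paired, independently of what Cufflinks reports, is to look at the FLAG column of the SAM records (bit 0x1 marks a read as paired). The helper below is only a sketch: `count_read_types` is a name made up for this example, and it assumes `samtools` is installed so the BAM file can be streamed as SAM text.

```shell
# count_read_types: read SAM records on stdin and count how many carry
# the "paired" flag (bit 0x1 of the FLAG column) versus how many do not.
# Hypothetical helper for illustration; not part of TopHat or Cufflinks.
count_read_types() {
    awk '!/^@/ {                      # skip SAM header lines
             if (int($2) % 2 == 1)    # FLAG bit 0x1 set: read is paired
                 paired++
             else
                 single++
         }
         END { printf "paired=%d single=%d\n", paired, single }'
}

# Usage (assumes samtools is on PATH):
#   samtools view tophat_out/accepted_hits.bam | count_read_types
```

If every record lands in the `single` count, the reads were most likely aligned as single-end, and the problem would be upstream of Cufflinks.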
Old 12-11-2011, 05:16 PM   #2
Wallysb01
Senior Member
 
Location: San Francisco, CA

Join Date: Feb 2011
Posts: 286

What do your sequence headers look like and how are your files split up, if at all? I've run into this problem before, and it just required getting the formatting right.
Old 12-12-2011, 01:32 AM   #3
marb
Junior Member
 
Location: Poland

Join Date: Dec 2011
Posts: 7

Quote:
Originally Posted by Wallysb01 View Post
What do your sequence headers look like and how are your files split up, if at all? I've run into this problem before, and it just required getting the formatting right.
Do you mean the header of the BAM file?
I obtained 28 FASTQ files from Casava - 14 right-end (R1) and 14 left-end (R2).
I processed them with TopHat.
Old 12-12-2011, 02:23 PM   #4
Wallysb01
Senior Member
 
Location: San Francisco, CA

Join Date: Feb 2011
Posts: 286

Quote:
Originally Posted by marb View Post
Do you mean the header of the BAM file?
I obtained 28 FASTQ files from Casava - 14 right-end (R1) and 14 left-end (R2).
I processed them with TopHat.
Are you sure TopHat treated them as paired-end rather than single-end? What do the ends of your sequence headers look like in the FASTQ files? If they come out as:

@XXXX 1:N:0     @XXXX 2:N:0
AGC..           GCT
+XXXX 1:N:0     +XXXX 2:N:0
....            .....

a lot of programs won't recognize those as paired-end files. You need to convert the headers to:

@XXXX/1         @XXXX/2
AGC..           GCT
+XXXX/1         +XXXX/2
....            .....

I came on here with the same kinds of issues and a friendly commenter made this post to help people like me out:

http://contig.wordpress.com/2011/09/...-fastq-header/
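
The header conversion described above can be sketched with a small awk filter. This is a rough illustration, not the method from the linked post: `convert_header` is a hypothetical name, and it assumes four-line FASTQ records whose headers end in a Casava 1.8 style suffix such as ` 1:N:0` (optionally followed by a barcode, which gets dropped).

```shell
# convert_header: rewrite "@NAME 1:N:0..." style FASTQ headers on stdin
# into the older "@NAME/1" style, leaving sequence and quality lines alone.
# Hypothetical helper; assumes strict 4-line FASTQ records.
convert_header() {
    awk 'NR % 4 == 1 || NR % 4 == 3 {
             # header line or "+" line: look for " <read>:<filter>:<control>"
             if (match($0, / [12]:[YN]:[0-9]+/)) {
                 # keep the name, append "/" plus the read number (1 or 2)
                 $0 = substr($0, 1, RSTART - 1) "/" substr($0, RSTART + 1, 1)
             }
         }
         { print }'
}

# Usage (file names are placeholders):
#   zcat reads_R1.fastq.gz | convert_header | gzip > reads_R1.fixed.fastq.gz
```

Whether a particular TopHat version actually needs the `/1` and `/2` suffixes is not settled in this thread, so treat this as something to try rather than a guaranteed fix.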
Old 12-13-2011, 03:56 AM   #5
marb
Junior Member
 
Location: Poland

Join Date: Dec 2011
Posts: 7

I tested Cufflinks on another data set, and it recognised those reads correctly as 57bp x 57bp.
So I know the mistake is at the stage where TopHat processes the FASTQ files.

I know that the R1 and R2 (paired-end) files must be listed in the same order, so I used the following command:

Code:
tophat /path/to/genome $(printf "%s," ./*.gz | sed 's/,$/\n/')
Do you think this way of passing the arguments (FASTQ files) is incorrect?
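
For comparison: the command above joins every `.gz` file into a single comma-separated list, so TopHat receives only one reads argument. For paired-end data TopHat takes two lists, one for each mate, with the files in matching order. A minimal sketch of building them follows; the function names are invented for this example, and the `_R1_` / `_R2_` file name pattern is an assumption about Casava output, since the actual file names are not shown in this thread.

```shell
# join_with_commas: print all arguments as one comma-separated string.
join_with_commas() {
    out=""
    for f in "$@"; do
        out="${out:+$out,}$f"
    done
    printf '%s\n' "$out"
}

# paired_tophat_command DIR: print (not run) a paired-end tophat command
# for the FASTQ files in DIR. Globs expand in sorted order, so the R1 and
# R2 lists line up as long as the names differ only in _R1_ versus _R2_.
# Hypothetical helper; the file name pattern is an assumption.
paired_tophat_command() {
    dir=$1
    left=$(join_with_commas "$dir"/*_R1_*.fastq.gz)
    right=$(join_with_commas "$dir"/*_R2_*.fastq.gz)
    printf 'tophat /path/to/genome %s %s\n' "$left" "$right"
}

# Usage: inspect the printed command before running it.
#   paired_tophat_command .
```

Printing the command first makes it easy to confirm that each R1 file sits at the same position in its list as the corresponding R2 file.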