Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • ERROR: cufflinks, cuffmerge, cuffdiff

    bowtie2-build Mh.fa Mh &

    tophat2 -p 8 -o t1_Mh -G Mh.gff Mh t1.1_fq,t1.2_fq,t1.3_fq,t1.4_fq,t1.5_fq 2>> tophat.log &
    tophat2 -p 8 -o t2_Mh -G Mh.gff Mh t2.1_fq,t2.2_fq,t2.3_fq,t2.4_fq,t2.5_fq 2>> tophat.log &

    ===============no problem until here=================

    samtools view -o accepted_hits.sam accepted_hits.bam
    cufflinks -o cufflinks_out t1_accepted_hits.sam 2>> cufflinks.log &

    You are using Cufflinks v2.0.2, which is the most recent release.
    [bam_header_read] EOF marker is absent.
    [bam_header_read] invalid BAM binary header (this is not a BAM file).
    File IR_t1_Mh/accepted_hits.sam doesn't appear to be a valid BAM file, trying SAM...
    [10:52:43] Inspecting reads and determining fragment length distribution.
    > Processed 9677 loci. [*************************] 100%
    > Map Properties:
    > Normalized Map Mass: 11715284.00
    > Raw Map Mass: 11715284.00
    > Fragment Length Distribution: Truncated Gaussian (default)
    > Default Mean: 200
    > Default Std Dev: 80
    [10:53:42] Assembling transcripts and estimating abundances.


    output files:
    -genes.fpkm_tracking,
    -isoforms.fpkkm_tracking,
    -skipped.gtf,
    -and transcripts.gtf.

    Cufflinks worked when sam file was used and gff file was dropped.

    ===============no problem until here=================

    cufflinks -g Mh.gff -o cufflinks_out t1_accepted_hits.sam

    You are using Cufflinks v2.0.2, which is the most recent release.
    [bam_header_read] EOF marker is absent.
    [bam_header_read] invalid BAM binary header (this is not a BAM file).
    File IR_t1_Mh/accepted_hits.sam doesn't appear to be a valid BAM file, trying SAM...
    [10:59:30] Loading reference annotation.
    [10:59:31] Inspecting reads and determining fragment length distribution.
    > Processing Locus Mh:3451:MhA1_Contig3451:113 [************************ ] 99%
    Error: this SAM file doesn't appear to be correctly sorted!
    current hit is at Mh:0004:MhA1_Contig4:380, last one was at Mh:0003:MhA1_Contig3:890
    Cufflinks requires that if your file has SQ records in
    the SAM header that they appear in the same order as the chromosomes names
    in the alignments.
    If there are no SQ records in the header, or if the header is missing,
    the alignments must be sorted lexicographically by chromsome
    name and by position.


    Cufflinks didn't work when gff file was included.
    It seemed that cufflinks stoppted at Contig3.

    ===============problem occurred above=================

    cufflinks -o cufflinks_out t1_accepted_hits.bam

    You are using Cufflinks v2.0.2, which is the most recent release.
    Error: sort order of reads in BAMs must be the same


    Cufflinks didn't work when bam file was used and gff file was dropped.

    ===============problem occurred above=================

    cufflinks -g Mt.gff -o cufflinks_out t1_accepted_hits.bam 2>> cufflinks.log &

    You are using Cufflinks v2.0.2, which is the most recent release.
    [bam_header_read] EOF marker is absent.
    [bam_header_read] invalid BAM binary header (this is not a BAM file).
    File IR_t1_Mh/accepted_hits.sam doesn't appear to be a valid BAM file, trying SAM...
    [10:50:49] Loading reference annotation.
    [10:50:50] Inspecting reads and determining fragment length distribution.
    > Processing Locus Mh:3451:MhA1_Contig3451:113 [************************ ] 99%
    Error: this SAM file doesn't appear to be correctly sorted!
    current hit is at Mh:0004:MhA1_Contig4:380, last one was at Mh:0003:MhA1_Contig3:890
    Cufflinks requires that if your file has SQ records in
    the SAM header that they appear in the same order as the chromosomes names
    in the alignments.
    If there are no SQ records in the header, or if the header is missing,
    the alignments must be sorted lexicographically by chromsome
    name and by position.


    Cufflinks didn't work when bam file and gff file were used.
    It seemed that cufflinks stoppted at Contig3.

    ===============problem occurred above=================

    cuffmerge -o merge_IR_Mt -g Mt.gff IR_Mt_transcript.txt 2>> cuffmerge.log &

    [Sat Jan 19 11:10:13 2013] Preparing output location merge_IR_Mh/
    [Sat Jan 19 11:10:15 2013] Converting GTF files to SAM
    [11:10:15] Loading reference annotation.
    [11:10:17] Loading reference annotation.
    [11:10:18] Loading reference annotation.
    [11:10:18] Loading reference annotation.
    [11:10:18] Loading reference annotation.
    [11:10:18] Loading reference annotation.
    [11:10:18] Loading reference annotation.
    [Sat Jan 19 11:10:18 2013] Quantitating transcripts
    You are using Cufflinks v2.0.2, which is the most recent release.
    Command line:
    cufflinks -o merge_IR_Mh/ -F 0.05 -g Mh.gff -q --overhang-tolerance 200 --library-type=transfrags -A 0.0 --min-frags-per-transfrag 0 --no-5-extend -p 1 merge_IR_Mh/tmp/mergeSam_fileBL9IlN
    [bam_header_read] EOF marker is absent.
    [bam_header_read] invalid BAM binary header (this is not a BAM file).
    File merge_IR_Mh/tmp/mergeSam_fileBL9IlN doesn't appear to be a valid BAM file, trying SAM...
    Error: sort order of reads in BAMs must be the same
    [FAILED]
    Error: could not execute cufflinks


    ===============problem occurred above=================

    cuffdiff -o cuffdiff_out_IR_Mt merge_IR_Mt/merged.gtf t1_Mt/accepted_hits.bam t2_Mh/accepted_hits.bam

    You are using Cufflinks v2.0.2, which is the most recent release.
    Error: sort order of reads in BAMs must be the same


    ===============problem occurred above=================

    In summery,
    1. There was no problem in running Bowtie-build and Tophat.
    2. While running cufflinks, if I included gff file, it did not work.
    Even though I dropped gff file, accepted_bam file also made an error.
    However, when I dropped gff file but used sam file instead of bam file, four output files were successfully generated.
    3. While funning cuffmerge and cuffdiff, it didn't work as well.
    4. I checked GFF and SAM/BAM file.
    - They are correctly sorted.
    - SAM/BAM file: contig0, contig1, contig2, contig3, contig4, ......
    - GFF file: contig0, contig1, contig2, contig4...... (no contig3) -> INCOMPLETE
    5. accepted_hits.bam file looks like....
    @HD VN:1.0 SO:coordinate
    @SQ SN:Mh:0000:MhA1_Contig0 LN:237641
    @SQ SN:Mh:0001:MhA1_Contig1 LN:17102
    @SQ SN:Mh:0002:MhA1_Contig2 LN:88040
    @SQ SN:Mh:0003:MhA1_Contig3 LN:1670
    @SQ SN:Mh:0004:MhA1_Contig4 LN:27447
    @SQ SN:Mh:0005:MhA1_Contig5 LN:2702
    @SQ SN:Mh:0006:MhA1_Contig6 LN:3758
    @SQ SN:Mh:0007:MhA1_Contig7 LN:156602
    @SQ SN:Mh:0008:MhA1_Contig8 LN:1526
    @SQ SN:Mh:0009:MhA1_Contig9 LN:106463
    @SQ SN:Mh:0010:MhA1_Contig10 LN:7780
    @SQ SN:Mh:0011:MhA1_Contig11 LN:67162
    @SQ SN:Mh:0012:MhA1_Contig12 LN:16878
    @SQ SN:Mh:0013:MhA1_Contig13 LN:73699
    @SQ SN:Mh:0014:MhA1_Contig14 LN:78027
    6. gff file looks like...
    Mh:0000:MhA1_Contig0 Freeze3 mRNA 4027 5970 + ID=MhA1_Contig0.frz3.fgene1;Name=MhA1_Contig0.frz3.fgene1
    Mh:0000:MhA1_Contig0 Freeze3 CDS 4027 4090 15.97 + 0 Parent=MhA1_Contig0.frz3.fgene1
    Mh:0000:MhA1_Contig0 Freeze3 CDS 4141 4251 7.12 + 1 Parent=MhA1_Contig0.frz3.fgene1
    Mh:0000:MhA1_Contig0 Freeze3 CDS 4311 4390 9.54 + 1 Parent=MhA1_Contig0.frz3.fgene1
    Mh:0000:MhA1_Contig0 Freeze3 CDS 4471 4576 11.8 + 0 Parent=MhA1_Contig0.frz3.fgene1
    Mh:0000:MhA1_Contig0 Freeze3 CDS 4635 4678 -4.98 + 1 Parent=MhA1_Contig0.frz3.fgene1
    Mh:0000:MhA1_Contig0 Freeze3 CDS 4760 4959 14.82 + 0 Parent=MhA1_Contig0.frz3.fgene1
    Mh:0000:MhA1_Contig0 Freeze3 CDS 5070 5193 4.65 + 2 Parent=MhA1_Contig0.frz3.fgene1
    Mh:0000:MhA1_Contig0 Freeze3 CDS 5243 5274 -3.08 + 0 Parent=MhA1_Contig0.frz3.fgene1
    Mh:0000:MhA1_Contig0 Freeze3 CDS 5348 5631 39.94 + 2 Parent=MhA1_Contig0.frz3.fgene1
    Mh:0000:MhA1_Contig0 Freeze3 CDS 5747 5841 5.9 + 1 Parent=MhA1_Contig0.frz3.fgene1
    Mh:0000:MhA1_Contig0 Freeze3 CDS 5881 5970 9.61 + 0 Parent=MhA1_Contig0.frz3.fgene1

    Would you please give me some tips about what I have to fix?
    Last edited by syintel87; 01-19-2013, 08:28 AM.

Latest Articles

Collapse

  • seqadmin
    Current Approaches to Protein Sequencing
    by seqadmin


    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
    04-04-2024, 04:25 PM
  • seqadmin
    Strategies for Sequencing Challenging Samples
    by seqadmin


    Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
    03-22-2024, 06:39 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 04-11-2024, 12:08 PM
0 responses
24 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 10:19 PM
0 responses
25 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 09:21 AM
0 responses
22 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-04-2024, 09:00 AM
0 responses
52 views
0 likes
Last Post seqadmin  
Working...
X