
  • Can't use my SAM files for cuffdiff

    Hi everyone,

    I am running Cuffdiff with GTF files from Cuffmerge and SAM files from my mapped reads (CLC Bio). However, I get an error saying that my SAM files are not aligned. I searched the web and found this:

    Do Cufflinks and Cuffdiff support both BAM and SAM?

    Yes. If a SAM is supplied, a message will be output that the file is not a valid BAM file. However, Cufflinks will recognize this and treat the file as a SAM. When using a SAM file, you should include a proper header or ensure that the reads are sorted lexicographically by chromosome and then numerically by left position. You can accomplish this sorting with the command sort -k3,3 -k4,4n in.sam > out.sam.


    As I am running this from the Galaxy website and not from a Linux command line, what should I do to keep my analysis going with Cuffdiff? Thanks.

    Jasmine

  • #2
    Are your SAM files "sorted" by coordinates ?
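If you can get the file onto a machine with a Unix shell (this will not work inside Galaxy itself), a rough way to check the ordering is sketched below. It assumes a tab-separated SAM file and a sort that supports the -s (stable) option, as GNU sort does; the function name sam_is_sorted is made up for illustration and is not part of Cufflinks.

```shell
# Hypothetical helper (not part of Cufflinks): succeeds if the alignment
# records are ordered by reference name (column 3), then position (column 4).
sam_is_sorted() {
    # Skip header lines (they start with "@"), then let "sort -c" verify the
    # order without rewriting anything. -s stops sort from also comparing the
    # rest of the line when both keys are equal; LC_ALL=C gives plain
    # byte-wise (lexicographic) comparison of reference names.
    grep -v '^@' "$1" | LC_ALL=C sort -s -c -t "$(printf '\t')" -k3,3 -k4,4n 2>/dev/null
}
```

Calling sam_is_sorted hits.sam && echo sorted prints "sorted" only when the records are already in the order Cufflinks expects.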



    • #3
      SAM files are far larger than BAM files, and you are going to want to compress them for the long term anyway. Since you are having problems with them now, you should probably just compress them now.



      • #4
        @yueluo May I know how I can do that? Does that mean I should assign coordinates to each mapped sample (different conditions) so that Cuffmerge can recognise it? Which tool can I use in Galaxy?

        Hope to hear from you soon. I am really new to bioinformatics tools. Thanks guys =)



        • #5
          What did your error message look like?
          Here is a quote from the Cufflinks documentation on how to sort the SAM file:
          The SAM file supplied to Cufflinks must be sorted by reference position. If you aligned your reads with TopHat, your alignments will be properly sorted already. If you used another tool, you may want to make sure they are properly sorted as follows:

          sort -k 3,3 -k 4,4n hits.sam > hits.sam.sorted
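One caveat with a plain sort: if the SAM file has header lines (those starting with @), they get sorted in among the alignments. A minimal sketch that keeps the header on top, assuming a tab-separated SAM file and a Unix shell; the function name sort_sam and the file names are only for illustration:

```shell
# Hypothetical helper: sort a SAM file by reference name, then position,
# keeping any header lines at the top of the output.
sort_sam() {
    src="$1"
    dst="$2"
    tab="$(printf '\t')"
    # Header lines start with "@"; copy them first. grep exits non-zero when
    # there are none, which is fine for a headerless SAM file.
    grep '^@' "$src" > "$dst" || true
    # Sort the alignment records: column 3 (reference name) as text,
    # column 4 (leftmost position) as a number.
    grep -v '^@' "$src" | LC_ALL=C sort -t "$tab" -k3,3 -k4,4n >> "$dst"
}
```

Usage would be sort_sam hits.sam hits.sorted.sam. Setting LC_ALL=C makes the reference names compare byte by byte, so chr10 sorts before chr2.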



          • #6
            @yueluo Thanks. I realise now that it would be easier to align my reads with TopHat instead of CLC Bio. The problem is that I am using Cufflinks through the Galaxy website, and I am unsure how to enter the command above there. If I am not wrong, that command only works on a Linux command line, right?

            Thanks for your help =)



            • #7
              @yueluo By the way, are you a frequent user of Galaxy?



              • #8
                @jasminegirl
                Sorry, no... I have access to local clusters/servers.



                • #9
                  @yueluo Oh, I see. Thanks anyway for your help. I will update you on my progress.
