Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • What linux spreadsheet program can you open .sam files in?

    Tried opening it in libre office calc but the program just freezes

  • #2
    Are we talking about an alignment file? Why do you want to open it in a spreadsheet program? If you need to make a change there may be an alternate option available.

    Comment


    • #3
      Well, you can probably open it in libre office if it's small enough (it does read it into memory). Why you would actually want to do that is beyond me.

      Comment


      • #4
        Originally posted by dpryan View Post
        Well, you can probably open it in libre office if it's small enough (it does read it into memory). Why you would actually want to do that is beyond me.
        Basically to write a program

        Comment


        • #5
          You really don't want to write macros to deal with data of this size. Try python/perl/C/whatever. You can use R, if you prefer, but I think it usually reads everything into memory too.

          Comment


          • #6
            Originally posted by dpryan View Post
            You really don't want to write macros to deal with data of this size. Try python/perl/C/whatever. You can use R, if you prefer, but I think it usually reads everything into memory too.
            I'm actually trying to write in python. How do I notate a certain variable in a sam file (I'm very new to python bioinformatics).

            Comment


            • #7
              Generally something like the following will work:

              Code:
              import csv
              
              sam = csv.reader(open("foo.sam","r"), dialect="excel-tab")
              for line in sam :
                  print("QNAME: %s" % (line[0]))
                  print("Sequence: %s" % (line[9]))
              So, just use the column number. You can also use pysam, which makes some things much easier. If you're comfortable with C, I can also recommend the samtools C API. If you need higher performance, you'll find it quite useful.

              Comment


              • #8
                Originally posted by dpryan View Post
                Generally something like the following will work:

                Code:
                import csv
                
                sam = csv.reader(open("foo.sam","r"), dialect="excel-tab")
                for line in sam :
                    print("QNAME: %s" % (line[0]))
                    print("Sequence: %s" % (line[9]))
                So, just use the column number. You can also use pysam, which makes some things much easier. If you're comfortable with C, I can also recommend the samtools C API. If you need higher performance, you'll find it quite useful.
                So that's basically exactly what I was looking for. Thanks!

                Comment


                • #9
                  Glad I could help. BTW, I didn't deal with the header in my example. You might check "if(len(line) > 5): stuff" or check for a @ as the first character to get past the header (unless you want to parse it).

                  Comment


                  • #10
                    Originally posted by dpryan View Post
                    Glad I could help. BTW, I didn't deal with the header in my example. You might check "if(len(line) > 5): stuff" or check for a @ as the first character to get past the header (unless you want to parse it).
                    Right, I'll have to figure out some of the details. Right now, I just have an idea what I want to perform, but no real idea how to execute. But I did just read through a tutorial that kind of went over the sort of info you presented, I was just unaware that you could do this directly to a sam file.

                    Comment


                    • #11
                      You might want to consider the pysam module: http://wwwfgu.anat.ox.ac.uk/~andreas...tools/api.html and https://code.google.com/p/pysam/

                      (EDIT: I see Devon already suggested this - apologies for the duplication0
                      Last edited by LeightonP; 12-10-2013, 02:30 PM. Reason: Noticed duplication of advice.

                      Comment


                      • #12
                        Visualize Bam

                        FYI: I wrote a simple java-based GUI to visualize some BAMS: https://github.com/lindenb/jvarkit/wiki/BamViewGui

                        Comment

                        Latest Articles

                        Collapse

                        • seqadmin
                          Essential Discoveries and Tools in Epitranscriptomics
                          by seqadmin


                          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
                          Yesterday, 07:01 AM
                        • seqadmin
                          Current Approaches to Protein Sequencing
                          by seqadmin


                          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                          04-04-2024, 04:25 PM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by seqadmin, 04-11-2024, 12:08 PM
                        0 responses
                        39 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 04-10-2024, 10:19 PM
                        0 responses
                        41 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 04-10-2024, 09:21 AM
                        0 responses
                        35 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 04-04-2024, 09:00 AM
                        0 responses
                        55 views
                        0 likes
                        Last Post seqadmin  
                        Working...
                        X