Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • KnowNothing2
    Member
    • Sep 2013
    • 49

    What linux spreadsheet program can you open .sam files in?

    Tried opening it in libre office calc but the program just freezes
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    Are we talking about an alignment file? Why do you want to open it in a spreadsheet program? If you need to make a change there may be an alternate option available.

    Comment

    • dpryan
      Devon Ryan
      • Jul 2011
      • 3478

      #3
      Well, you can probably open it in libre office if it's small enough (it does read it into memory). Why you would actually want to do that is beyond me.

      Comment

      • KnowNothing2
        Member
        • Sep 2013
        • 49

        #4
        Originally posted by dpryan View Post
        Well, you can probably open it in libre office if it's small enough (it does read it into memory). Why you would actually want to do that is beyond me.
        Basically to write a program

        Comment

        • dpryan
          Devon Ryan
          • Jul 2011
          • 3478

          #5
          You really don't want to write macros to deal with data of this size. Try python/perl/C/whatever. You can use R, if you prefer, but I think it usually reads everything into memory too.

          Comment

          • KnowNothing2
            Member
            • Sep 2013
            • 49

            #6
            Originally posted by dpryan View Post
            You really don't want to write macros to deal with data of this size. Try python/perl/C/whatever. You can use R, if you prefer, but I think it usually reads everything into memory too.
            I'm actually trying to write in python. How do I notate a certain variable in a sam file (I'm very new to python bioinformatics).

            Comment

            • dpryan
              Devon Ryan
              • Jul 2011
              • 3478

              #7
              Generally something like the following will work:

              Code:
              import csv
              
              sam = csv.reader(open("foo.sam","r"), dialect="excel-tab")
              for line in sam :
                  print("QNAME: %s" % (line[0]))
                  print("Sequence: %s" % (line[9]))
              So, just use the column number. You can also use pysam, which makes some things much easier. If you're comfortable with C, I can also recommend the samtools C API. If you need higher performance, you'll find it quite useful.

              Comment

              • KnowNothing2
                Member
                • Sep 2013
                • 49

                #8
                Originally posted by dpryan View Post
                Generally something like the following will work:

                Code:
                import csv
                
                sam = csv.reader(open("foo.sam","r"), dialect="excel-tab")
                for line in sam :
                    print("QNAME: %s" % (line[0]))
                    print("Sequence: %s" % (line[9]))
                So, just use the column number. You can also use pysam, which makes some things much easier. If you're comfortable with C, I can also recommend the samtools C API. If you need higher performance, you'll find it quite useful.
                So that's basically exactly what I was looking for. Thanks!

                Comment

                • dpryan
                  Devon Ryan
                  • Jul 2011
                  • 3478

                  #9
                  Glad I could help. BTW, I didn't deal with the header in my example. You might check "if(len(line) > 5): stuff" or check for a @ as the first character to get past the header (unless you want to parse it).

                  Comment

                  • KnowNothing2
                    Member
                    • Sep 2013
                    • 49

                    #10
                    Originally posted by dpryan View Post
                    Glad I could help. BTW, I didn't deal with the header in my example. You might check "if(len(line) > 5): stuff" or check for a @ as the first character to get past the header (unless you want to parse it).
                    Right, I'll have to figure out some of the details. Right now, I just have an idea what I want to perform, but no real idea how to execute. But I did just read through a tutorial that kind of went over the sort of info you presented, I was just unaware that you could do this directly to a sam file.

                    Comment

                    • LeightonP
                      Member
                      • Feb 2011
                      • 29

                      #11
                      You might want to consider the pysam module: http://wwwfgu.anat.ox.ac.uk/~andreas...tools/api.html and https://code.google.com/p/pysam/

                      (EDIT: I see Devon already suggested this - apologies for the duplication0
                      Last edited by LeightonP; 12-10-2013, 02:30 PM. Reason: Noticed duplication of advice.

                      Comment

                      • lindenb
                        Senior Member
                        • Apr 2010
                        • 143

                        #12
                        Visualize Bam

                        FYI: I wrote a simple java-based GUI to visualize some BAMS: https://github.com/lindenb/jvarkit/wiki/BamViewGui

                        Comment

                        Latest Articles

                        Collapse

                        • SEQadmin2
                          Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                          by SEQadmin2


                          I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                          Here are nine questions we think about, in roughly the order they matter, before...
                          06-18-2026, 07:11 AM
                        • SEQadmin2
                          From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                          by SEQadmin2


                          Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                          The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                          ...
                          06-02-2026, 10:05 AM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by SEQadmin2, Today, 05:37 AM
                        0 responses
                        5 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 06-26-2026, 11:10 AM
                        0 responses
                        16 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 06-17-2026, 06:09 AM
                        0 responses
                        49 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 06-09-2026, 11:58 AM
                        0 responses
                        109 views
                        0 reactions
                        Last Post SEQadmin2  
                        Working...