Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Rookie's plee for advice: PERL or Python, Linux or Windows?

    Hello kind readers,

    To be able to use the majority of the command line based tools enlisted in the famous http://seqanswers.com/forums/showthread.php?t=43, which programming language would you recommend we learn: PERL or Python? and on which operating system? Are there any books that you could recommend that could bring us a little closer to your level of expertise?

    We are undergrads who have been enslaved by some nice professors and forced to do their dirty Bioinformatics... supposedly we were capable enough of doing it all with a set of (free) downloadable/web-based tools, all having a graphic user interface (GUI) rather than being command line based. Without the capability of using command line tools (due to our lacking expertise) we had to resort to some quirky and dodgy comprimises. Apparently they liked the results and have come back for more, save us!

    Thanks

  • #2
    Stick to Linux, most bioinformatics tools have a UNIX like environment in mind. I'm a Perl guy who is slowly migrating to Python. R for data visualization and some analysis is invaluable!

    Comment


    • #3
      This link:



      has a great tutorial for basic linux and perl commands. Click 'just the documentation' to download the pdf. Do at the least the first half.

      Comment


      • #4
        For the best chance of getting existing command line tools to work you definitely want to go with 64-bit linux.

        As far as writing your own scripts to do simple processing of your data you can pretty much pick whichever scripting language you like - you should probably go with what you're more familiar with. We tend to go with Perl since we know it well, but if you're starting from scratch there probably isn't much to choose between Perl and Python. R is pretty good for visualisation, but I'm not sure I'd want to do basic data processing with it.

        Comment


        • #5
          Wow thank you all! So kind

          So it seems like Linux for operating system, PERL for programming language and R for data visualisation (admittedly a new term to us, but noted nonetheless).

          Thanks everyone.

          Comment


          • #6
            Personally I went for Python rather than Perl, but a key factor would be what do the local experts use (people in your group or department). Having someone even a bit more experienced around to turn to for help in person is very useful - although there is plenty of stuff online.

            In terms of the OS, for NGS pretty much everything works on Linux, and will probably work on the Mac (sometimes with minor tweaks), but Windows is a pain. Note that Perl, Python and R are all cross platform - I'm talking about things like assembly tools and mappers. So this is another vote for picking Linux over Windows.

            Also go for a 64bit OS rather than a 32bit OS given the choice, as with NGS data you will want to be able to use more than 4GB of RAM (assuming you machine has that much, and if it doesn't you'll need it sooner rather than later).

            Comment


            • #7
              Ah thanks Maubp! Nice to read from you again.

              Right, so we'll have a check with the local experts then.

              Much appreciated specs advice too, but what about CPU?

              Best regards.

              Comment


              • #8
                Regarding machine specs, try searching this forum - there have been plenty of other threads. Also keep in mind that it will depend on what you are doing (assembling a virus is easier than a plant, mapping is easier than de novo assembly).

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Essential Discoveries and Tools in Epitranscriptomics
                  by seqadmin




                  The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                  04-22-2024, 07:01 AM
                • seqadmin
                  Current Approaches to Protein Sequencing
                  by seqadmin


                  Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                  04-04-2024, 04:25 PM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, Today, 11:49 AM
                0 responses
                11 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, Yesterday, 08:47 AM
                0 responses
                16 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 04-11-2024, 12:08 PM
                0 responses
                61 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 04-10-2024, 10:19 PM
                0 responses
                60 views
                0 likes
                Last Post seqadmin  
                Working...
                X