Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Calico
    Member
    • Jan 2010
    • 12

    Preferred languages to work in?

    Hello everybody,

    My rather limited bioinfromatics skills come from having done some microarray data analysis in R (following a template code) and some minor coursework. So, I consider myself quite a newbie to the subject. I will quite soon be shaking hands with some sequencing data (from a Helicos machine) and need to prepare myself for this.

    Being of a younger generation, I would say I can handle computers pretty well. So far I have, as recommended in this nice thread, started to take a look at the Unix and Perl for Biologist tutorial and installed Ubuntu in Virtual PC on my Windows computer.

    What I'd like to ask you, SEQanswers community, is whether you can suggest me anything helpful. Am I starting out in the right way? I will get some bioinformatics help along the way, though I am unsure to what extent. Also, I see this as a part of my future career, so I am not just doing this for one particular project.

    Edit: I have just realized that the Helicos software package uses Python.
    Last edited by Calico; 04-20-2010, 08:07 AM.
  • Broadie
    Member
    • Oct 2009
    • 15

    #2
    learn Java for now and Python for later. Many of the startup companies are looking for these skills combined with bioinformatics analysis skills.

    Comment

    • lvaruzza
      Junior Member
      • Feb 2008
      • 8

      #3
      I think the oposite is better

      I think it's better to learn python first because it's software development is easer to use, you just need the interpreter and a text editor, java on the other hand needs eclipse, maven, ant, XML files, etc, etc.

      Learn the basic concepts in phyton quick and them go to Java.

      Comment

      • Broadie
        Member
        • Oct 2009
        • 15

        #4
        Maybe so, the reason I recommended Java first was it would open up more career opportunities, and Python is still relatively new but growing in demand.

        Comment

        • lvaruzza
          Junior Member
          • Feb 2008
          • 8

          #5
          Maybe there is more jobs for java programmers, but learn python will not hurt and will be faster them learn java.

          PS: Python is 4 years older than java:


          Comment

          • sklages
            Senior Member
            • May 2008
            • 628

            #6
            I'd go for perl :-)

            Comment

            • thinkRNA
              Member
              • Jan 2010
              • 94

              #7
              Learning PERL is very easy:I have taught college student in 7 days using exercises from this awesome bible of mine: http://oreilly.com/catalog/9780596001322
              if you have never programmed before, after you can write PERL scripts (~3 months regular practise), you can start python as the transition is easy -> it will also give you a lot of confidence which is really what biologists' biggest problem is.

              People, what do you all think about C++ versus Java, which is more sought after in industry?

              Comment

              • Bukowski
                Senior Member
                • Jan 2010
                • 388

                #8
                Originally posted by Calico View Post
                Hello everybody,

                My rather limited bioinfromatics skills come from having done some microarray data analysis in R (following a template code) and some minor coursework. So, I consider myself quite a newbie to the subject. I will quite soon be shaking hands with some sequencing data (from a Helicos machine) and need to prepare myself for this.

                Being of a younger generation, I would say I can handle computers pretty well. So far I have, as recommended in this nice thread, started to take a look at the Unix and Perl for Biologist tutorial and installed Ubuntu in Virtual PC on my Windows computer.

                What I'd like to ask you, SEQanswers community, is whether you can suggest me anything helpful. Am I starting out in the right way? I will get some bioinformatics help along the way, though I am unsure to what extent. Also, I see this as a part of my future career, so I am not just doing this for one particular project.

                Edit: I have just realized that the Helicos software package uses Python.
                If you're looking to leverage the Bio* libraries (BioPerl, BioJava, BioPython, BioRuby etc.) just be aware they are not all equally mature, nor equally good as each other for certain tasks.

                Depending on what your likely use case is, I would check to see what is mature and developed in your area. I've run into issues a couple of times recently with BioRuby that have been easily solved by switching to BioPython.

                As an ex-Perlist, I can really only suggest that for a nice balance of language design and tooling maturity - Python is a good way to go. I'd still love to get some competency in Java however, but if you want things to work, and work fast, let's face it you will get much more traction in Python first.

                Comment

                • Broadie
                  Member
                  • Oct 2009
                  • 15

                  #9
                  I taught myself Perl as well, its very easy and I'd say a good introductory language.

                  Comment

                  • Calico
                    Member
                    • Jan 2010
                    • 12

                    #10
                    I was hoping for a uniform answer. This leaves me with some thinking to do.

                    Thanks for the input.

                    Comment

                    • mudshark
                      Senior Member
                      • Jan 2009
                      • 138

                      #11
                      just to add my 2 cents:

                      a) threading in JAVA is easy! if you plan to work with huge datasets on multicore processors this is an important issue. i managed to reduce computation times in orders of magnitude by exporting my data let's say from R and do multicore processing in java.

                      b) programming is very easy when you have powerful IDE's that allow for easy variable/method renaming and other fancy stuff. I use intellij IDEA and coding is a charm. if you plan to do a lot of coding this might be a relevant issue as well.

                      Comment

                      • Broadie
                        Member
                        • Oct 2009
                        • 15

                        #12
                        Originally posted by Calico View Post
                        I was hoping for a uniform answer. This leaves me with some thinking to do.

                        Thanks for the input.
                        I suggest you make an evidence-based decision. Go to monster.com, careerbuilder.com, and job sections for various organizations/companies seeking bioinformatics people, do a search for "bioinformatics", find the job listings that sound like what you want to do, and see what languages they are looking for. Based on what I've seen, they look for Java, Python, and Perl and a few others.

                        Comment

                        • lh3
                          Senior Member
                          • Feb 2008
                          • 686

                          #13
                          Another vote for python, especially when you are dealing with data in various custom formats. Java is more appropriate when you develop algorithms or when modules are available for tedious format parsing. Between perl and python, it seems that most people who know both perl and python like python better, although I know perl only.

                          Comment

                          • dnusol
                            Senior Member
                            • Jul 2009
                            • 136

                            #14
                            My opinion from a biologist point of view...
                            I've tried two times to learn Java and finally gave up, maybe because my brain is not made up for understanding so much of classes attributes etc. I taught myself Perl and R (with help from userlists, of course!) with some good results (I would say). My next steps: continue improving Perl and start Python.

                            HTH

                            Comment

                            • quinlana
                              Senior Member
                              • Sep 2008
                              • 119

                              #15
                              One more vote for Python. I spent my grad career developing in Perl and C++. For some reason, I always wrote Perl code as "one-off" scripts. Moreover, the syntax and code structure made it difficult for me to make sense of even my most well-documented Perl scripts.

                              On the other hand, from the moment I began using Python, I started developing more in the style I do when writing C++ tools: that is, with an emphasis on code "reusability" and clarity. For some reason, Python just "feels" good and lends itself to clean, simple and reusable code. The best part is that its approach to object orientation is rather well done and allows one to think the same way as when writing Java or C++ code.

                              Therefore, I think learning Python first with a concerted effort towards learning objects, iterators, inheritance and developing modules, would be best. It will make a transition to Java or C++ much easier.

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Pathogen Surveillance with Advanced Genomic Tools
                                by seqadmin




                                The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
                                Today, 11:48 AM
                              • seqadmin
                                New Genomics Tools and Methods Shared at AGBT 2025
                                by seqadmin


                                This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

                                The Headliner
                                The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
                                03-03-2025, 01:39 PM
                              • seqadmin
                                Investigating the Gut Microbiome Through Diet and Spatial Biology
                                by seqadmin




                                The human gut contains trillions of microorganisms that impact digestion, immune functions, and overall health1. Despite major breakthroughs, we’re only beginning to understand the full extent of the microbiome’s influence on health and disease. Advances in next-generation sequencing and spatial biology have opened new windows into this complex environment, yet many questions remain. This article highlights two recent studies exploring how diet influences microbial...
                                02-24-2025, 06:31 AM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, 03-20-2025, 05:03 AM
                              0 responses
                              26 views
                              0 reactions
                              Last Post seqadmin  
                              Started by seqadmin, 03-19-2025, 07:27 AM
                              0 responses
                              33 views
                              0 reactions
                              Last Post seqadmin  
                              Started by seqadmin, 03-18-2025, 12:50 PM
                              0 responses
                              25 views
                              0 reactions
                              Last Post seqadmin  
                              Started by seqadmin, 03-03-2025, 01:15 PM
                              0 responses
                              190 views
                              0 reactions
                              Last Post seqadmin  
                              Working...