Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • chayan
    Member
    • Nov 2012
    • 52

    Clustering a two-D matrix

    Hii All,

    I have generated a 2-D matrix based on NGS data which has cog scores in one axis (Y-axis) and sample no in other axis. Now i want to cluster this matrix based on the cog scores to find similarity between different sampling points and finally construct a UPGMA tree based on the output. Can any one suggest me a simple way to do this?? I don't have much knowledge about this...

    Thanks for any help in advance...

    Regards

    Chayan
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    Try MEGA: http://megasoftware.net/

    You should read the manual first to see if this would be appropriate to do with the dataset you have.

    Comment

    • southan
      Member
      • May 2011
      • 11

      #3
      You can use the Mclust package.

      (http://www.stat.washington.edu/mclust/)

      Change the parameter modelNames to obtain a suitable model (e.g., EII).

      Comment

      • chayan
        Member
        • Nov 2012
        • 52

        #4
        thanks to both of you. As i am going through both the manuals, it will take some time as i am a hardcore biology guy, i want to clear one my query.. my matrix structure is like this
        A B C D E
        i 20 30 15 43 87
        ii 12 54 3 76 56
        iii 45 78 4 99 54

        now i want to cluster this based on the A, B, C, D, E based on the values of i, ii and iii...can i use "beta_diversity.py" from the qiime package..? then dont know basically how to generate to file format required for this as the prior steps are not same...


        thanks

        Comment

        • chayan
          Member
          • Nov 2012
          • 52

          #5
          Originally posted by southan View Post
          You can use the Mclust package.

          (http://www.stat.washington.edu/mclust/)

          Change the parameter modelNames to obtain a suitable model (e.g., EII).
          thanks.. but i have never used R also... what i understand from the Mclust manual, if my matrix is like this

          A B C D E
          i 20 30 15 43 87
          ii 12 54 3 76 56
          iii 45 78 4 99 54

          i should use the modelName "VII"/"VVV" or other unequal volume or varying volume from the multivariate mixture??

          another thing, how to prepare the input file from the excel to feed into mclust??

          thanks a lot

          Comment

          Latest Articles

          Collapse

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, Today, 06:09 AM
          0 responses
          13 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-09-2026, 11:58 AM
          0 responses
          34 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-05-2026, 10:09 AM
          0 responses
          39 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-04-2026, 08:59 AM
          0 responses
          44 views
          0 reactions
          Last Post SEQadmin2  
          Working...