Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • apporach a huge matrix data: Python or R or MATLAB

    I have file which looks like:
    *tab separated*

    0 0.11692212 0.12066375 0.1096388
    0.11692212 0 0.09939274 0.0684602
    0.12066375 0.09939274 0 0.13651687
    0.1096388 0.0684602 0.13651687 0
    In essence it means:

    A B C D
    A 0 0.11692212 0.12066375 0.1096388
    B 0.11692212 0 0.09939274 0.0684602
    C 0.12066375 0.09939274 0 0.13651687
    D 0.1096388 0.0684602 0.13651687 0
    Value of A against A, value of A against B,
    Value of B against A, and so on..
    Where value for A against B is same as B against A, and further for other columns and row applies the same.

    I have a file with 350X350 or 490X490 matrix numbers, which follow the exact same pattern as above.
    How do I approach such set if I got to have say top 4 or top 10 values with their row and column number.

    Should I proceed with Python [ I work in python] or any statistical tool? R or MATLAB?

    I know this doesn't relate to Bioinformatics, or NGS, but approach sometimes means a lot.
    Last edited by bio_informatics; 09-22-2014, 02:15 PM.
    Bioinformaticscally calm

  • #2
    With such a small matrix it shouldn't much matter which language you use. With python you'd want to use numpy rather than the base functionality, but aside from that you shouldn't have issues in any of the languages. If you're most familiar with python then just use it.

    Comment


    • #3
      Hi,
      Thank you for your reply. I put a sample matrix. I have matrix of more than 350x350, or 490X490.
      Bioinformaticscally calm

      Comment


      • #4
        As dpryan said those are small enough not to matter what you use, whatever is comfortable.
        ( and we did not mean the sample you posted
        but the few hundred square matricies you mention)

        If you have tens of thousands of those ~500^2 matricies to process maybe worry a bit more.

        Comment


        • #5
          @tomc-
          Alright. Shall proceed with numpy then.
          Bioinformaticscally calm

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Essential Discoveries and Tools in Epitranscriptomics
            by seqadmin




            The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
            04-22-2024, 07:01 AM
          • seqadmin
            Current Approaches to Protein Sequencing
            by seqadmin


            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
            04-04-2024, 04:25 PM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, Yesterday, 11:49 AM
          0 responses
          13 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-24-2024, 08:47 AM
          0 responses
          16 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-11-2024, 12:08 PM
          0 responses
          61 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 10:19 PM
          0 responses
          60 views
          0 likes
          Last Post seqadmin  
          Working...
          X