  • MG-RAST - Efficient downloads?

    Hello,
    I want to download the hierarchical SEED Subsystems information for 30 metagenomes from MG-RAST. Through the API I've managed to obtain the tables for each of the 4 levels separately, but I'd like to have the integrated table you get from the "Analysis" section, which actually shows the hierarchy of the system.
    The problem is that when you ask MG-RAST to include that many samples (30), it tends to crash.
    Does anyone know a better way? Maybe via the same API?
    Cheers,
    fibar
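One possible workaround is to assemble the integrated table locally instead of asking the web interface to render all 30 samples at once. The sketch below assumes you already have two things in memory: the function-level counts for each sample, and a mapping from each function to its three parent levels. Both input shapes and every name here (`build_hierarchy_table`, `roll_up`, `hierarchy`, `function_counts`) are hypothetical and are not the MG-RAST API's own format.

```python
from collections import defaultdict


def build_hierarchy_table(hierarchy, function_counts):
    """Join per-sample function counts with their parent levels.

    hierarchy: {function: (level1, level2, level3)}   (assumed input shape)
    function_counts: {sample: {function: count}}      (assumed input shape)
    """
    samples = sorted(function_counts)
    rows = defaultdict(lambda: [0] * len(samples))
    for col, sample in enumerate(samples):
        for function, count in function_counts[sample].items():
            # Functions missing from the mapping are kept, not dropped.
            levels = hierarchy.get(function, ("Unclassified",) * 3)
            rows[levels + (function,)][col] += count
    header = ["level1", "level2", "level3", "function"] + samples
    table = [list(key) + counts for key, counts in sorted(rows.items())]
    return header, table


def roll_up(table, depth):
    """Sum the per-sample counts over the first `depth` hierarchy levels."""
    totals = {}
    for row in table:
        key = tuple(row[:depth])
        counts = row[4:]  # the first four columns are the hierarchy labels
        if key in totals:
            totals[key] = [a + b for a, b in zip(totals[key], counts)]
        else:
            totals[key] = list(counts)
    return totals
```

Because each sample is just one more column, the samples can be fetched one at a time and added to `function_counts` as they arrive, and `roll_up(table, 1)` gives the level 1 totals from the same table.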

  • #2
    Before downloading the SEED subsystems, please check the validity of the MG-RAST annotations. I was surprised by how many crashes and annotation errors there are. Compare orfs.faa or orfs.fna against the SEED organism or function annotations and you will find data that do not correspond: for example, the start-end coordinates of an ORF in the query differ from the start-end coordinates recorded for the same query in the annotation files. I have started to doubt the quality of the MG-RAST annotations. No biologist should blindly trust the black box of automated software and web servers; I would suggest a control test for each step of the data processing.
    __Bach__
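The coordinate check described above could be sketched roughly as follows. It assumes the start-end coordinates have already been parsed out of the ORF file and the annotation file into dictionaries keyed by ORF identifier; the parsing itself is left out because the file layouts are not given in this thread, and the function name `find_coordinate_mismatches` is hypothetical.

```python
def find_coordinate_mismatches(orf_coords, annotation_coords):
    """Report annotated ORFs whose coordinates disagree with the ORF file.

    orf_coords: {orf_id: (start, end)} parsed from the ORF file (assumed shape)
    annotation_coords: {orf_id: (start, end)} parsed from the annotation file
    Returns a list of (orf_id, orf_file_coords, annotation_coords) tuples;
    orf_file_coords is None when the ORF is absent from the ORF file.
    """
    mismatches = []
    for orf_id, annotated in sorted(annotation_coords.items()):
        predicted = orf_coords.get(orf_id)
        if predicted is None or predicted != annotated:
            mismatches.append((orf_id, predicted, annotated))
    return mismatches
```

An empty result means every annotated ORF matches the ORF file; anything else is a list of records worth inspecting by hand before trusting the downstream tables.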
