  • ramirob
    Member
    • Apr 2012
    • 16

    Organizing projects, data inputs and output

    Hello!

    Do you have any suggestions, pointers, references, or models on how to organize projects and their corresponding raw data, R Markdown files, results, etc.?

    We do a lot of bioinformatics projects. Most of them start from raw data such as FASTQ files, alignment files, differential expression count matrices, etc. We develop some kind of pipeline and data processing, and as output we generate a report along with output data. We need to organize this information so that we can answer questions such as:

    - What have we done for investigator X?
    - When was the last time that we did such and such analysis?
    - What was the output for the analysis we did for investigator X or project Y?
    - I need the output for project Y again.
    - For whom have we done analysis type Z (e.g. differential expression)?
    - What did project X, which we did in 2010, consist of? What were the raw input data and the output?

    And other things like this. Right now, to answer some of these questions I basically do a very crude search of the directory tree, which keeps getting bigger, and rely on memory, past emails, etc. Dangerous.

    Any suggestions? Including any commercial software for helping with these kinds of things?

    Thanks,
    Ramiro
  • hoytpr
    Member
    • Dec 2009
    • 62

    #2
    Ramiro, you're gonna need a queryable database. If MySQL is not on your favorites list, Microsoft Access has a business template that can be hacked to run a core facility. It's a pain to transfer old data, but it'll work for most of the things you've asked about. My suggestion is to record project start dates in the yyyymmdd format wherever possible.
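To make the queryable-database idea concrete, here is a minimal sketch in Python. It uses the standard library's sqlite3 module as a stand-in for MySQL or Access, which is an assumption on my part and not something recommended in the thread. The table layout, the column names, and the helper functions (open_db, add_project, and so on) are all hypothetical; a real core facility would likely need more tables (samples, reports, pipeline versions). It only shows how a few of the questions above turn into queries, and why yyyymmdd start dates are convenient: as fixed-width text they sort in chronological order.

```python
import sqlite3

# Hypothetical schema: one row per project. start_date is stored as
# yyyymmdd text, so plain string ordering is also chronological ordering.
SCHEMA = """
CREATE TABLE IF NOT EXISTS projects (
    id            INTEGER PRIMARY KEY,
    name          TEXT NOT NULL,
    investigator  TEXT NOT NULL,
    analysis_type TEXT NOT NULL,
    start_date    TEXT NOT NULL,
    input_path    TEXT,
    output_path   TEXT
)
"""


def open_db(path=":memory:"):
    """Open (or create) the project database and make sure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn


def add_project(conn, name, investigator, analysis_type, start_date,
                input_path=None, output_path=None):
    """Register a project; start_date must be an 8-digit yyyymmdd string."""
    if len(start_date) != 8 or not start_date.isdigit():
        raise ValueError("start_date must look like yyyymmdd, e.g. 20100315")
    conn.execute(
        "INSERT INTO projects "
        "(name, investigator, analysis_type, start_date, input_path, output_path) "
        "VALUES (?, ?, ?, ?, ?, ?)",
        (name, investigator, analysis_type, start_date, input_path, output_path),
    )
    conn.commit()


def projects_for_investigator(conn, investigator):
    """What have we done for investigator X? Oldest project first."""
    return conn.execute(
        "SELECT name, analysis_type, start_date, output_path "
        "FROM projects WHERE investigator = ? ORDER BY start_date",
        (investigator,),
    ).fetchall()


def last_analysis_date(conn, analysis_type):
    """When did we last start an analysis of this type? None if never."""
    row = conn.execute(
        "SELECT MAX(start_date) FROM projects WHERE analysis_type = ?",
        (analysis_type,),
    ).fetchone()
    return row[0]


def investigators_for_analysis(conn, analysis_type):
    """For whom have we done analysis type Z? Sorted, without duplicates."""
    rows = conn.execute(
        "SELECT DISTINCT investigator FROM projects "
        "WHERE analysis_type = ? ORDER BY investigator",
        (analysis_type,),
    ).fetchall()
    return [r[0] for r in rows]
```

Passing a file path to open_db instead of the default ":memory:" would keep the records between sessions. Storing input_path and output_path next to each project is what lets "I need the output for project Y again" become a lookup rather than a search of the directory tree.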
