Go Back   SEQanswers > Bioinformatics > Bioinformatics

Similar Threads
Thread Thread Starter Forum Replies Last Post
Introducing Skymap: a single data matrix for each omic layer for each species brianytsui Bioinformatics 0 08-12-2018 03:20 PM
cufflinks generated >400,000 transcripts?? IrisZhu Bioinformatics 13 05-15-2014 08:15 PM

Thread Tools
Old 10-16-2018, 09:01 AM   #1
Junior Member
Location: UCSD, blog:

Join Date: Aug 2018
Posts: 4
Default Introducing Skymap-JHub: accessing omics from >400,000 sequencing runs made simple

Quick start: Click here to use our JupyterHub to go from omic-data to publication figures in < 1 minute (YOUR CAN DO THIS EVEN ON YOUR PHONE)

Every paper that is published with sequencing data is supposed to put their data in the SRA. And those data are generated by your tax money, but the crappy part is that everyone uses their own pipeline to generate the results which make it almost impossible to reuse easily. And one day I decided to run >400k raw sequencing data through the same pipeline, and generated an omic matrix for each data layer which everyone can query and go from data to some common publication figures in a minute.

The reason I am posting here is to get some feedback from the community to see what are the data layers they want to see. The project is still in the early phase, your comments will be deeply valued and decide on how the project is going to be. Currently, the project only offers transcript counts and allelic read counts.

Related blog posts:
Overview of the project (Intro): A preview of the Skymap project: Extracting allelic read counts and expression profiles of >400,000 public sequencing runs and merging them into simple -omic matrices that can fit into your hard drive
Design rationale of the computational infrastructure (Method) How can a Jupyter notebook extract the expression levels or allelic read counts from > 200,000 sequencing runs in seconds?

GitHub page


Last edited by brianytsui; 10-18-2018 at 12:16 AM.
brianytsui is offline   Reply With Quote

big data, rnaseq, variants, wgs, wxs

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

All times are GMT -8. The time now is 04:40 AM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2021, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO