Go Back   SEQanswers > Bioinformatics > Bioinformatics

Similar Threads
Thread Thread Starter Forum Replies Last Post
Concerns for combining data from HiSeq 2000 and HiSeq 2500 jaaker Illumina/Solexa 1 02-04-2013 02:56 PM
Illumina iScan example data sets c_ro87 Bioinformatics 0 09-14-2012 09:12 AM
Evaluation of genomic high-throughput sequencing data generated on Illumina HiSeq and dan Literature Watch 1 11-09-2011 04:18 AM
sff_extract: combining data from 454 Flx and Titanium data sets agroster Bioinformatics 7 01-14-2010 10:19 AM
In silico data sets from BACs for GAII Illumina CG&R Bioinformatics 1 12-16-2009 06:37 AM

Thread Tools
Old 03-12-2013, 11:03 PM   #1
Location: New York

Join Date: Mar 2013
Posts: 10
Default Combining Illumina GA and HiSeq microRNA sequencing data-sets

I have a set of ~100 samples with microRNA sequencing data obtained using Illumina Genome Analyzer, and another set of ~200 samples with the data obtained using Illumina HiSeq 2000. The total ~300 samples belong to two groups, equally represented in the two data-sets. I am interested in differential expression analysis to compare microRNA expression between the two groups.

I want to combine the GA and HiSeq data (available as either absolute read counts and counts per million reads) to have a larger sample-size for the analyses.

The GA and HiSeq 2000 platforms use the same 'chemistry', and I understand that the main difference between them is that the latter has a higher throughput (processing time), so combining the data obtained with the two different platform seems reasonable.

Can anyone advise if this is indeed so? Further,

(1) Should one use the absolute read count values, or the count per million values?

(2) Should I normalize the data after combining the data-sets? What method will be appropriate?

(3) How should missing values be dealt with? E.g., unlike in the HiSeq data-set, microRNA miR-X may not have been detected in any sample of the GA data-set (and thus missing in it).

Thank you.
alpha2zee is offline   Reply With Quote

combining, illumina, microrna sequencing, normalization, platform

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

All times are GMT -8. The time now is 10:58 PM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO