**N311V** · 07-08-2014, 07:52 PM

Maybe start here, but remember this is only a guide. The appropriate workflow and analysis may differ for your data.

http://www.broadinstitute.org/gatk/guide/best-practices?bpm=index

**seqador** · 07-08-2014, 08:03 PM

Originally posted by N311V

Maybe start here, but remember this is only a guide. The appropriate workflow and analysis may differ for your data.

http://www.broadinstitute.org/gatk/g...ices?bpm=index

Thank you very much.

**blancha** · 07-08-2014, 08:04 PM

I've never used it myself, but the Galaxy project may be the best answer for you.
They offer a web-based interface to all these command line tools.
I don't know what the wait time is on their public server, though.

Galaxy: https://usegalaxy.org

I would not run the analysis on a personal desktop, unless you have a lot of RAM and hard drive space.
To run the analysis on your own Linux server, you'll need to install the following tools.
Trimming: Trimmomatic
Alignment: TopHat
Gene quantification: Cufflinks (or htseq-count and DESeq2, but these tools are a bit harder to use).

You'll also need the genome and its annotation.
I would recommend downloading the iGenome for the mouse.
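As a rough illustration of how the three tools chain together, here is a minimal Python sketch that only builds and prints the command lines (it does not run anything). It assumes single-end reads; the file names, the index prefix, the jar path, and the trimming thresholds are all placeholders I made up, so check each tool's manual before running anything for real.

```python
import shlex
from pathlib import Path


def build_commands(reads, genome_index, annotation_gtf, out_dir,
                   threads=4, trimmomatic_jar="trimmomatic.jar"):
    """Return the trim, align and quantify commands as argument lists.

    Assumes single-end reads. All paths are hypothetical placeholders.
    """
    out = Path(out_dir)
    trimmed = out / "trimmed.fastq.gz"
    tophat_dir = out / "tophat"
    cufflinks_dir = out / "cufflinks"

    # Trimming: Trimmomatic in single-end (SE) mode; thresholds are examples only.
    trim = ["java", "-jar", trimmomatic_jar, "SE", "-threads", str(threads),
            str(reads), str(trimmed),
            "LEADING:3", "TRAILING:3", "SLIDINGWINDOW:4:15", "MINLEN:36"]

    # Alignment: TopHat takes the index prefix, then the reads.
    align = ["tophat", "-p", str(threads), "-G", str(annotation_gtf),
             "-o", str(tophat_dir), str(genome_index), str(trimmed)]

    # Quantification: Cufflinks reads the accepted_hits.bam that TopHat writes.
    quant = ["cufflinks", "-p", str(threads), "-G", str(annotation_gtf),
             "-o", str(cufflinks_dir), str(tophat_dir / "accepted_hits.bam")]

    return [trim, align, quant]


# Dry run: print the commands so they can be reviewed before use.
for command in build_commands("sample.fastq.gz", "mm10_index",
                              "genes.gtf", "results"):
    print(shlex.join(command))
```

Printing first and running later makes it easier for someone more experienced to look over the commands before any compute time is spent.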

You can easily find all these resources by googling them.

If the wait time on the Galaxy public server is too long, you're probably better off finding a bioinformatician with access to a Unix server to help you.

I suppose it is your supervisor's idea that you do the analysis yourself. Time and again, I've seen principal investigators overestimate the ability of their wet lab students to use the Unix command line, as well as overestimate their students' knowledge of the basic statistics required to understand the results of a differential expression analysis.

**NextGenSeq** · 07-09-2014, 08:28 AM

The simple way to proceed is to download a demo version of CLC Bio or NextGene.

10 million reads is not very many. A decent desktop computer can align this.

**jwfoley** · 07-09-2014, 08:38 AM

Originally posted by blancha

I suppose it is your supervisor's idea that you do the analysis yourself. Time and again, I've seen principal investigators overestimate the ability of their wet lab students to use the Unix command line, as well as overestimate their students' knowledge of the basic statistics required to understand the results of a differential expression analysis.

Too true. "Computer literacy" is an excellent metaphor; if you're not already proficient with statistics and the Unix-like command line, then your PI's asking you to learn those things just to analyze one sequencing run is like asking you to learn Ancient Greek just to translate one document. I do encourage every scientist who works with large datasets to learn these things, but don't hold up your whole project for it. Even aside from the huge delay while you start your education from scratch, you're inevitably going to make mistakes and get wrong results (possibly without knowing it) the first time. At least get an expert to do the analysis for you and then go over her scripts to understand how they work.

**seqador** · 07-11-2014, 07:18 PM

Thanks to everybody who helped me!

**jwag** · 08-01-2014, 09:17 AM

You could check out Practical Computing for Biologists. It gives a pretty good intro to using command-line interfaces, setting up environments, etc.

**Zapages** · 08-02-2014, 06:36 AM

I would recommend iPlantCollaborative.org, as it has a lot of useful tools and guides on what to do and how to do everything, including visualization of the results.

**tomc** · 09-26-2014, 01:16 AM

Greg Wilson's Software Carpentry is designed to help people in your position.
Contact them, and convince your university to invite them down for a bootcamp.
In the meantime, they have their teaching materials online:

http://software-carpentry.org/lessons.html

N.B. I have no competing interests, just respect for Greg's work.

**QazSeDc** · 06-04-2015, 09:53 PM

Originally posted by jwfoley

Too true. "Computer literacy" is an excellent metaphor; if you're not already proficient with statistics and the Unix-like command line, then your PI's asking you to learn those things just to analyze one sequencing run is like asking you to learn Ancient Greek just to translate one document. I do encourage every scientist who works with large datasets to learn these things, but don't hold up your whole project for it. Even aside from the huge delay while you start your education from scratch, you're inevitably going to make mistakes and get wrong results (possibly without knowing it) the first time. At least get an expert to do the analysis for you and then go over her scripts to understand how they work.

I agree with jwfoley. If you don't have any bioinformatics skills, you'd better ask someone to do it for you for now. I think BGI does provide data analysis plans, but of course you'll have to pay extra.

Little desperate and alone (Help Me)