SEQanswers

Go Back   SEQanswers > Search Forums


Showing results 1 to 25 of 99
Search took 0.01 seconds.
Search: Posts Made By: jgibbons1
Forum: Bioinformatics 05-08-2013, 11:50 AM
Replies: 2
Views: 89
Posted By jgibbons1
Awesome...thank you @kmcarr...this worked...

Awesome...thank you @kmcarr...this worked perfectly

I also tried replacing "gzip -dc" with "zcat" and that seems to be a little faster

Thanks!
Forum: Bioinformatics 05-08-2013, 08:21 AM
Replies: 2
Views: 89
Posted By jgibbons1
wildcard use for bowtie2 unpaired read sets?

Hi all (and specifically @salzberg),

I've been trying to figure out a way to map a batch of unpaired fastq.gz files in the bowtie2 command line without manually entering a comma separated list of...
Forum: Bioinformatics 05-07-2013, 10:30 AM
Replies: 8
Views: 152
Posted By jgibbons1
Unfortunately, I don't think Biopieces takes...

Unfortunately, I don't think Biopieces takes compressed fastq files as an input option (with the read_fastq option).
Forum: Bioinformatics 05-07-2013, 10:27 AM
Replies: 8
Views: 152
Posted By jgibbons1
Ahhh...biopieces...hadn't thought about that. ...

Ahhh...biopieces...hadn't thought about that.

Thanks
Forum: Bioinformatics 05-07-2013, 10:21 AM
Replies: 8
Views: 152
Posted By jgibbons1
I thought about FastQC, but all really want is...

I thought about FastQC, but all really want is Number of Reads and base count or average read length...FastQC does a lot more (which is great) at the cost of time.

It may be my best alternative...
Forum: Bioinformatics 05-07-2013, 09:34 AM
Replies: 8
Views: 152
Posted By jgibbons1
fastq.gz stats READ-COUNT BASE-COUNT

Hi all,

Before I try writing some inefficient code, I was wondering if anyone has a script to count read number and total base count (or average read length) from a gzipped fastq file or even...
Forum: Bioinformatics 05-03-2013, 11:25 AM
Replies: 2
Views: 182
Posted By jgibbons1
For the KMER and GC plots, things start to...

For the KMER and GC plots, things start to normalize around 10 bp you may want to trim the 1st 10 bp.

As a rule of thumb I usually do the following QC and THEN run my data through FastQC:
(1)...
Forum: Bioinformatics 05-01-2013, 08:28 AM
Replies: 16
Views: 360
Posted By jgibbons1
I couldn't agree more...

I couldn't agree more...
Forum: Bioinformatics 05-01-2013, 08:13 AM
Replies: 16
Views: 360
Posted By jgibbons1
SOAP2 for example has great accuracy in calling...

SOAP2 for example has great accuracy in calling SNPs but can not confidently call indels larger than 6bp (see link to the Ruffalo et al. 2011 paper above). Novoalign, BWA and Bowtie are better at...
Forum: Bioinformatics 05-01-2013, 07:59 AM
Replies: 16
Views: 360
Posted By jgibbons1
I should say that I typically use Bowtie2 and BWA.

I should say that I typically use Bowtie2 and BWA.
Forum: Bioinformatics 05-01-2013, 07:58 AM
Replies: 16
Views: 360
Posted By jgibbons1
You may want to check out Heng Li's (BWA creator)...

You may want to check out Heng Li's (BWA creator) ROC curve benchmark on read mapping software:

http://lh3lh3.users.sourceforge.net/alnROC.shtml

One of the benefits of BWA, Bowtie2 and SOAP2 is...
Forum: Bioinformatics 04-24-2013, 06:05 AM
Replies: 3
Views: 206
Posted By jgibbons1
@ExMachina I would much rather have very few...

@ExMachina I would much rather have very few false positives even at the cost of losing true positives. I'm trying to get a reliable subset of single copy genes to calculate some statistics with....
Forum: Bioinformatics 04-23-2013, 08:13 AM
Replies: 3
Views: 206
Posted By jgibbons1
Identifying Single Copy Human Genes

Hi all,

I am trying to come up with a conservative list of single copy human genes -- i.e. I want to exclude all genes that may be part of a larger gene family/have highly similar sequence...
Forum: Illumina/Solexa 04-17-2013, 01:28 PM
Replies: 4
Views: 243
Posted By jgibbons1
I would echo mastal's recommendations and also...

I would echo mastal's recommendations and also add SOAPdenovo-Trans (http://soap.genomics.org.cn/SOAPdenovo-Trans.html)
Forum: Bioinformatics 04-03-2013, 09:10 AM
Replies: 9
Views: 297
Posted By jgibbons1
For anyone still interested... For...

For anyone still interested...

For simplicity, I ended up using Bioawk (https://github.com/lh3/bioawk):

Get the mean Phred quality score from FASTQ:
awk -c fastx '{ print ">"$name; print...
Forum: Bioinformatics 03-25-2013, 10:55 AM
Replies: 9
Views: 297
Posted By jgibbons1
Thanks to everyone's suggestions! I will try...

Thanks to everyone's suggestions!

I will try a few of these and let you know what I ended up doing.
Forum: Bioinformatics 03-25-2013, 08:14 AM
Replies: 9
Views: 297
Posted By jgibbons1
I'm trying to integrate this value into a larger...

I'm trying to integrate this value into a larger analysis pipeline. Automating it would be more efficient.
Forum: Bioinformatics 03-25-2013, 07:28 AM
Replies: 9
Views: 297
Posted By jgibbons1
Thanks @JackieBadger Do you know of any...

Thanks @JackieBadger

Do you know of any unix/linux based tools to do this? I'm trying to stay away from galaxy for this tidbit.
Forum: Bioinformatics 03-25-2013, 05:32 AM
Replies: 9
Views: 297
Posted By jgibbons1
average quality score for fastq file

Hello,
Simple question here, but I've had a problem finding a program to do it.

Given a fastq file, is there software that will calculate the average quality score for the entire file?

Thanks
Forum: Bioinformatics 03-21-2013, 01:02 PM
Replies: 6
Views: 356
Posted By jgibbons1
I would second that -- map reads against a panel...

I would second that -- map reads against a panel of Y-chromosome genes/exons.
Forum: RNA Sequencing 03-20-2013, 06:46 AM
Replies: 3
Views: 360
Posted By jgibbons1
I use the fastx_collapster to remove duplicates. ...

I use the fastx_collapster to remove duplicates.

http://hannonlab.cshl.edu/fastx_toolkit/commandline.html#fastx_collapser_usage

If you get a buggy error message about quality scores plug the...
Forum: Bioinformatics 03-19-2013, 09:08 AM
Replies: 12
Views: 392
Posted By jgibbons1
This is fantastic. Thanks for sharing. I'm sure...

This is fantastic. Thanks for sharing. I'm sure it will be faster than my pipeline (which takes about 6 hours from the data set I described).
Forum: Introductions 03-19-2013, 06:18 AM
Replies: 2
Views: 678
Posted By jgibbons1
Hi vaibhavvsk, What is your read size? Is this...

Hi vaibhavvsk,
What is your read size? Is this paired end or single end? What kind of data is this (Illumina, 454, pacbio etc.)?

You may find this blog post helpful:...
Forum: Bioinformatics 03-19-2013, 06:13 AM
Replies: 12
Views: 392
Posted By jgibbons1
Hi Maria, Sorry -- I meant to reply earlier....

Hi Maria,
Sorry -- I meant to reply earlier. Yes, the script works fine just with the command line perl statement. Luckily, I am not doing this on the whole genome scale. In brief, I'm intereste din...
Forum: Bioinformatics 03-18-2013, 09:03 AM
Replies: 12
Views: 392
Posted By jgibbons1
Ahh...that explains it! Thank you Maria. Is...

Ahh...that explains it! Thank you Maria.

Is there a simple way to modify the script so that I can corretc this?

Thanks...and apologies for my naivety
Showing results 1 to 25 of 99

 


All times are GMT -8. The time now is 12:38 PM.


Powered by vBulletin® Version 3.8.6
Copyright ©2000 - 2013, Jelsoft Enterprises Ltd.