![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
NA gene expression values in TCGA RNA_Seq Level3 data | mrfox | Bioinformatics | 0 | 04-14-2015 12:37 PM |
RNA_seq on (non-model) less well annotated genomes | Drerio | General | 5 | 06-11-2012 12:46 AM |
EdgeR compare 3 samples (RNA_Seq) | Palgrave | Bioinformatics | 1 | 05-28-2012 12:39 AM |
Sample quality for RNA_SEQ | cub103 | Sample Prep / Library Generation | 3 | 08-02-2010 07:26 AM |
% matching problem- RNA_Seq | sanush | RNA Sequencing | 1 | 11-29-2009 10:58 PM |
![]() |
|
Thread Tools |
![]() |
#1 |
Junior Member
Location: newark, NJ Join Date: Jun 2015
Posts: 3
|
![]()
Hi to everyone,
I am new member to this forum. I have 100bp single read illumina fastq files. When we looked at the reads we saw some interesting sequences. We want to separate those reads and write it in separate fastq file for analysis. For example we want to separate "ATTTTTTTTAGAAAAAAAA" containing reads (we saw something around 2million reads out of 9million reads). Can you please give me guidance how to do it. IF there is program or any unix commands will be helpful. I am not a unix person. please give me commands to execute. Thanks a lot. Mayil |
![]() |
![]() |
![]() |
#2 |
Senior Member
Location: East Coast USA Join Date: Feb 2008
Posts: 7,080
|
![]()
bbduk.sh from BBMap package can do this. If that sequence is at the end of the reads then,
Code:
$ bbduk.sh -Xmx1g in=reads.fq outm=matched.fq outu=unmatched.fq restrictleft=19 k=19 literal=ATTTTTTTTAGAAAAAAAA So you could bin all the reads with your known sequence, then look at the remaining reads to see what they have in common. You can do the same thing with the tail of the read using "restrictright" instead, though you can't use both restrictions at the same time. |
![]() |
![]() |
![]() |
#3 |
Junior Member
Location: newark, NJ Join Date: Jun 2015
Posts: 3
|
![]()
Hi,
Thanks. The sequence will be either in the middle or in the end. How to separate if the interested sequence is in the middle. Thanks again Mayil |
![]() |
![]() |
![]() |
#4 |
Senior Member
Location: East Coast USA Join Date: Feb 2008
Posts: 7,080
|
![]()
Just remove the "restrictleft/right" directive and the entire sequence will be searched.
|
![]() |
![]() |
![]() |
#5 |
Junior Member
Location: newark, NJ Join Date: Jun 2015
Posts: 3
|
![]()
Hi,
Thanks a lot. I will try the commands you have given to me. Thanks again and have a nice weekend. Mayil |
![]() |
![]() |
![]() |
Thread Tools | |
|
|