Hi,
I am using cutadapt to remove adapter sequences. I have two adapter sequences:
RNA 5' Adapter (RA5)
5' GUUCAGAGUUCUACAGUCCGACGAUC
RNA 3' Adapter (RA3)
5' TGGAATTCTCGGGTGCCAAGG
The first is the 5' adapter and the second is the 3' adapter.
I am using the following command line to remove the adapter sequences:
cutadapt -a TGGAATTCTCGGGTGCCAAGG -g GUUCAGAGUUCUACAGUCCGACGAUC input.fastq > output.fastq
The length distribution I get:
Mean sequence length: 32.49 ± 10.53 bp
Minimum length: 16 bp
Maximum length: 51 bp
Length range: 36 bp
Mode length: 51 bp with 2,852,626 sequences
I then noticed that the 5' adapter is written with U instead of T. Will that be fine?
I tried replacing U with T (GUUCAGAGUUCUACAGUCCGACGAUC -> GTTCAGAGTTCTACAGTCCGACGATC) and ran the adapter removal again:
cutadapt -a TGGAATTCTCGGGTGCCAAGG -g GTTCAGAGTTCTACAGTCCGACGATC input.fastq > output.fastq
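To avoid typos when doing the U-to-T replacement by hand, the conversion can be scripted. This is only a minimal Python sketch; the function name rna_to_dna is made up for illustration and is not part of cutadapt.

```python
def rna_to_dna(seq: str) -> str:
    """Return the sequence with every U replaced by T, preserving case."""
    return seq.translate(str.maketrans("Uu", "Tt"))


# RA5 adapter as given in the RNA alphabet
ra5_rna = "GUUCAGAGUUCUACAGUCCGACGAUC"
ra5_dna = rna_to_dna(ra5_rna)
print(ra5_dna)  # GTTCAGAGTTCTACAGTCCGACGATC
```

The printed string is the same T version that I passed to -g in the second command.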
The length distribution I get:
Mean sequence length: 31.26 ± 11.29 bp
Minimum length: 1 bp
Maximum length: 51 bp
Length range: 51 bp
Mode length: 51 bp with 2,805,271 sequences
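For anyone who wants to reproduce these summary numbers from a trimmed FASTQ file, here is a rough Python sketch. It assumes plain 4-line FASTQ records, and it assumes "length range" means max - min + 1, which is what the reported values suggest (51 - 16 + 1 = 36 and 51 - 1 + 1 = 51). The function names are hypothetical and not taken from any tool.

```python
from collections import Counter
from statistics import mean, stdev


def read_lengths(fastq_lines):
    """Yield the sequence length of each record, assuming 4-line FASTQ."""
    for i, line in enumerate(fastq_lines):
        if i % 4 == 1:  # the second line of every record is the sequence
            yield len(line.strip())


def summarize(lengths):
    """Summarize read lengths: mean, sd, min, max, range, mode."""
    lengths = list(lengths)
    mode_len, mode_count = Counter(lengths).most_common(1)[0]
    return {
        "mean": mean(lengths),
        "sd": stdev(lengths) if len(lengths) > 1 else 0.0,
        "min": min(lengths),
        "max": max(lengths),
        # Assumption: range is counted inclusively (max - min + 1)
        "range": max(lengths) - min(lengths) + 1,
        "mode": mode_len,
        "mode_count": mode_count,
    }


# Tiny in-memory example; for a real file, pass an open file handle instead
demo = ["@r1", "ACGT", "+", "IIII",
        "@r2", "ACGTAC", "+", "IIIIII",
        "@r3", "ACGT", "+", "IIII"]
print(summarize(read_lengths(demo)))
```

For a real run, something like `with open("output.fastq") as fh: print(summarize(read_lengths(fh)))` should work for an uncompressed file.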
The two runs give different length distributions. Which one should I choose?
But first, is the command I am using correct?
I am working on small RNA seq data.
Kindly let me know.
Thanks in advance.
Regards
Vishwesh