Originally posted by bvk
View Post
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Originally posted by GenoMax View PostSame command. You should use the rRNA GTF file instead of the original one. Adjust output file names if you need to keep those results around.
featureCounts -p -t exon -g gene_id -a hg19_rRNA -o MCF10A_BaP_pool1_counts.txt MCF10A_BaP_pool1_R1_postTrimming/accepted_hits.bam
//========================== featureCounts setting ===========================\\
|| ||
|| Input files : 1 BAM file ||
|| P MCF10A_BaP_pool1_R1_postTrimming/accepted_ ... ||
|| ||
|| Output file : MCF10A_BaP_pool1_counts.txt ||
|| Annotations : hg19_rRNA (GTF) ||
|| ||
|| Threads : 1 ||
|| Level : meta-feature level ||
|| Paired-end : yes ||
|| Strand specific : no ||
|| Multimapping reads : not counted ||
|| Multi-overlapping reads : not counted ||
|| Read orientations : fr ||
|| ||
|| Chimeric reads : counted ||
|| Both ends mapped : not required ||
|| ||
\\===================== http://subread.sourceforge.net/ ======================//
//================================= Running ==================================\\
|| ||
|| Load annotation file hg19_rRNA ... ||
|| Features : 1769 ||
|| Meta-features : 3 ||
|| Chromosomes/contigs : 38 ||
|| ||
|| Process BAM file MCF10A_BaP_pool1_R1_postTrimming/accepted_hits.bam... ||
|| Paired-end reads are included. ||
|| Assign fragments (read pairs) to features... ||
|| ||
|| WARNING: reads from the same pair were found not adjacent to each ||
|| other in the input (due to read sorting by location or ||
|| reporting of multi-mapping read pairs). ||
|| ||
|| Read re-ordering is performed. ||
|| ||
|| Total fragments : 56783755 ||
|| Successfully assigned fragments : 246578 (0.4%) ||
|| Running time : 3.98 minutes ||
|| ||
|| Read assignment finished. ||
|| ||
\\===================== http://subread.sourceforge.net/ ======================//
Thank you
Comment
-
Provided the input files were correct this is the relevant bit
Successfully assigned fragments : 246578 (0.4%)
There is also this warning to keep in mind
|| WARNING: reads from the same pair were found not adjacent to each ||
|| other in the input (due to read sorting by location or ||
|| reporting of multi-mapping read pairs). ||Last edited by GenoMax; 04-20-2016, 04:58 AM.
Comment
-
Originally posted by GenoMax View PostProvided the input files were correct this is the relevant bit
That would mean you have 246578 reads (each from R1/R2 files) that are aligning to rRNA (if that is the only thing in your GTF file). So the ribo-depletion appears to have worked reasonably well. Does not help you if you actually were interested in rRNA.
There is also this warning to keep in mind
|| WARNING: reads from the same pair were found not adjacent to each ||
|| other in the input (due to read sorting by location or ||
|| reporting of multi-mapping read pairs). ||
what do you mean here (Does not help you if you actually were interested in rRNA.) I didn't get this. There are 246578 read pairs aligned to rRNA regions. Could you please tell clearly about the warning. And finally is this right or not? If not how can I do now?Last edited by bvk; 04-20-2016, 05:24 AM.
Comment
-
Ribodepletion has worked well (only 0.4% reads align to rRNA).
Fragments that go into a library have a size distribution and generally it is possible to infer the size of the fragment based on the alignments to a reference. The warning indicates that in some cases the reads were found to be not at the expected distance from each other. This is to be expected since you are looking at a repeat region and the annotation is not perfect.
You have said you (or the person who did the experiment, if you are only analyzing the data) are interested in rRNA biology yet the samples are rRNA depleted. So those two observations are contradictory. Aside from this we don't have any information about what this experiment was for (e.g. were you only interested in finding expression levels of genes but not differential expression or are there groups of samples that need to be analyzed for differential expression) so only you have an idea as to what is to be done next.
Comment
-
Originally posted by GenoMax View PostRibodepletion has worked well (only 0.4% reads align to rRNA).
Fragments that go into a library have a size distribution and generally it is possible to infer the size of the fragment based on the alignments to a reference. The warning indicates that in some cases the reads were found to be not at the expected distance from each other. This is to be expected since you are looking at a repeat region and the annotation is not perfect.
You have said you (or the person who did the experiment, if you are only analyzing the data) are interested in rRNA biology yet the samples are rRNA depleted. So those two observations are contradictory. Aside from this we don't have any information about what this experiment was for (e.g. were you only interested in finding expression levels of genes but not differential expression or are there groups of samples that need to be analyzed for differential expression) so only you have an idea as to what is to be done next.
And if I need reads that are aligned to intron, exon or some other regions it should be the same arguments I need to use but with different gtf file which has exons and introns. Is it right?
Comment
-
Originally posted by bvk View PostAnd if I need reads that are aligned to intron, exon or some other regions it should be the same arguments I need to use but with different gtf file which has exons and introns. Is it right?
I hope you are making an effort to understand the finer points of the programs you are running (specially since you are going to hand the results off to someone else). Getting a program to produce an output does not always mean that the output is correct or logical. This is where an analyst like you has to use their expertise/judgement.
Comment
-
Originally posted by GenoMax View PostYou have already done that (before you used the rRNA GTF file). If you need read summarization at feature level you will need to look at the featureCount options again (one of which would be removal of the -g gene_id option).
I hope you are making an effort to understand the finer points of the programs you are running (specially since you are going to hand the results off to someone else). Getting a program to produce an output does not always mean that the output is correct or logical. This is where an analyst like you has to use their expertise/judgement.
Comment
Latest Articles
Collapse
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Today, 11:49 AM
|
0 responses
12 views
0 likes
|
Last Post
by seqadmin
Today, 11:49 AM
|
||
Started by seqadmin, Yesterday, 08:47 AM
|
0 responses
16 views
0 likes
|
Last Post
by seqadmin
Yesterday, 08:47 AM
|
||
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
61 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
60 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|
Comment