Help outputting only the alignments identical to the reference

Liam_Gallagher

Member

Join Date: Oct 2011

Posts: 18
#1

10-27-2017, 05:02 AM

Hello everyone,
I have a SAM file from a STAR alignment, containing reads aligned to a reference of small RNA sequences (I built the reference myself by downloading the FASTA sequences of interest and concatenating them).
Now I would like to keep only the reads that have the same length as the region they map to in the FASTA file (I also have a GFF file with the coordinates of every region).
I tried setting STAR to keep only the fully mapped reads, and it works... but I would like a SAM or BAM file containing only the reads that cover the mapped region completely and are neither longer nor shorter than it (I have already removed the adapters).
The final goal is to correctly count the reads overlapping each region of interest.
So far I have used these tools: cutadapt, STAR and HTSeq-count.
Could someone please help me (maybe with an awk function, a script, an algorithm, STAR options...)?
Thank you in advance!!

Cristian
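
The filter described in the post could be sketched roughly as follows in plain Python. This is only an illustration under several assumptions, not a tested pipeline: the function names (`load_regions`, `exact_span`, `filter_sam`) are made up for this example, the GFF is assumed to use the same sequence names as the SAM file, and a read is kept only if its CIGAR contains nothing but match operations (no clipping, insertions or deletions) and its start and end coincide exactly with a GFF region. Mismatches inside the read are not checked here.

```python
import re

# One CIGAR operation: a length followed by an operation letter.
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")


def load_regions(gff_lines):
    """Collect (seqid, start, end) tuples from GFF lines (1-based, inclusive)."""
    regions = set()
    for line in gff_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and GFF comments/directives
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 5:
            continue  # not a feature line
        regions.add((fields[0], int(fields[3]), int(fields[4])))
    return regions


def exact_span(cigar):
    """Return the aligned length if the CIGAR has only M/=/X operations, else None."""
    ops = CIGAR_OP.findall(cigar)
    if not ops or "".join(n + op for n, op in ops) != cigar:
        return None  # "*" or a malformed CIGAR
    if any(op not in "M=X" for _, op in ops):
        return None  # clipping, insertion, deletion or skipped region
    return sum(int(n) for n, _ in ops)


def filter_sam(sam_lines, regions):
    """Yield header lines plus the alignments that span a region exactly."""
    for line in sam_lines:
        if line.startswith("@"):
            yield line  # keep the SAM header unchanged
            continue
        fields = line.rstrip("\n").split("\t")
        flag, rname, pos, cigar = int(fields[1]), fields[2], int(fields[3]), fields[5]
        if flag & 4:
            continue  # unmapped read
        length = exact_span(cigar)
        # SAM POS is 1-based, so the last aligned base is pos + length - 1.
        if length is not None and (rname, pos, pos + length - 1) in regions:
            yield line
```

With hypothetical file names, it could be used as `sys.stdout.writelines(filter_sam(open("aligned.sam"), load_regions(open("regions.gff"))))` after `import sys`, with the output redirected to a new SAM file. Since only full-length matches survive, read length and region length are equal by construction.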
Tags: fasta, filtering reads, sam bam, smallrna, star aligner
