
  • read count with multicov

    Hi,

    I am doing a gene expression analysis. I have raw RNA-seq (microRNA) libraries for 2 conditions with 11 replicates each, and I used multicov to obtain the read counts:

    bedtools multicov -bams SRR1054203.gz.segemehl.sam.bam.sorted.bam.bam SRR1054204.gz.cutadapt204.cutadapt.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054205.gz.cutadapt205.cutadapt.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054206.gz.cutadapt206.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054207.gz.cutadapt207.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054208.gz.cutadapt208.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054209.gz.cutadapt209.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054210.gz.cutadapt210.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054211.gz.cutadapt211.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054212.gz.cutadapt212.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054218.gz.cutadapt218.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054217.gz.cutadapt217.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054213.gz.cutadapt213.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054214.gz.cutadapt214.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054215.gz.cutadapt215.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054219.gz.cutadapt219.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054220.gz.cutadapt220.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054221.gz.cutadapt221.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054222.gz.cutadapt222.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054223.gz.cutadapt223.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054224.gz.cutadapt224.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054225.gz.cutadapt225.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054226.gz.cutadapt226.fastq.segemehl.sam.bam.sorted.bam.bam SRR1054216.gz.cutadapt216.fastq.segemehl.sam.bam.sorted.bam.bam -bed results.out > conteo_mature_genome2

    The result:

    chr20 62550849 62550871 hsa-mir-941-1 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37
    chr20 62550905 62550927 hsa-mir-941-2 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37
    chr20 62550961 62550983 hsa-mir-941-2 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37
    chr20 62551156 62551178 hsa-mir-941-2 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37
    chr20 62551268 62551290 hsa-mir-941-2 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37
    chr20 62550905 62550927 hsa-mir-941-3 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37
    chr20 62550961 62550983 hsa-mir-941-3 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37
    chr20 62551156 62551178 hsa-mir-941-3 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37
    chr20 62551268 62551290 hsa-mir-941-3 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37
    chr20 62550905 62550927 hsa-mir-941-4 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37
    chr20 62550961 62550983 hsa-mir-941-4 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37
    chr20 62551156 62551178 hsa-mir-941-4 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37
    chr20 62551268 62551290 hsa-mir-941-4 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37
    chr20 62550905 62550927 hsa-mir-941-5 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37
    chr20 62550961 62550983 hsa-mir-941-5 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37
    chr20 62551156 62551178 hsa-mir-941-5 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37
    chr20 62551268 62551290 hsa-mir-941-5 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12 42 23 41 40 31 28 35 584 45 49 14 53 198 72 7 43 54 37 93 44 37

    For example, the mature hsa-miR-941 (MIMAT0004984) appears several times under different hairpin names. Some of these rows share the same coordinates, and all of them have the same read counts.

    Is there an option to use a window that aggregates nearby positions, so that reads starting a few nucleotides apart but belonging to the same mature microRNA are merged and only one microRNA ID is reported?

    I checked the bedtools manual and found windowBed, but it does not work with multicov. Any ideas? I need to solve this because edgeR and DESeq do not accept this table with repeated IDs.

    Thanks very much

    regards

    Adriana

  • #2
    You can download a file containing only the mature miRNAs from miRBase.
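
If the duplicated rows remain after switching annotations, one possible workaround is to collapse the multicov table to one row per mature accession before loading it into edgeR or DESeq. The sketch below is only an illustration and not a bedtools feature: the function name `collapse_by_mature_id`, the choice of field 6 (0-based index 5, the MIMAT accession) as the key, and the per-sample maximum as the merge rule are all assumptions. The maximum is used because the rows in the question carry identical counts, which suggests the same reads are being counted at every locus; summing would multiply them. The example rows are shortened to three count columns.

```python
from collections import OrderedDict


def collapse_by_mature_id(lines, n_samples, id_field=5):
    """Collapse multicov rows that share a mature miRNA accession.

    Each row is split on whitespace. The last `n_samples` fields are read
    as counts and field `id_field` (0-based) as the mature accession.
    For each sample, the maximum count across duplicated rows is kept
    (an assumed merge rule, see the text above).
    """
    collapsed = OrderedDict()
    for line in lines:
        fields = line.split()
        if len(fields) < n_samples + id_field + 1:
            continue  # skip blank or malformed lines
        key = fields[id_field]
        counts = [int(x) for x in fields[-n_samples:]]
        if key in collapsed:
            collapsed[key] = [max(a, b) for a, b in zip(collapsed[key], counts)]
        else:
            collapsed[key] = counts
    return collapsed


# Two rows from the question, abbreviated to the first three count columns.
rows = [
    "chr20 62550849 62550871 hsa-mir-941-1 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12",
    "chr20 62550905 62550927 hsa-mir-941-2 hsa-miR-941 MIMAT0004984 Homo sapiens miR-941 17 7 12",
]

table = collapse_by_mature_id(rows, n_samples=3)
for mature_id, counts in table.items():
    print(mature_id, *counts, sep="\t")
```

For the full table, `n_samples` would be the number of BAM files passed to `-bams`. Whether the maximum, the sum, or the first row is the right merge rule depends on how the aligner reported multi-mapped reads, so it is worth checking that before trusting the collapsed counts.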
