Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #16
    Here you go. You had a mistake in your code, you took the third column as your contig in this line;

    my $contig = $s[2];

    while the contig was in the second column.

    In addition, since you only have a subset of your data, you'll get these warnings. I removed the use warnings line. Run the script as;

    perl script.pl Sample_aligned_reads.txt sample_contig.txt

    Code:
    #!usr/bin/perl-w
    use strict;
    
    open(SAM,$ARGV[0]);
    my %hash = ();
    while(<SAM>){
      chomp;
      next if($_ =~ /^@/); ## remove the headers in sam file
      #split the line and obtain the read and contig
      my ($read,$contig,$sequence) =split;
    #split the read on the '|' character, to obtain the weight
      my (undef, $weight) = split(/\|/,$read);
    #save the total number of reads and clusters in the hash
      $hash{$contig}{'clusters'}++;
      $hash{$contig}{'total'}+=$weight;
    }
    close SAM;
    
    
    open(CTG,$ARGV[1]);
    my ($contigSeq,$prevhead) = ("","");
    while(<CTG>){
      chomp;
      $contigSeq.= $_ if(eof(CTG));
      if (/\>(\S+)/ || eof(CTG)){
         my $head=$1;
         if($contigSeq ne ''){
           #$contigSeq is the contig sequence, $prevhead is your contig
           my $len = length($contigSeq);
           #Now print the results
           print "$prevhead\t$len\t$hash{$prevhead}{'clusters'}\t$hash{$prevhead}{'total'}\t$contigSeq\n" if(defined $hash{$prevhead});
         }
         $prevhead = $head;
         $contigSeq='';
      }else{
         $contigSeq .= $_;
      }
    }
    close CTG;

    Comment


    • #17
      Yes,I am sorry forgot to mention , I modified the data while extracting the columns of interest from original SAM file,but now I executed the script on complete data files.

      Again I will try with this modified script.

      Comment


      • #18
        Oh great! its working,thank you very much.

        Comment


        • #19
          Great, good luck with your analysis.

          Comment


          • #20
            Thank you once again.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin


              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
              Today, 07:01 AM
            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            37 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            41 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            35 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-04-2024, 09:00 AM
            0 responses
            54 views
            0 likes
            Last Post seqadmin  
            Working...
            X