Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #16
    Here you go. You had a mistake in your code, you took the third column as your contig in this line;

    my $contig = $s[2];

    while the contig was in the second column.

    In addition, since you only have a subset of your data, you'll get these warnings. I removed the use warnings line. Run the script as;

    perl script.pl Sample_aligned_reads.txt sample_contig.txt

    Code:
    #!usr/bin/perl-w
    use strict;
    
    open(SAM,$ARGV[0]);
    my %hash = ();
    while(<SAM>){
      chomp;
      next if($_ =~ /^@/); ## remove the headers in sam file
      #split the line and obtain the read and contig
      my ($read,$contig,$sequence) =split;
    #split the read on the '|' character, to obtain the weight
      my (undef, $weight) = split(/\|/,$read);
    #save the total number of reads and clusters in the hash
      $hash{$contig}{'clusters'}++;
      $hash{$contig}{'total'}+=$weight;
    }
    close SAM;
    
    
    open(CTG,$ARGV[1]);
    my ($contigSeq,$prevhead) = ("","");
    while(<CTG>){
      chomp;
      $contigSeq.= $_ if(eof(CTG));
      if (/\>(\S+)/ || eof(CTG)){
         my $head=$1;
         if($contigSeq ne ''){
           #$contigSeq is the contig sequence, $prevhead is your contig
           my $len = length($contigSeq);
           #Now print the results
           print "$prevhead\t$len\t$hash{$prevhead}{'clusters'}\t$hash{$prevhead}{'total'}\t$contigSeq\n" if(defined $hash{$prevhead});
         }
         $prevhead = $head;
         $contigSeq='';
      }else{
         $contigSeq .= $_;
      }
    }
    close CTG;

    Comment


    • #17
      Yes,I am sorry forgot to mention , I modified the data while extracting the columns of interest from original SAM file,but now I executed the script on complete data files.

      Again I will try with this modified script.

      Comment


      • #18
        Oh great! its working,thank you very much.

        Comment


        • #19
          Great, good luck with your analysis.

          Comment


          • #20
            Thank you once again.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM
            • seqadmin
              Strategies for Sequencing Challenging Samples
              by seqadmin


              Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
              03-22-2024, 06:39 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            30 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            32 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            28 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-04-2024, 09:00 AM
            0 responses
            53 views
            0 likes
            Last Post seqadmin  
            Working...
            X