
  • Finding the base-pair positions that differ between two FASTA files

    Hello,
    I am just a beginner in Perl.
    I have two FASTA files of different lengths.
    I would like to align them to find the nucleotide positions at which they differ.

    The output should look like this:
    Total length of fasta files
    First reference file: 1253630 base pair
    Second file: 4523366 base pair
    If the second file matches the first (reference) file:
    Match position at base pair
    a-t 455222
    c-g 455665
    If it does not match, the output should look like this:
    Mismatch position at base pair
    A-C 100025
    C-T 600045
    The result should be written to Output.txt.

    I tried the code given below:

    use strict;
    use warnings;

    my $file1 = 'chr20.txt';
    my $file2 = 'chr21.txt';
    my $error = 'error.txt';

    open(my $in1, '<', $file1) or die "Cannot open file '$file1' for reading: $!";
    open(my $in2, '<', $file2) or die "Cannot open file '$file2' for reading: $!";
    open(my $out, '>', $error) or die "Cannot open file '$error' for writing: $!";

    # Compare the two files line by line and log every line that differs.
    my $lineno = 1;
    while (my $line1 = <$in1>) {
        my $line2 = <$in2>;
        $line2 = '' unless defined $line2;    # the second file ended first
        print "$. : $line2,$line1,";
        printf $out "Error:lineno:%d mismatch found \n", $lineno
            unless $line1 eq $line2;
        ++$lineno;
    }

    close $out or die "Cannot close file '$error': $!";
    close $in2 or die "Cannot close file '$file2': $!";
    close $in1 or die "Cannot close file '$file1': $!";

  • #2
    Can anybody help me sort out the problem?
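
The code in the first post compares whole lines, so it can only report which line differs, not which base. A minimal sketch of a per-base comparison is given below, under several assumptions: each file holds a single FASTA record (only the first record is read), the sequences fit in memory, and the comparison is a plain position-by-position check over the shared length, not a true alignment. Because the two files differ in length, any insertion or deletion would shift every later position, and a dedicated aligner would be needed for that case. The subroutine names `read_fasta_sequence`, `find_mismatches` and `write_report` are hypothetical, and only mismatches are listed, since listing every matching position would produce millions of lines.

```perl
use strict;
use warnings;

# Read the first record of a FASTA file and return its sequence as a single
# uppercase string, with the header line and line breaks removed.
sub read_fasta_sequence {
    my ($path) = @_;
    open(my $in, '<', $path) or die "Cannot open file '$path' for reading: $!";
    my $seq     = '';
    my $records = 0;
    while (my $line = <$in>) {
        $line =~ s/\s+\z//;            # strip the newline (also handles CRLF)
        if ($line =~ /^>/) {
            last if ++$records > 1;    # stop at the second header, if any
            next;
        }
        $seq .= uc $line;
    }
    close $in or die "Cannot close file '$path': $!";
    return $seq;
}

# Compare two sequences base by base over their shared length.
# Returns a list of [1-based position, reference base, other base].
sub find_mismatches {
    my ($ref, $other) = @_;
    my $shared = length($ref) < length($other) ? length($ref) : length($other);
    my @mismatches;
    for my $i (0 .. $shared - 1) {
        my $ref_base   = substr($ref,   $i, 1);
        my $other_base = substr($other, $i, 1);
        push @mismatches, [ $i + 1, $ref_base, $other_base ]
            if $ref_base ne $other_base;
    }
    return @mismatches;
}

# Write the lengths and the mismatch list in the layout asked for in the post.
sub write_report {
    my ($ref_path, $other_path, $out_path) = @_;
    my $ref   = read_fasta_sequence($ref_path);
    my $other = read_fasta_sequence($other_path);
    open(my $out, '>', $out_path) or die "Cannot open file '$out_path' for writing: $!";
    print  $out "Total length of fasta files\n";
    printf $out "First reference file: %d base pair\n", length($ref);
    printf $out "Second file: %d base pair\n", length($other);
    print  $out "Mismatch position at base pair\n";
    for my $m (find_mismatches($ref, $other)) {
        my ($pos, $ref_base, $other_base) = @$m;
        printf $out "%s-%s %d\n", $ref_base, $other_base, $pos;
    }
    close $out or die "Cannot close file '$out_path': $!";
    return;
}

# Runs only when three arguments are given: reference, second file, report.
write_report(@ARGV) if @ARGV == 3;
```

Saved as, say, `compare_fasta.pl` (a hypothetical name), the sketch could be run as `perl compare_fasta.pl chr20.txt chr21.txt Output.txt`. Positions past the end of the shorter sequence are not reported.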
