Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Combining fasta file and bed file into a different file using biopython,python

    hi, I want to create a file using biopython or python or java .for each bed file range, the particular ATGC sequence need to be retrieved and all the result is to be put in this file.
    for example, bed file contains like this:

    chr1 14362 14829 chr1:14363-14829:WASH5P 467 +
    chr1 14969 15038 chr1:14970-15038:WASH5P 69 +
    chr1 15795 15947 chr1:15796-15947:WASH5P 152 +
    chr2 14362 14829 chr2:14363-14829:WASHP 467 +
    chr3 14969 15038 chr3:14970-15038:WASH 69 +
    chr10 15795 15947 chr10:15796-15947:WASHOP 152 +
    ..........................................................................................
    ........................................................................................

    and fasta file contains like this:

    >chr1 dna:chromosome chromosome:GRCh37:1:1:249250621:1
    NNNNNGCCAAGTnggggctaaNNNNGGGGCCCCCCCCCCCCCcCCC

    >chr2 dna:chromosome chromosome:GRCh37:1:1:249250621:1
    NNNNNGCCAAGNNNNGCCAAGT
    nggggctaaNNNNGCCAAGT
    nggggctaaNNNNGCCAAGT
    nggggctaa

    >chr3 dna:chromosome chromosome:GRCh37:1:1:249250621:1
    AGTACNNNNGCCAAGT
    nggggctaaNNNNGCCAAGT
    nggggctaa
    ................................................
    ...................................

    Now I want the result file like the following:

    chr1 14362 14829 chr1:14363-14829:WASH5P 467 +
    ATTTGCCCCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
    ATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGGGGGGGGGGGGGGGGGGGG
    chr1 14969 15038 chr1:14970-15038:WASH5P 69 +
    NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNATTTTTTTTTT
    AAAAAAAAAAAATTTTTTTTTTTTTTTTTTTTTt

    itmeans for a particular range I have to retrieve the particular fasta sequence and put them into the file
    I did it in java but it is taking very long time...may be I am doing some mistake please help me such that I can use some library function and run the program easily

Latest Articles

Collapse

  • seqadmin
    Essential Discoveries and Tools in Epitranscriptomics
    by seqadmin


    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
    Yesterday, 07:01 AM
  • seqadmin
    Current Approaches to Protein Sequencing
    by seqadmin


    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
    04-04-2024, 04:25 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 04-11-2024, 12:08 PM
0 responses
39 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 10:19 PM
0 responses
41 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 09:21 AM
0 responses
35 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-04-2024, 09:00 AM
0 responses
55 views
0 likes
Last Post seqadmin  
Working...
X