Hi,
I am using blast+. I have formatted a nucleotide database using makeblastdb.
I am trying to extract sequences from a file containing a list of IDs.
Using the following:
When I do I get the same error for every ID I am searching for (here is an example):
My list of ids is in this format:
The fast file I made the database from looks like this
I have checked a few of my IDs manually and they are indeed in my database. Can anyone tell me what I am doing wrong? Or suggest another approach?
I am using blast+. I have formatted a nucleotide database using makeblastdb.
I am trying to extract sequences from a file containing a list of IDs.
Using the following:
Code:
/Users/wolniaklab/blast/programs/blastdbcmd -db /Users/wolniaklab/Desktop/search/seqs2 -dbtype nucl -entry_batch /Users/wolniaklab/Desktop/search/ids1.txt -out /Users/wolniaklab/Desktop/search/output.txt
Code:
Error: >lcl|comp9999_c1_seq11: OID not found
Code:
>lcl|comp10021_c0_seq1 >lcl|comp1002_c0_seq1 >lcl|comp10045_c0_seq13 >lcl|comp10045_c0_seq14 >lcl|comp10045_c0_seq19 >lcl|comp10045_c0_seq4 >lcl|comp10045_c0_seq4 >lcl|comp10049_c0_seq4 >lcl|comp10075_c0_seq13 >lcl|comp10075_c0_seq9 >lcl|comp100777_c0_seq1 >lcl|comp10082_c0_seq1
Code:
>lcl|comp11191_c0_seq1 len=589 path=[0:0-128 613:129-135 136:136-588] GTTCTATTGTATTGTTATCCATCTGAGGTTTTCTCTCTGCGTTTGTCTGTGCAGAATCTA GTGATCTCCCACAACATGATGTGGCCACCAGGGATGGAACAAAGCTGGTGAGAAGGGCCG ATATGGCTCGAAAAATTCCTCAATTCAAGATACTTTGATCCCTGCACCGAGCACCACTTC AACAAAAATGAGAAAAACCATTTCTGCATTTGTTGTAATGAAGGTCCTCACTCCCATCAC CAAACTCTCCAAGTCCGCCGGGCGTCCCATGCCAACTGTGTCCGGGTCGAAAACATCTCC TAGATTCTAGACATTTCTGGAATTCAAACCTACATCATCAACAACCATAAAATTGTCTTC CTCCAAAGGCAGGCCAATGTGAAGCAGATCATGTCAAGGTTGTTGATCAGTTCAACAGGA GGTCTCCATGTCTCTGCTAATGCCAAGCATTGCCATACCTGTGGAAGAGCTTTGTCCACT GATTTAATGAAGTTTTGCTCCATTAAATGCAAGCTTATGCCTACTTCTTTTAATTTTGTT TCTAGAATTTGAAACTCATTTTACTAAACTGGTTATATTTTGTTTTTAG >lcl|comp10877_c0_seq1 len=1212 path=[3176:0-121 3368:122-148 3395:149-192 3439:193-281 4481:282-332 3578:333-1211] AAAGCATGCCTAAGTCGATTTATTATTAATTTATTTAGTCGCTTTATTCTAACTATCCCG ACTCAAGCTTAACTAACGGTTCTACTATTCGATTTCCATCTCTAGGTTCGGTTTCTAACT CGTCTAACTCCCTCGCCTACGGAATTCATGACTTCGGTCATCGCTAACCTCGGCAACCCT CTACGTGAGTTTAGTCACCAACAGTGTCAAGTTCCGTCCAACAGCGTCAACATCCGTCCG ACCATCGATATCTATTCATCTCCGTTTAATCTATATCCTACTGTTATTAAACACATTTCC TATACTATCATGATGTGTCTTTGGGCTCTAGGGATCATATCTACCCACCTATCTAATCTG ATTGGGTCATCACTTATTAATATACTACAGTGAATCAAGGCTCATCTAGCCTATCTGTCC TCGGCTTACTATTCCGTCACCCAGAGTACCACCGAACGATGTCGGCCTATCCTCTAATCA TCCTATCAATCTACTATCACAAGGTGCATCAATTCTACGTCGTTCTATCCAATCGAATCC GGTCCATACCAATCTCAGTAGCTCCGACATTATTGACACTGTTAGGATCCCGTCGGTCAC GTCCGTTCGGCTTCACCTTCCCAGCCTTAGTTGCCAGGCCTTAATCTAATCCTAGCTCCT TATAATCTATATGGATTCTAGTCATATAACGCTAGGAAGATTAACGACTCCCGCTATTTA CTACCCGATCGGTACGTCATCACACTACTGCCAGTGTATTTCTATTGGAAACCCTAACTC CATTCTACTATGGTTAAATAAGAGTGGGTTCCTATGGATTAAAGCTCTAGTGTGCTCTTC CTATGGTACTCATATCTCCTTCCTAAATTACTTACTCAAACACCTCCTTAAGCCAAATTC TAGAGATATAATAAGTCAAATTCTATAGGGGTTTCTAACCAATTTAGTAGATCTATAACT TACTTATCCCATAGGTTTCTAACTTACAACTTAGTCCTATAGGGCTTGATTTATTATATA CAAGATAACTCACTCTATAAGCTTTGCTCACACATCATCTCACACCAATATATACCAAAA TAGCTCTCAAAAGGATTTGACTCAACACCCCTATGGGATATCATCTAAGTCATCTAATTT AACTAATATTTCTATTACATGGGCTAGAGTAGGTCTCTTTCAATCAATCATGCACCCATT CCAAAAGTCTAG >lcl|comp10877_c0_seq2 len=1160 path=[6037:0-34 11677:35-40 11683:41-46 1200:47-73 1227:74-108 1262:109-1159] CTCATAGAGAGATTCGTCATCTAGGGAACAATGCAAATGCACACTAAATGAGTTAATTAA ACATCCAATTATCACCATTAAGCAAGTCAAAATCAATCTAGAGCATTCCATGTGTATGCA TAAGTTGGAAGTTAGAAAACCTTACCTGGAAGCCCTTCTGAGTACCTTAAAAAACTATAA AAACTATCTAATCAAGGCAATTAATATAATCTCTAGAATTAATTGTAATTAGAAATCAAG CTTAAGTCCTAAATATAAAACTAGGGCAAATATAATTATAAGTTAATCCAAGTCCTTATC AAGTCCTAGTGAATCAAATTTTCAGTCAAGCTAAATCCTCAAAATTAAATATGGAATTAT GTCAAGGTCAAGGCTTAGTCAGCTTATAATGGTCCTAGGTCTAGTCTAAGTCCTAGGGAA AAAAAAGAAAGAAGAAAAAAACTAAAAAAACAAGTCAAAACTCATTATAGTGGAAAAATA
Comment