Программы пакета BLAST для работы с нуклеотидными последовательностями
formatdb -i pm_genome.fasta -p F -n pm
genpath=/home/export/samba/public/tmp
genomes="$genpath/xc_genome.fasta $genpath/st_genome.fasta $genpath/pm_genome.fasta"
formatdb -i "$genomes" -p F -n 3g
blsatall -p tblastn -d pm -i CAPP_ECOLI -o pm_search.txt
Поиск гомологов CAPP_ECOLI | Геном Pasteurella multocida | *Геном Xanthomonas campestris | *Геном Salmonella typhimurium | |
Число находок с Е-value<0,01 |
1 | |||
Поиск по трём геномам |
||||
Число находок с E-value<0.01 | 1 | 1 | 1 | |
E-value лучшей находки | 0.0 | e-110 | 0.0 | |
AC соответствующей записи EMBL | AE006090 | AE012175 | AE008892 | |
Характеристика лучшей находки |
||||
E-value лучшей находки | 0.0 | e-110 | 0.0 | |
координаты выравниваний в записи генома | 4204-6840 | 173-2836 | 15358-12710 | |
AC соответствующей записи EMBL | AE006090 | AE012175 | AE008892 | |
Координаты CDS в записи EMBL | complement(4201..6840) | (125..2839) | complement(12707..15358) | |
AC/ID UniProt, название кодирующего гена в записи EMBL | Q9CN89
CAPP_PASMU ген ppc |
Q8P336
CAPP_XANCP ген ppc |
Q8ZKM0
CAPP_SALTY ген ppc | |
Процент идентичности | 63% | 32% | 91% | |
Примечания | Все три лучшие находки относятся к белковому семейству PEPCase, являются фосфоенолпируват карбоксилазами, обладают одинаковыми функциями. |
blastall -p blastn -d 3g -i CAPP_gene1.fasta -o blastn_search.txtРезультат поиска - blastn_search.txt.
E-value лучшей находки | 0.0 |
Геном | Salmonella typhimurium |
Процент идентичности | 85% |
Соответствующий белок | CAPP_SALTY |
>AE008892 AE006468 |AE008892| Salmonella typhimurium LT2, section 196 of 220 of the complete genome. Length = 20385 Score = 2095 bits (1057), Expect = 0.0 Identities = 2170/2541 (85%) Strand = Plus / Minus Query: 1 atgaacgaacaatattccgcattgcgtagtaatgtcagtatgctcggcaaagtgctggga 60 |||||||||||||||||||| ||||||||||||||||||||||||||||| ||||||||| Sbjct: 15358 atgaacgaacaatattccgcgttgcgtagtaatgtcagtatgctcggcaaggtgctggga 15299 Query: 61 gaaaccatcaaggatgcgttgggagaacacattcttgaacgcgtagaaactatccgtaag 120 |||||||||||||||||| ||||||||||||||||||| ||||||||||| ||||||||| Sbjct: 15298 gaaaccatcaaggatgcgctgggagaacacattcttgatcgcgtagaaacaatccgtaag 15239 Query: 121 ttgtcgaaatcttcacgcgctggcaatgatgctaaccgccaggagttgctcaccacctta 180 | || |||||||||||||| |||||||| ||||| ||||||||| |||||||||| || Sbjct: 15238 ctatccaaatcttcacgcgccggcaatgaagctaatcgccaggagctgctcaccacgcta 15179 Query: 181 caaaatttgtcgaacgacgagctgctgcccgttgcgcgtgcgtttagtcagttcctgaac 240 ||||||||||| || |||||||||||||| |||||||| ||||||||||||||||||||| Sbjct: 15178 caaaatttgtctaatgacgagctgctgccagttgcgcgcgcgtttagtcagttcctgaac 15119 Query: 241 ctggccaacaccgccgagcaataccacagcatttcgccgaaaggcgaagctgccagcaac 300 |||||||| || |||||||||||||||||||||||||| ||||||||||| ||||||||| Sbjct: 15118 ctggccaatactgccgagcaataccacagcatttcgccaaaaggcgaagccgccagcaac 15059 Query: 301 ccggaagtgatcgcccgcaccctgcgtaaactgaaaaaccagccggaactgagcgaagac 360 ||||||||||| |||||||| ||||| |||||||||||||| ||||| || | ||| | Sbjct: 15058 ccggaagtgattgcccgcactctgcgcaaactgaaaaaccaaccggacctcaacgacgca 14999 Query: 361 accatcaaaaaagcagtggaatcgctgtcgctggaactggtcctcacggctcacccaacc 420 |||||||||||||| || || |||||||| ||||| |||| || || || || || || Sbjct: 14998 accatcaaaaaagcggtagagtcgctgtctctggagttggtgctaaccgcccatccgaca 14939 Query: 421 gaaattacccgtcgtacactgatccacaaaatggtggaagtgaacgcctgtttaaaacag 480 ||||||||||| ||||| || || |||||||||| ||| | ||| |||| |||||||| Sbjct: 14938 gaaattacccgccgtacgcttattcacaaaatgggtgaaatcaacaactgtctaaaacag 14879 Query: 481 ctcgataacaaagatatcgctgactacgaacacaaccagctgatgcgtcgcctgcgccag 540 || ||||| | |||||||| |||||||||| | ||||| | |||||||||||||||||| Sbjct: 14878 cttgataataccgatatcgccgactacgaacgccaccaggtaatgcgtcgcctgcgccag 14819 Query: 541 ttgatcgcccagtcatggcataccgatgaaatccgtaagctgcgtccaagcccggtagat 600 ||||| ||||| || |||||||| ||||| ||||| |||| ||||||||||||||| ||| Sbjct: 14818 ttgattgcccaatcctggcatacggatgagatccgcaagcagcgtccaagcccggtggat 14759 Query: 601 gaagccaaatggggctttgccgtagtggaaaacagcctgtggcaaggcgtaccaaattac 660 ||||||||||||||||| ||||| || || |||||||||||||| |||||||| || || Sbjct: 14758 gaagccaaatggggcttcgccgtggttgagaacagcctgtggcagggcgtacctaactat 14699 Query: 661 ctgcgcgaactgaacgaacaactggaagagaacctcggctacaaactgcccgtcgaattt 720 ||||| |||||||||||||| |||||||| || |||||||||||| |||| || || ||| Sbjct: 14698 ctgcgtgaactgaacgaacagctggaagaaaatctcggctacaaattgccggtggatttt 14639 Query: 721 gttccggtccgttttacttcgtggatgggcggcgaccgcgacggcaacccgaacgtcact 780 || ||||| |||||||| || ||||||||||||||||| ||||||||||||||||| || Sbjct: 14638 gtgccggtacgttttacctcctggatgggcggcgaccgtgacggcaacccgaacgtgacg 14579 Query: 781 gccgatatcacccgccacgtcctgctactcagccgctggaaagccaccgatttgttcctg 840 || ||||||||||||||||| || | | ||||||||||||||||||||| |||||||| Sbjct: 14578 gcggatatcacccgccacgtactcttgttaagccgctggaaagccaccgatctgttcctg 14519 Query: 841 aaagatattcaggtgctggtttctgaactgtcgatggttgaagcgacccctgaactgctg 900 ||||| ||||| || ||||| || |||||||||||||| || || || || || |||||| Sbjct: 14518 aaagacattcatgttctggtatcagaactgtcgatggtcgacgccacgccggagctgctg 14459 Query: 901 gcgctggttggcgaagaaggtgccgcagaaccgtatcgctatctgatgaaaaacctgcgt 960 ||| | || ||||||||||| || | || ||||||||||| ||||||||||| |||| Sbjct: 14458 gcgttagtgggcgaagaaggcgcgtctgagccgtatcgctacctgatgaaaaaattgcgc 14399 Query: 961 tctcgcctgatggcgacacaggcatggctggaagcgcgcctgaaaggcgaagaactgcca 1020 | || ||||||||||| ||| | |||||||||||||| ||||||||||| | ||||| Sbjct: 14398 gcccgtctgatggcgacccagtcctggctggaagcgcgtctgaaaggcgagaagctgccc 14339 Query: 1021 aaaccagaaggcctgctgacacaaaacgaagaactgtgggaaccgctctacgcttgctac 1080 ||||| | ||||||||||| |||||||| | || |||||||| || ||||| |||||| Sbjct: 14338 aaaccggctggcctgctgacgcaaaacgagcagctctgggaacctctgtacgcctgctac 14279 Query: 1081 cagtcacttcaggcgtgtggcatgggtattatcgccaacggcgatctgctcgacaccctg 1140 ||||| | ||||| || |||||||| ||||||||||||||||| | |||||||| || Sbjct: 14278 cagtcgttacaggcctgcggcatgggcattatcgccaacggcgagttactcgacacgctc 14219 Query: 1141 cgccgcgtgaaatgtttcggcgtaccgctggtccgtattgatatccgtcaggagagcacg 1200 ||||||||||| |||||||||||||||||||| |||||||||||||| ||||| || || Sbjct: 14218 cgccgcgtgaagtgtttcggcgtaccgctggtgcgtattgatatccgccaggaaagtacc 14159 Query: 1201 cgtcataccgaagcgctgggcgagctgacccgctacctcggtatcggcgactacgaaagc 1260 || ||||| |||||||||||||| | ||||||||||||||||| ||||||||||||||| Sbjct: 14158 cgccatactgaagcgctgggcgaaattacccgctacctcggtattggcgactacgaaagc 14099 Query: 1261 tggtcagaggccgacaaacaggcgttcctgatccgcgaactgaactccaaacgtccgctt 1320 ||||| || |||||||| ||||| ||||||||||||||||||||||||||||||||||| Sbjct: 14098 tggtcggaagccgacaagcaggccttcctgatccgcgaactgaactccaaacgtccgctg 14039 Query: 1321 ctgccgcgcaactggcaaccaagcgccgaaacgcgcgaagtgctcgatacctgccaggtg 1380 |||||||| |||||| | || ||| ||| || ||||||||||| || |||||| |||| Sbjct: 14038 ctgccgcgtaactgggagccgagcaacgatacccgcgaagtgcttgaaacctgcaaggtt 13979 Query: 1381 attgccgaagcaccgcaaggctccattgccgcctacgtgatctcgatggcgaaaacgccg 1440 ||||||||||| || |||| || || ||||||||||| || || ||||||||||||||| Sbjct: 13978 attgccgaagcgccaaaaggatcgatcgccgcctacgtaatttcaatggcgaaaacgccg 13919 Query: 1441 tccgacgtactggctgtccacctgctgctgaaagaagcgggtatcgggtttgcgatgccg 1500 || || || |||||||| || |||||||||||||| || |||||||| ||||| |||||| Sbjct: 13918 tctgatgtgctggctgtgcatctgctgctgaaagaggcaggtatcggctttgccatgccg 13859 Query: 1501 gttgctccgctgtttgaaaccctcgatgatctgaacaacgccaacgatgtcatgacccag 1560 || || ||||||||||||||||||||||| |||||||| ||| |||| || ||||||||| Sbjct: 13858 gtcgcgccgctgtttgaaaccctcgatgacctgaacaatgccgacgacgtgatgacccag 13799 Query: 1561 ctgctcaatattgactggtatcgtggcctgattcagggcaaacagatggtgatgattggc 1620 || | ||||| ||||||||||| || ||||||||||| || |||||||| ||||| ||| Sbjct: 13798 ttgttgaatatcgactggtatcgcggactgattcagggtaagcagatggtcatgatcggc 13739 Query: 1621 tattccgactcagcaaaagatgcgggagtgatggcagcttcctgggcgcaatatcaggca 1680 || |||||||| || ||||| || || || ||||| || || |||||||| |||||||| Sbjct: 13738 tactccgactcggcgaaagacgccggcgttatggccgcgtcatgggcgcagtatcaggcg 13679 Query: 1681 caggatgcattaatcaaaacctgcgaaaaagcgggtattgagctgacgttgttccacggt 1740 ||||| || | ||||||||||| |||||||| || || ||||| || | |||||||| Sbjct: 13678 caggacgccctgatcaaaacctgtgaaaaagccggcatcgagcttaccctcttccacggc 13619 Query: 1741 cgcggcggttccattggtcgcggcggcgcacctgctcatgcggcgctgctgtcacaaccg 1800 |||||||| || ||||| || |||||||| || || || ||||||||||| || |||||| Sbjct: 13618 cgcggcggatctattggccgtggcggcgcgccagcccacgcggcgctgctttcgcaaccg 13559 Query: 1801 ccaggaagcctgaaaggcggcctgcgcgtaaccgaacagggcgagatgatccgctttaaa 1860 ||||| || ||||||||||| |||||||| ||||| |||||||||||||||||||| ||| Sbjct: 13558 ccaggcagtctgaaaggcggtctgcgcgtgaccgagcagggcgagatgatccgcttcaaa 13499 Query: 1861 tatggtctgccagaaatcaccgtcagcagcctgtcgctttataccggggcgattctggaa 1920 || || ||||| ||| |||||||||||||||| ||||| || ||| | || ||||||||| Sbjct: 13498 tacggcctgccggaagtcaccgtcagcagcctctcgctctacaccagcgcaattctggaa 13439 Query: 1921 gccaacctgctgccaccgccggagccgaaagagagctggcgtcgcattatggatgaactg 1980 || ||||||||||| |||||||| |||||||| |||||||||| ||||||||||| || Sbjct: 13438 gcaaacctgctgccgccgccggaaccgaaagacagctggcgtcatattatggatgagctt 13379 Query: 1981 tcagtcatctcctgcgatgtctaccgcggctacgtacgtgaaaacaaagattttgtgcct 2040 || ||||||||||| || ||||||||||||||| || ||||| ||||| ||||| || Sbjct: 13378 tccgtcatctcctgtgaaacctaccgcggctacgtgcgcgaaaataaagactttgtaccg 13319 Query: 2041 tacttccgctccgctacgccggaacaagaactgggcaaactgccgttgggttcacgtccg 2100 |||||||||||||| |||||||| ||||| |||||||| ||||| | || ||||||||| Sbjct: 13318 tacttccgctccgcgacgccggagcaagagttgggcaaattgccgctcggctcacgtccg 13259 Query: 2101 gcgaaacgtcgcccaaccggcggcgtcgagtcactacgcgccattccgtggatcttcgcc 2160 ||||||||||| ||||| ||||||||||| || || ||||| |||||||||||||||||| Sbjct: 13258 gcgaaacgtcgtccaactggcggcgtcgaatcgctgcgcgcgattccgtggatcttcgcc 13199 Query: 2161 tggacgcaaaaccgtctgatgctccccgcctggctgggtgcaggtacggcgctgcaaaaa 2220 |||||||||||||| |||||||| || ||||||||||| || ||||| |||||||||||| Sbjct: 13198 tggacgcaaaaccgcctgatgctgccagcctggctgggcgcgggtactgcgctgcaaaaa 13139 Query: 2221 gtggtcgaagacggcaaacagagcgagctggaggctatgtgccgcgattggccattcttc 2280 ||||| |||||||| ||||| ||||| ||||| || ||||||||||| ||||| |||||| Sbjct: 13138 gtggtggaagacggtaaacaaagcgaactggaagccatgtgccgcgactggccgttcttc 13079 Query: 2281 tcgacgcgtctcggcatgctggagatggtcttcgccaaagcagacctgtggctggcggaa 2340 || |||||||| || |||||||| ||||| ||| | ||||| |||||||||||||| || Sbjct: 13078 tccacgcgtcttgggatgctggaaatggtgttctcgaaagccgacctgtggctggccgac 13019 Query: 2341 tactatgaccaacgcctggtagacaaagcactgtggccgttaggtaaagagttacgcaac 2400 || || || || |||||||| | ||| | || |||||| | || |||||| |||| || Sbjct: 13018 tattacgatcagcgcctggtggcgaaaacgctttggccgctgggcaaagagctacgagac 12959 Query: 2401 ctgcaagaagaagacatcaaagtggtgctggcgattgccaacgattcccatctgatggcc 2460 || | ||||||||||| ||||||||||||||||||||||||||||| || ||||||||| Sbjct: 12958 ctactggaagaagacattaaagtggtgctggcgattgccaacgattcgcacctgatggcc 12899 Query: 2461 gatctgccgtggattgcagagtctattcagctacggaatatttacaccgacccgctgaac 2520 || | ||||||||||| ||||| |||||| || | || |||| ||||| || | ||| Sbjct: 12898 gacttaccgtggattgcggagtccattcagttaagaaacgtttataccgatccattaaac 12839 Query: 2521 gtattgcaggccgagttgctg 2541 || ||||||||||| |||||| Sbjct: 12838 gtgttgcaggccgaattgctg 12818 |
/note="similar to E. coli phosphoenolpyruvate carboxylase (AAC76938.1); Blastp hit to AAC76938.1 (883 aa), 93% identity in aa 1 - 883. CAPP_SALTY - очень близкий гомолог для CAPP_ECOLI.
FT gene complement(12707..15368) FT /gene="ppc" FT /note="synonym: STM4119" FT CDS complement(12707..15358) FT /codon_start=1 FT /transl_table=11 FT /gene="ppc" FT /product="phosphoenolpyruvate carboxylase" FT /EC_number="4.1.1.31" FT /note="similar to E. coli phosphoenolpyruvate carboxylase FT (AAC76938.1); Blastp hit to AAC76938.1 (883 aa), 93% FT identity in aa 1 - 883" FT /db_xref="GOA:Q8ZKM0" FT /db_xref="InterPro:IPR001449" FT /db_xref="UniProtKB/Swiss-Prot:Q8ZKM0" FT /protein_id="AAL22958.1" FT /translation="MNEQYSALRSNVSMLGKVLGETIKDALGEHILDRVETIRKLSKSS FT RAGNEANRQELLTTLQNLSNDELLPVARAFSQFLNLANTAEQYHSISPKGEAASNPEVI FT ARTLRKLKNQPDLNDATIKKAVESLSLELVLTAHPTEITRRTLIHKMGEINNCLKQLDN FT TDIADYERHQVMRRLRQLIAQSWHTDEIRKQRPSPVDEAKWGFAVVENSLWQGVPNYLR FT ELNEQLEENLGYKLPVDFVPVRFTSWMGGDRDGNPNVTADITRHVLLLSRWKATDLFLK FT DIHVLVSELSMVDATPELLALVGEEGASEPYRYLMKKLRARLMATQSWLEARLKGEKLP FT KPAGLLTQNEQLWEPLYACYQSLQACGMGIIANGELLDTLRRVKCFGVPLVRIDIRQES FT TRHTEALGEITRYLGIGDYESWSEADKQAFLIRELNSKRPLLPRNWEPSNDTREVLETC FT KVIAEAPKGSIAAYVISMAKTPSDVLAVHLLLKEAGIGFAMPVAPLFETLDDLNNADDV FT MTQLLNIDWYRGLIQGKQMVMIGYSDSAKDAGVMAASWAQYQAQDALIKTCEKAGIELT FT LFHGRGGSIGRGGAPAHAALLSQPPGSLKGGLRVTEQGEMIRFKYGLPEVTVSSLSLYT FT SAILEANLLPPPEPKDSWRHIMDELSVISCETYRGYVRENKDFVPYFRSATPEQELGKL FT PLGSRPAKRRPTGGVESLRAIPWIFAWTQNRLMLPAWLGAGTALQKVVEDGKQSELEAM FT CRDWPFFSTRLGMLEMVFSKADLWLADYYDQRLVAKTLWPLGKELRDLLEEDIKVVLAI FT ANDSHLMADLPWIAESIQLRNVYTDPLNVLQAELLYRSRLTEEQGKSPDPRVEQALMVT FT IAGVAAGMRNTG" |