MAST - Motif Alignment and Search Tool
MAST version 4.1.1 (Release date: Tue Apr 21 15:00:06 PDT 2009)
For further information on how to interpret these results or to get
a copy of the MAST software please access http://meme.nbcr.net.
REFERENCE
If you use this program in your research, please cite:
Timothy L. Bailey and Michael Gribskov,
"Combining evidence using p-values: application to sequence homology
searches", Bioinformatics, 14(48-54), 1998.
DATABASE AND MOTIFS
DATABASE ./seed.fasta (peptide)
Last updated on Sat May 29 19:03:23 2010
Database contains 12 sequences, 4063 residues
MOTIFS ./memeout.txt (peptide)
MOTIF WIDTH BEST POSSIBLE MATCH
----- ----- -------------------
1 40 AHGYLIHQFLSPRTNHRTDEYGGSFENRMRFCYEVIRAIR
2 48 MANKIIQNGKADMVFMGRPHLADPHWPRKAQEGRNDEIRHCIGCNQCC
3 32 WDDHHMPWFRKLTEAVHEYGSKIMIQLWHTGR
PAIRWISE MOTIF CORRELATIONS:
MOTIF 1 2
----- ----- -----
2 0.11
3 0.09 0.16
No overly similar pairs (correlation > 0.60) found.
Random model letter frequencies (from non-redundant database):
A 0.073 C 0.018 D 0.052 E 0.062 F 0.040 G 0.069 H 0.022 I 0.056 K 0.058
L 0.092 M 0.023 N 0.046 P 0.051 Q 0.041 R 0.052 S 0.074 T 0.059 V 0.064
W 0.013 Y 0.033
SECTION I: HIGH-SCORING SEQUENCES
- Each of the following 12 sequences has E-value less than 10.
- The E-value of a sequence is the expected number of sequences
in a random database of the same size that would match the motifs as
well as the sequence does and is equal to the combined p-value of the
sequence times the number of sequences in the database.
- The combined p-value of a sequence measures the strength of the
match of the sequence to all the motifs and is calculated by
- finding the score of the single best match of each motif
to the sequence (best matches may overlap),
- calculating the sequence p-value of each score,
- forming the product of the p-values,
- taking the p-value of the product.
- The sequence p-value of a score is defined as the
probability of a random sequence of the same length containing
some match with as good or better a score.
- The score for the match of a position in a sequence to a motif
is computed by by summing the appropriate entry from each column of
the position-dependent scoring matrix that represents the motif.
- Sequences shorter than one or more of the motifs are skipped.
- The table is sorted by increasing E-value.
Links | Sequence Name | Description | E-value | Length
|
---|
| BAICD_EUBSP/5-328
|
| 2e-66
| 324
|
| NAMA_BACSU/4-327
|
| 2.5e-62
| 324
|
| NADO_THEBR/6-340
|
| 1.8e-61
| 335
|
| FADH_ECOLI/5-331
|
| 4e-59
| 327
|
| YQIG_BACSU/7-338
|
| 3.9e-25
| 332
|
| KYE1_KLULA/15-367
|
| 2.6e-24
| 353
|
| DHTM_METME/9-342
|
| 8.7e-23
| 334
|
| BAIH_EUBSP/7-340
|
| 5.4e-21
| 334
|
| OYE1_SACPS/16-368
|
| 3.9e-20
| 353
|
| EBP1_CANAL/27-376
|
| 3.5e-11
| 350
|
| OYEA_SCHPO/6-353
|
| 1e-07
| 348
|
| OYEB_SCHPO/12-360
|
| 2.8e-06
| 349
|
SECTION II: MOTIF DIAGRAMS
- The ordering and spacing of all non-overlapping motif occurrences
are shown for each high-scoring sequence listed in Section I.
- A motif occurrence is defined as a position in the sequence whose
match to the motif has POSITION p-value less than 0.0001.
- The POSITION p-value of a match is the probability of
a single random subsequence of the length of the motif
scoring at least as well as the observed match.
- For each sequence, all motif occurrences are shown unless there
are overlaps. In that case, a motif occurrence is shown only if its
p-value is less than the product of the p-values of the other
(lower-numbered) motif occurrences that it overlaps.
- The table also shows the E-value of each sequence.
- Spacers and motif occurences are indicated by
- Spacers and motif occurences are indicated by
- Spacers and motif occurences are indicated by
- occurrence of motif `n' with p-value less than 0.0001.
- Sequences longer than 1000 are not shown to scale and are indicated by thicker lines.
Links | Name | Expect | Motifs
|
---|
| BAICD_EUBSP/5-328
| 2e-66
|
|
| NAMA_BACSU/4-327
| 2.5e-62
|
|
| NADO_THEBR/6-340
| 1.8e-61
|
|
| FADH_ECOLI/5-331
| 4e-59
|
|
| YQIG_BACSU/7-338
| 3.9e-25
|
|
| KYE1_KLULA/15-367
| 2.6e-24
|
|
| DHTM_METME/9-342
| 8.7e-23
|
|
| BAIH_EUBSP/7-340
| 5.4e-21
|
|
| OYE1_SACPS/16-368
| 3.9e-20
|
|
| EBP1_CANAL/27-376
| 3.5e-11
|
|
| OYEA_SCHPO/6-353
| 1e-07
|
|
| OYEB_SCHPO/12-360
| 2.8e-06
|
|
SCALE
|
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
1 |
25 |
50 |
75 |
100 |
125 |
150 |
175 |
200 |
225 |
250 |
275 |
300 |
325 |
|
---|
SECTION III: ANNOTATED SEQUENCES
- The positions and p-values of the non-overlapping motif occurrences
are shown above the actual sequence for each of the high-scoring
sequences from Section I.
- A motif occurrence is defined as a position in the sequence whose
match to the motif has POSITION p-value less than 0.0001 as
defined in Section II.
- For each sequence, the first line specifies the name of the sequence.
- The second (and possibly more) lines give a description of the
sequence.
- Following the description line(s) is a line giving the length,
combined p-value, and E-value of the sequence as defined in Section I.
- The next line reproduces the motif diagram from Section II.
- The entire sequence is printed on the following lines.
- Motif occurrences are indicated directly above their positions in the
sequence on lines showing
- the motif number of the occurrence,
- the position p-value of the occurrence,
- the best possible match to the motif, and
- columns whose match to the motif has a positive score (indicated
by a plus sign).
BAICD_EUBSP/5-328
LENGTH = 324 COMBINED P-VALUE = 1.64e-67 E-VALUE = 2e-66
DIAGRAM: 71-[3]-49-[1]-132
[3]
7.6e
WDDH
++++
1 ALFSPFKVRGLELKNRIVLPGMNTKMAKNKHDIGEDMIAYHVARAKAGCALNIFECVALCPAPHAYMYMGLYTDH
-32
HMPWFRKLTEAVHEYGSKIMIQLWHTGR
++++++++++++++++++++++++++++
76 HVEQLKKLTDAVHEAGGKMGIQLWHGGFSPQMFFDETNTLETPDTLTVERIHEIVEEFGRGARMAVQAGFDAVEF
[1]
1.6e-44
AHGYLIHQFLSPRTNHRTDEYGGSFENRMRFCYEVIRAIR
++++++++++++++++++++++++++++++++++++++++
151 HAAHSYLPHEFLSPGMNKRTDEYGGSFENRCRFCYEVVQAIRSNIPDDMPFFMRADCIDELMEQTMTEEEIVTFI
NAMA_BACSU/4-327
LENGTH = 324 COMBINED P-VALUE = 2.08e-63 E-VALUE = 2.5e-62
DIAGRAM: 73-[3]-57-[1]-122
[3
2.
WD
++
1 KLFTPITIKDMTLKNRIVMSPMCMYSSHEKDGKLTPFHMAHYISRAIGQVGLIIVEASAVNPQGRITDQDLGIWS
]
0e-31
DHHMPWFRKLTEAVHEYGSKIMIQLWHTGR
++++++++++++++++++++++++++++++
76 DEHIEGFAKLTEQVKEQGSKIGIQLAHAGRKAELEGDIFAPSAIAFDEQSATPVEMSAEKVKETVQEFKQAAARA
[1]
1.2e-41
AHGYLIHQFLSPRTNHRTDEYGGSFENRMRFCYEVIRAIR
++++++++++++++++++++++++++++++++++++++++
151 KEAGFDVIEIHAAHGYLIHEFLSPLSNHRTDEYGGSPENRYRFLREIIDEVKQVWDGPLFVRVSASDYTDKGLDI
NADO_THEBR/6-340
LENGTH = 335 COMBINED P-VALUE = 1.50e-62 E-VALUE = 1.8e-61
DIAGRAM: 73-[3]-61-[1]-129
[3
1.
WD
++
1 NLFSEGRIGNLVIRNRIVMPPMATNLANEDGSVSQRLIDYYVARARGGVGLIILENVQVDYPQGKNVACQLRLDD
]
1e-30
DHHMPWFRKLTEAVHEYGSKIMIQLWHTGR
++++++++++++++++++++++++++++++
76 DKYMAGFFELAEAVHSYGAKIFMQIHHAGRQTTPGITEGLQPVAPSPVPCSFLGTQPRELTINEIEEIIQKFVDA
[1]
8.4e-41
AHGYLIHQFLSPRTNHRTDEYGGSFENRMRFCYEVIRAIR
++++++++++++++++++++++++++++++++++++++++
151 AVRAKGAMFDGIELHGAHGYLIGQFMSPRTNRRVDKYGGSFERRMRFPLEIIRRIKEAVGEDYPISFRFSADEFV
FADH_ECOLI/5-331
LENGTH = 327 COMBINED P-VALUE = 3.37e-60 E-VALUE = 4e-59
DIAGRAM: 71-[3]-56-[1]-128
[3]
8.8e
WDDH
++++
1 SLFAPLDLGFTTLKNRVLMGSMHTGLEEYPDGAERLAAFYAERARHGVALIVSGGIAPDLTGVGMEGGAMLNDAS
-30
HMPWFRKLTEAVHEYGSKIMIQLWHTGR
++++++++++++++++++++++++++++
76 QIPHHRTITEAVHQEGGKIALQILHTGRYSYQPHLVAPSALQAPINRFVPHELSHEEILQLIDNFARCAQLAREA
[1]
6.6e-39
AHGYLIHQFLSPRTNHRTDEYGGSFENRMRFCYEVIRAIR
++++++++++++++++++++++++++++++++++++++++
151 GYDGVEVMGSEGYLINEFLTLRTNQRSDQWGGDYRNRMRFAVEVVRAVRERVGNDFIIIYRLSMLDLVEDGGTFA
YQIG_BACSU/7-338
LENGTH = 332 COMBINED P-VALUE = 3.24e-26 E-VALUE = 3.9e-25
DIAGRAM: 72-[3]-60-[1]-128
[3]
2.7
WDD
++
1 PLFEPFTFKSGVTINNRIAVAPMTHYASNEDGTISEAELDYIIPRSKEMGMVITACANVTPDGKAFPGQPAIHDD
e-13
HHMPWFRKLTEAVHEYGSKIMIQLWHTGR
+ ++++++++ + ++++ ++++++
76 SNIPGLKKLAQAIQAQGAKAVVQIHHGGIECPSELVPQQDVVGPSDVFDNGKQIARALTEEEVENIVKAFGEATR
[1]
9.5e-22
AHGYLIHQFLSPRTNHRTDEYGGSFENRMRFCYEVIRAIR
+ ++++ ++ ++ ++++++ ++++ + + +++ ++++++
151 RAIEAGFDGVEIHGANGYLIQQFYSPKTNQRTDRWGGSDEKRLAFPLAIVDEVKKAASEHAKGAFLVGYRLSPEE
KYE1_KLULA/15-367
LENGTH = 353 COMBINED P-VALUE = 2.14e-25 E-VALUE = 2.6e-24
DIAGRAM: 74-[3]-72-[1]-135
[
3
W
+
1 DIFKPIKIGNTELKHRVVMPALTRMRALHPGNVPNPDWAVEYYRQRSQYPGTMIITEGAFPSAQSGGYDNAPGVW
3]
.9e-09
DDHHMPWFRKLTEAVHEYGSKIMIQLWHTGR
+ ++ ++ +++ + + + +++ +++
76 SEEQLAQWRKIFKAIHDNKSFVWVQLWVLGRQAFADNLARDGLRYDSASDEVYMGEDEKERAIRSNNPQHGITKD
[1]
6.2e-25
AHGYLIHQFLSPRTNHRTDEYGGSFENRMRFCYEVIRAIR
+ +++ ++++ + +++++++++++ +++ ++ +++++++
151 EIKQYIRDYVDAAKKCIDAGADGVEIHSANGYLLNQFLDPISNKRTDEYGGSIENRARFVLEVVDAVVDAVGAER
DHTM_METME/9-342
LENGTH = 334 COMBINED P-VALUE = 7.28e-24 E-VALUE = 8.7e-23
DIAGRAM: 70-[3]-61-[1]-131
[3]
3.5e-
WDDHH
++
1 ILFEPIQIGPKTLRNRFYQVPHCIGAGSDKPGFQSAHRSVKAEGGWAALNTEYCSINPESDDTHRLSARIWDEGD
09
MPWFRKLTEAVHEYGSKIMIQLWHTGR
+ ++ +++ ++ +++ + ++ ++
76 VRNLKAMTDEVHKYGALAGVELWYGGAHAPNMESRATPRGPSQYASEFETLSYCKEMDLSDIAQVQQFYVDAAKR
[1]
2.3e-23
AHGYLIHQFLSPRTNHRTDEYGGSFENRMRFCYEVIRAIR
++++++ +++ + ++++++++++++++ ++ ++ + ++
151 SRDAGFDIVYVYGAHSYLPLQFLNPYYNKRTDKYGGSLENRARFWLETLEKVKHAVGSDCAIATRFGVDTVYGPG
BAIH_EUBSP/7-340
LENGTH = 334 COMBINED P-VALUE = 4.54e-22 E-VALUE = 5.4e-21
DIAGRAM: 72-[3]-60-[1]-130
[3]
3.8
WDD
+++
1 RLFSPLQIGSLTLSNRVGMAPMSMDYEAADGTVPKRLADVFVRRAEGGTGYVMIDAVTIDSKYPYMGNTTALDRD
e-13
HHMPWFRKLTEAVHEYGSKIMIQLWHTGR
+ ++++++ ++ ++++++ ++ + +
76 ELVPQFKEFADRVKEAGSTLVPQIIHPGPESVCGYRHIAPLGPSANTNANCHVSRSISIDEIHDIIKQFGQAARR
[1]
1.5e-17
AHGYLIHQFLSPRTNHRTDEYGGSFENRMRFCYEVIRAIR
+++ +++++ +++ ++++++++++ ++ + + + +
151 AEEAGCGAISLHCAHAYMLPGSFLSPLRNKRMDEYGGSLDNRARFVIEMIEEARRNVSPDFPIFLRISGDERMVG
OYE1_SACPS/16-368
LENGTH = 353 COMBINED P-VALUE = 3.22e-21 E-VALUE = 3.9e-20
DIAGRAM: 74-[3]-72-[1]-135
[
2
W
+
1 NLFKPIKIGNNELLHRAVIPPLTRMRALHPGNIPNRDWAVEYYTQRAQRPGTMIITEGAFISPQAGGYDNAPGVW
3]
.3e-07
DDHHMPWFRKLTEAVHEYGSKIMIQLWHTGR
+ +++ ++ + ++ + +++ ++
76 SEEQMVEWTKIFNAIHEKKSFVWVQLWVLGWAAFPDNLARDGLRYDSASDNVFMDAEQEAKAKKANNPQHSLTKD
[1]
3.5e-22
AHGYLIHQFLSPRTNHRTDEYGGSFENRMRFCYEVIRAIR
+ +++ ++++ + ++ ++++++++ +++ ++ ++++++
151 EIKQYIKEYVQAAKNSIAAGADGVEIHSANGYLLNQFLDPHSNTRTDEYGGSIENRARFTLEVVDALVEAIGHEK
EBP1_CANAL/27-376
LENGTH = 350 COMBINED P-VALUE = 2.92e-12 E-VALUE = 3.5e-11
DIAGRAM: 72-[3]-72-[1]-134
[3]
2.0
WDD
+ +
1 KLFQPIKVGNNVLPQRIAYVPTTRFRASKDHIPSDLQLNYYNARSQYPGTLIITEATFASERGGIDLHVPGIYND
e-08
HHMPWFRKLTEAVHEYGSKIMIQLWHTGR
+ +++ ++ + ++ +++ +++
76 AQAKSWKKINEAIHGNGSFSSVQLWYLGRVANAKDLKDSGLPLIAPSAVYWDENSEKLAKEAGNELRALTEEEID
[1]
1.9e-11
AHGYLIHQFLSPRTNHRTDEYGGSFENRMRFCYEVIRAIR
+++++ ++++ + +++++++++ ++ +
151 HIVEVEYPNAAKHALEAGFDYVEIHGAHGYLLDQFLNLASNKRTDKYGCGSIENRARLLLRVVDKLIEVVGANRL
OYEA_SCHPO/6-353
LENGTH = 348 COMBINED P-VALUE = 8.49e-09 E-VALUE = 1e-07
DIAGRAM: 72-[3]-82-[1]-122
[3]
1.8
WDD
+
1 ELFKPIKVGNMLLQHRIVHAPMTRLRATDYGKITGLMVEYYSQRSMIPGTLLIADATFVGEKSGGFPNNPRCFTK
e-06
HHMPWFRKLTEAVHEYGSKIMIQLWHTGR
++ + + ++++ + +++ + +
76 EQAESWIPLVEAVHKNKSFLFIQFWPLPGDLKDEYRNDLEKMQKITYSDCPQDPGGLPAGIHSFDAVQGVEVYKK
[1]
1.8e-10
AHGYLIHQFLSPRTNHRTDEYGGSFENRMRFCYEVIRAI
+ + + + ++ ++ +++++++++++ ++
151 KYMSKRDIQEHIQDFVNAADLAVNIAKADGVEIHQVNGFLLDRFVLGGFGDQCDPEYRGSIENRCRFPLEVLEAV
R
226 TRKIGQERVGYRISPFSGWMQKIDFMEVNIYLMSEIAKRFPKLAYIHAIEPRKYWSGHKLVSSEQNTSFLQKYWK
OYEB_SCHPO/12-360
LENGTH = 349 COMBINED P-VALUE = 2.37e-07 E-VALUE = 2.8e-06
DIAGRAM: 73-[3]-82-[1]-122
[3
1.
WD
+
1 KLFQPIKVGNMQLQHRMVHAPATRLRCLDNGLVMTDLVKEYYKQRSSIPGTLLITESLFSGAKSGGFSNIPCLYN
]
7e-06
DHHMPWFRKLTEAVHEYGSKIMIQLWHTGR
+++++ + + + + +++ + +
76 DEHVEAWKPIVQAIHDNDCFVFIQFWNLPGELKVDYLEDQERLEKVTQGECPMDPTGLPAALGSAYSICGEELYV
[1]
1.1e-08
AHGYLIHQFLSPRTNHRTDEYGGSFENRMRFCYEVIRA
+ + + + ++ ++++++ +++++ +
151 DKFMTKQDIQEHITTYTEAAKRAVFGCGADGVEVHQVNGFLLDKFVLNGYGDKCDPEYCGSLENRARFCLELLES
IR
+
226 VVNAIGQERVGYRISPFSDIWKDKDSFEAHVFMIKKIKERFPNLAYLHAIEPRQYWNGHVHITQEKNTLIYKNLW
Debugging Information
CPU: kodomo-count
Time 0.088005 secs.
mast ./memeout.txt -d ./seed.fasta -ev 10.000000 -mt 0.000100
Button Help
Links to Entrez database at NCBI
Links to sequence scores (section I)
Links to motif diagrams (section II)
Links to sequence/motif annotated alignments (section III)
This information