Difference between revisions of "SCI/Specifications/SCI in action/Parser"

← Older edit

SCI/Specifications/SCI in action/Parser (view source)

Revision as of 15:00, 10 February 2010

22 bytes removed , 15:00, 10 February 2010

→‎Semantics

Fingolfin

1,079

edits

@@ Line 218: / Line 218: @@
 With the notable exception of the first rule, these rules constitute  the set of production rules <math>P</math>. Furthermore we have <math>V := \{x \mid \exists R \in P :  x \in R\}</math>; typically, <math>V = \{ \mathrm{0x12f}, \ldots, \mathrm{0x13f}\}</math> and  <math>s = m_0</math> of the first rule encountered; in all games observed, it was set to 0x13c. <math>\Sigma</math> contains all word groups and class masks. For the sake of simplicity, we will consider rules matching composite class masks to be several rules. Here is a simplified example of what such a grammar might look like (the hexadecimal prefix '0x' is omitted for brevity):
-In addition to this grammar, each right-hand non-terminal <i>m<sub>i</sub></i> carries its semantic value <i>&rho;<sub>i</sub></i> , which is not relevant for constructing a syntax tree, but must be considered for the semantic tree <i>T</i><sub>&pi;</sub>. These values were omitted in the example above. As in the example above, the grammar is a context-free (type 2) grammar, almost in Chomsky Normal Form (CNF) in SCI; constructing a grammar with CNF rules from it would be trivial.<ref>FreeSCI constructs a GNF (Greibach Normal Form) representation from these rules for parsing.</ref>
+In addition to this grammar, each right-hand non-terminal <math>m_i</math> carries its semantic value <math>\rho_i</math> , which is not relevant for constructing a syntax tree, but must be considered for the semantic tree <math>T_\pi</math>. These values were omitted in the example above. As in the example above, the grammar is a context-free (type 2) grammar, almost in Chomsky Normal Form (CNF) in SCI; constructing a grammar with CNF rules from it would be trivial.<ref>FreeSCI constructs a GNF (Greibach Normal Form) representation from these rules for parsing.</ref>
-Obviously, G is an ambiguous grammar. In SCI, rule precedence is implied by rule order, so the resulting left derivation tree is well-defined (in the example, it would be defined by <i>D<sub>0</sub></i>.<ref>In FreeSCI, you can use the ”parse” console command to retreive all possible left derivation trees.</ref>
+Obviously, <math>G</math> is an ambiguous grammar. In SCI, rule precedence is implied by rule order, so the resulting left derivation tree is well-defined (in the Parser example further down, it would be defined by <math>D_0</math>.<ref>In FreeSCI, you can use the ”parse” console command to retrieve all possible left derivation trees.</ref>
 <br>
 ====Parser grammar example====
-<span style="white-space:nowrap">G = &lang; { 12f, &hellip;, 13e }, { C<sub>1</sub>, C<sub>2</sub>, C<sub>4</sub>, &hellip;, C<sub>100</sub> }, P, 13c &rang;</span>
+<math>G = \langle \{ \mathrm{12f}, \ldots, \mathrm{13e} \}, \{ C_1, C_2, C_4, \ldots, C_{100} \}, P, \mathrm{13c} \rangle</math>
 {| <i>
 |----
@@ Line 290: / Line 290: @@
 ===Semantics===
-This is important, since the parser does much more than just accept or discard input. Using the semantic tags applied to each non-terminal on the right-hand side of a rule, it constructs what I will call the semantic parse tree <i>T</i><sub>&pi;</sub> , which attempts to describe what the input means. For each non-terminal rule
+This is important, since the parser does much more than just accept or discard input. Using the semantic tags applied to each non-terminal on the right-hand side of a rule, it constructs what I will call the semantic parse tree <math>T_\pi</math> , which attempts to describe what the input means. For each non-terminal rule
-<i>r = &upsilon;<sub>0</sub> &#8614; &upsilon;<sub>1</sub>&upsilon;<sub>2</sub>&hellip;&upsilon;<sub>n</sub></i>
+<math>r = v_0 \to v_1 v_2 \ldots v_n</math>
-there are semantic tags <i>&sigma;<sub>r,1</sub>,&sigma;<sub>r,2</sub>&hellip;&sigma;<sub>r,n</sub> &isin; S</i> , as explained above. <i>T</i><sub>&pi;</sub> is now constructed from the resulting derivation and the semantic tags assiociated with each non-terminal of the rule used. The construction algorithm is explained below with <i>T</i><sub>&pi;</sub> being constructed from nodes, which have the following structure:
+there are semantic tags <math>\sigma_{r,1}, \sigma_{r,2} \ldots \sigma_{r,n} \in S</math>, as explained above. <math>T_\pi</math> is now constructed from the resulting derivation and the semantic tags assiociated with each non-terminal of the rule used. The construction algorithm is explained below with <math>T_\pi</math> being constructed from nodes, which have the following recursive structure:
-<i>NODE = {&#9671;} &cup; S &#215; V &#215; (NODE &cup; &Gamma;)*;</i>
+<math>
+\newcommand{\Node}{\textrm{NODE}}
+\Node = \{\diamond\} \cup S \times V \times (\Node \cup \Gamma)*;</math>
-Where S is the set of possible semantic values, and V is the set of non-terminals as defined in the grammar. We will also use the sequence <i>&gamma;<sub>0</sub>,&gamma<sub>1</sub>,&gamma<sub>2</sub> &hellip; &gamma<sub>k-1</sub></i> which will represent the word groups the input tokens belonged to (in the exact order they were accepted), and the sequence <i>r<sub>0</sub>,r<sub>1</sub>,r<sub>2</sub> &hellip; r<sub>l-1</sub></i> , which will be the list of rules used to create the left derivation tree as described in the previous section.
+Where <math>S</math> is the set of possible semantic values, and <math>V</math> is the set of non-terminals as defined in the grammar. We will also use the sequence <math>\gamma_{0}, \gamma_{1}, \gamma_{2} \ldots \gamma_{k-1}</math> which will represent the word groups the input tokens belonged to (in the exact order they were accepted), and the sequence <math>r_{0}, r_{1}, r_{2} \ldots r_{l-1}</math>, which will be the list of rules used to create the left derivation tree as described in the previous section.
 <pre>
@@ Line 332: / Line 334: @@
 ====Example: Parser example====
 Parse is called with <i>"open door"</i>.
-* <i>"open" &isin; &lang;842,{C<sub>80</sub>}&rang;</i> (an imperative word of the word group 0x842)
+* <math>\textrm{"open"} \in \langle 842, \{ C_{80} \} \rangle</math> (an imperative word of the word group 0x842)
-* <i>"door" &isin; &lang;917,{C<sub>10</sub>}&rang; (a substantive of the word group 0x917)</i>
+* <math>\textrm{"door"} \in \langle 917, \{ C_{10} \} \rangle</math> (a substantive of the word group 0x917)
-* <i>I = &lang;842,{C<sub>80</sub>}&rang;,&lang;917,{C<sub>10</sub>}&rang;</i>
+* <math>I = \langle 842, \{C_{80} \} \rangle, \langle 917, \{C_{10} \} \rangle</math>
-<i>I</i> is clearly accepted by automatons based on the grammar described above, There are two possible derivations:
+<math>I</math> is clearly accepted by automatons based on the grammar described above, There are two possible derivations:
 {|
-|D<sub>0</sub> = 13c
+|colspan="2" align="center"| D<sub>0</sub> = 13c
+|width="50em"|&nbsp;
+|colspan="2" align="center"| D<sub>1</sub> = 13c
 |----
-|(13c &#8614; 13b134)
+|(13c &#8614; 13b 134)
 |&#8866; 13b 134
+|
+|(13c &#8614; 13b)
+|&#8866; 13b
 |----
 |(13b &#8614; 131)
+|&#8866; 131 134
+|
+|(13b &#8614; 131 134)
 |&#8866; 131 134
 |----
+|(131 &#8614; C<sub>80</sub>
+|&#8866; C<sub>80</sub> 134
+|
 |(131 &#8614; C<sub>80</sub>
 |&#8866; C<sub>80</sub> 134
@@ Line 352: / Line 365: @@
 |(134 &#8614; C<sub>10</sub>
 |&#8866; C<sub>80</sub> C<sub>10</sub>
-|----
+|
-|}
-<br>
-{|
-|D<sub>1</sub> = 13c
-|----
-|(13c &#8614; 13b)
-|&#8866; 13b
-|----
-|(13b &#8614; 131134)
-|&#8866; 131 134
-|----
-|(131 &#8614; C<sub>80</sub>
-|&#8866; C<sub>80</sub> 134
-|----
 |(134 &#8614; C<sub>10</sub>
 |&#8866; C<sub>80</sub> C<sub>10</sub>
 |----
 |}
 ====Example: Semantic tree example====

Difference between revisions of "SCI/Specifications/SCI in action/Parser"

SCI/Specifications/SCI in action/Parser (view source)

Revision as of 15:00, 10 February 2010

Navigation menu

Search