[12] | 1 | <html> |
---|
| 2 | |
---|
| 3 | <head> |
---|
| 4 | <meta http-equiv="Content-Type" |
---|
| 5 | content="text/html; charset=iso-8859-1"> |
---|
| 6 | <meta name="GENERATOR" content="Microsoft FrontPage Express 2.0"> |
---|
| 7 | <title>TokenizerFunction Concept</title> |
---|
| 8 | </head> |
---|
| 9 | |
---|
| 10 | <body bgcolor="#FFFFFF" text="#000000" link="#0000EE" |
---|
| 11 | vlink="#551A8B" alink="#FF0000"> |
---|
| 12 | |
---|
| 13 | <p><img src="../../boost.png" alt="C++ Boost" width="277" |
---|
| 14 | height="86"> </p> |
---|
| 15 | |
---|
| 16 | <h1 align="center">TokenizerFunction Concept</h1> |
---|
| 17 | |
---|
| 18 | <p>A TokenizerFunction is a functor whose purpose is to parse a |
---|
| 19 | given sequence until exactly 1 token has been found or the end is |
---|
| 20 | reached. It then updates the token, and informs the caller of the |
---|
| 21 | location in the sequence of the next element immediately after |
---|
| 22 | the last element of the sequence that was parsed for the current |
---|
| 23 | token. </p> |
---|
| 24 | |
---|
| 25 | <h2>Refinement of</h2> |
---|
| 26 | |
---|
| 27 | <p>Assignable, CopyConstructable</p> |
---|
| 28 | |
---|
| 29 | <h2>Notation</h2> |
---|
| 30 | |
---|
| 31 | <table border="1"> |
---|
| 32 | <tr> |
---|
| 33 | <td valign="top"><tt>X</tt> </td> |
---|
| 34 | <td valign="top">A type that is a model of |
---|
| 35 | TokenizerFunction</td> |
---|
| 36 | </tr> |
---|
| 37 | <tr> |
---|
| 38 | <td valign="top"><tt>func</tt> </td> |
---|
| 39 | <td valign="top">Object of type <tt>X</tt> </td> |
---|
| 40 | </tr> |
---|
| 41 | <tr> |
---|
| 42 | <td valign="top"><tt>tok</tt></td> |
---|
| 43 | <td valign="top">Object of Token</td> |
---|
| 44 | </tr> |
---|
| 45 | <tr> |
---|
| 46 | <td>next</td> |
---|
| 47 | <td>iterator that points to the first unparsed element of |
---|
| 48 | the sequence being parsed</td> |
---|
| 49 | </tr> |
---|
| 50 | <tr> |
---|
| 51 | <td>end</td> |
---|
| 52 | <td>iterator that points to the past the end of the |
---|
| 53 | sequence being parsed</td> |
---|
| 54 | </tr> |
---|
| 55 | </table> |
---|
| 56 | |
---|
| 57 | <h2>Definitions</h2> |
---|
| 58 | |
---|
| 59 | <p>A token is the result of parsing a sequence.</p> |
---|
| 60 | |
---|
| 61 | <h2>Valid expressions</h2> |
---|
| 62 | |
---|
| 63 | <p>In addition to the expression in Assignable and |
---|
| 64 | CopyConstructable the following expressions are valid</p> |
---|
| 65 | |
---|
| 66 | <table border="1"> |
---|
| 67 | <tr> |
---|
| 68 | <th>Name </th> |
---|
| 69 | <th>Expression </th> |
---|
| 70 | <th>Return type </th> |
---|
| 71 | </tr> |
---|
| 72 | <tr> |
---|
| 73 | <td valign="top">Functor</td> |
---|
| 74 | <td valign="top"><tt>func(next, end, tok)</tt></td> |
---|
| 75 | <td valign="top"><tt>bool</tt></td> |
---|
| 76 | </tr> |
---|
| 77 | <tr> |
---|
| 78 | <td valign="top">reset</td> |
---|
| 79 | <td valign="top"><tt>reset()</tt></td> |
---|
| 80 | <td valign="top"><tt>void</tt></td> |
---|
| 81 | </tr> |
---|
| 82 | </table> |
---|
| 83 | |
---|
| 84 | <h2>Expression semantics</h2> |
---|
| 85 | |
---|
| 86 | <p>In addition to the expression semantics in Assignable and |
---|
| 87 | CopyConstructable, TokenizerFunction has the following expression |
---|
| 88 | semantcs</p> |
---|
| 89 | |
---|
| 90 | <table border="1"> |
---|
| 91 | <tr> |
---|
| 92 | <th>Name </th> |
---|
| 93 | <th>Expression </th> |
---|
| 94 | <th>Precondition </th> |
---|
| 95 | <th>Semantics </th> |
---|
| 96 | <th>Postcondition </th> |
---|
| 97 | </tr> |
---|
| 98 | <tr> |
---|
| 99 | <td>operator()</td> |
---|
| 100 | <td><tt>func(next, end, tok)</tt></td> |
---|
| 101 | <td><tt>next</tt> and <tt>end</tt> are valid iterators to |
---|
| 102 | the same sequence. next is a reference the function is |
---|
| 103 | free to modify. tok is constructed. </td> |
---|
| 104 | <td>The return value indicates whether a new token was |
---|
| 105 | found in the sequence (next,end]</td> |
---|
| 106 | <td>If the return value is true, the new token is |
---|
| 107 | assigned to tok. next is always updated to the position |
---|
| 108 | where parsing should start on the subsequent call.</td> |
---|
| 109 | </tr> |
---|
| 110 | <tr> |
---|
| 111 | <td>reset</td> |
---|
| 112 | <td><tt>reset()</tt></td> |
---|
| 113 | <td><tt>None</tt></td> |
---|
| 114 | <td>Clears out all state variables that are used by the |
---|
| 115 | object in parsing the current sequence.</td> |
---|
| 116 | <td>A new sequence to parse can be given.</td> |
---|
| 117 | </tr> |
---|
| 118 | </table> |
---|
| 119 | |
---|
| 120 | <h2>Complexity guarantees</h2> |
---|
| 121 | |
---|
| 122 | <p>No guarantees. Models of TokenizerFunction are free to define |
---|
| 123 | their own complexity</p> |
---|
| 124 | |
---|
| 125 | <h2>Models</h2> |
---|
| 126 | |
---|
| 127 | <p><a href="escaped_list_separator.htm">escaped_list_separator</a></p> |
---|
| 128 | |
---|
| 129 | <p><a href="offset_separator.htm">offset_separator</a></p> |
---|
| 130 | |
---|
| 131 | <p><a href="char_delimiters_separator.htm">char_delimiters_separator</a></p> |
---|
| 132 | |
---|
| 133 | <p> </p> |
---|
| 134 | |
---|
| 135 | <p> </p> |
---|
| 136 | |
---|
| 137 | <hr> |
---|
| 138 | |
---|
| 139 | <p>© Copyright John R. Bandela 2001. Permission to copy, use, |
---|
| 140 | modify, sell and distribute this document is granted provided |
---|
| 141 | this copyright notice appears in all copies. This document is |
---|
| 142 | provided "as is" without express or implied warranty, |
---|
| 143 | and with no claim as to its suitability for any purpose.</p> |
---|
| 144 | </body> |
---|
| 145 | </html> |
---|