Note that the scanner requires an aditional level of escaping because
'/' indicates the end of the regular expression. So
must be escaped by a backslash:
'\/'. If you need a
backslash in you regexp, it must me escaped to:
'\\ '. Since
formfeed and other control characters are often needed
' for x from
'Z' is mapped to
' (ctrl x, ). This means
subtracted from the original character.
\B = ^B, ...
\n (newline). This is somewhat ad-hoc, but was easy to implement and
allows users of limited editors to enter control characters in the format
Here is a first small example of a structured document collection to be indexed.