Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Published by Scroll Versions from space WBRIDGE and version 7.6.0
CharacterDescription
Notes
\
aMatch a BELL, \
u0007 
u0007.
 
\
AMatch at the beginning of the input. Differs from ^ in that \A will not match after a new line within the input.
  
\b, outside of a [Set]
 
Match if the current position is a word boundary. Boundaries occur at the transitions
betweem
between word (\w) and non-word (\W) characters, with combining marks ignored. For better word boundaries, see ICU Boundary Analysis.
 
 
\b, within a [Set]
 
Match a BACKSPACE, \u0008.
  
\
BMatch if the current position is not a word boundary.
  
\
cX 
cXMatch a control-X character.
 
 
\
dMatch any character with the Unicode General Category of Nd (Number, Decimal Digit.)
 
.
 
\
DMatch any character that is not a decimal digit.
  
\
eMatch an ESCAPE, \u001B.
  
\
ETerminates a \Q ... \E quoted sequence.
  
\
fMatch a FORM FEED, \u000C.
 
 
\
GMatch if the current position is at the end of the previous match.
  
\
nMatch a LINE FEED, \u000A.
 
 
\N{UNICODE CHARACTER NAME}
 
Match the named character.
 post 2.4 
\p{UNICODE PROPERTY NAME}
 
Match any character with the specified Unicode Property.
  
\P{UNICODE PROPERTY NAME}
 
Match any character not having the specified Unicode Property.
 
 
\
QQuotes all following characters until \E.
 
 
\
rMatch a CARRIAGE RETURN, \u000D.
  
\
sMatch a white space character. White space is defined as [\t\n\f\r\p{Z}].
  
\
SMatch a non-white space character.
  
\
tMatch a HORIZONTAL TABULATION, \u0009.
 
 
\
uhhhh 
uhhhhMatch the character with the hex value hhhh.
  
\
Uhhhhhhhh 
UhhhhhhhhMatch the character with the hex value hhhhhhhh. Exactly eight hex digits must be provided, even though the largest Unicode code point is \U0010ffff.
 
 
\
wMatch a word character. Word characters are [\p{Ll}\p{Lu}\p{Lt}\p{Lo}\p{Nd}].
  
\
WMatch a non-word character.
 
 
\x{hhhh}
 post 2
Match the character with hex value
hhhh 
hhhh.
\
xhh post 2
xhhMatch the character with two digit hex value
hh 
hh.
\
XMatch a Grapheme Cluster. Partial implementation for ICU 2.4, does not handle Hangul syllables.
  
\
ZMatch if the current position is at the end of input, but before the final line terminator, if one exists.
  
\
zMatch if the current position is at the end of input.
 
 
\
0nnn post 2
0nnnMatch the character with octal value
nnn 
nnn.
\
nBack Reference. Match whatever the nth capturing group matched. n must be > 1 and < total number of capture groups in the
pattern post 2
pattern.
[pattern]
 
Match any one character from the set. See UnicodeSet for a full description of what may appear in the
pattern  
pattern.
.
 
Match any character.
 
 
^
Match at the beginning of a line.
 
 
$
Match at the end of a line.
  
\
 
Quotes the following character. Characters that must be quoted to be treated as literals are * ? + [ ( ) { } ^ $
\ . /