Expression |
Syntax |
Description |
Uppercase letter
|
:Lu
|
Matches any one uppercase letter. For example, :Luhe matches "The" but not "the".
|
Lowercase letter
|
:Ll
|
Matches any one lowercase letter. For example, :Llhe matches "the" but not "The".
|
Title case letter
|
:Lt
|
Matches single characters that combine an uppercase letter with a lowercase letter, such as ǋ (Nj, U+01CB) and ǲ (Dz, U+01F2).
|
Modifier letter
|
:Lm
|
Matches letter-like or punctuation-like characters, such as modifier commas, cross accents, and the double prime, that indicate a modification to the preceding letter.
|
Other letter
|
:Lo
|
Matches other letters, such as the Gothic letter ahsa.
|
Decimal digit
|
:Nd
|
Matches decimal digits such as 0-9 and their full-width equivalents.
|
Letter digit
|
:Nl
|
Matches letter digits such as Roman numerals and the ideographic number zero.
|
Other digit
|
:No
|
Matches other digits such as the Old Italic numeral one.
|
Open punctuation
|
:Ps
|
Matches opening punctuation such as opening brackets and braces.
|
Close punctuation
|
:Pe
|
Matches closing punctuation such as closing brackets and braces.
|
Initial quote punctuation
|
:Pi
|
Matches opening quotation marks, such as the left double quotation mark (“).
|
Final quote punctuation
|
:Pf
|
Matches closing quotation marks, such as the right single quotation mark (’) and the right double quotation mark (”).
|
Dash punctuation
|
:Pd
|
Matches dash and hyphen characters, such as the hyphen-minus (-).
|
Connector punctuation
|
:Pc
|
Matches connector characters such as the underscore (_), also called the underline mark.
|
Other punctuation
|
:Po
|
Matches other punctuation such as the comma (,), ?, ", !, @, #, %, &, *, \, the colon (:), the semicolon (;), ', and /.
|
Space separator
|
:Zs
|
Matches blank (space) characters, such as the ordinary space (U+0020) and the no-break space (U+00A0).
|
Line separator
|
:Zl
|
Matches the Unicode character U+2028.
|
Paragraph separator
|
:Zp
|
Matches the Unicode character U+2029.
|
Non-spacing mark
|
:Mn
|
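Matches non-spacing marks, such as the combining acute accent (U+0301), which attach to a base character without taking up space of their own.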
|
Combining mark
|
:Mc
|
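Matches spacing combining marks, which attach to a base character but occupy a position of their own, such as the Devanagari vowel sign AA (U+093E).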
|
Enclosing mark
|
:Me
|
Matches enclosing marks, which surround the preceding character, such as the combining enclosing circle (U+20DD).
|
Math symbol
|
:Sm
|
Matches math symbols such as +, =, ~, |, <, and >.
|
Currency symbol
|
:Sc
|
Matches $ and other currency symbols.
|
Modifier symbol
|
:Sk
|
Matches modifier symbols such as circumflex accent, grave accent, and macron.
|
Other symbol
|
:So
|
Matches other symbols, such as the copyright sign, pilcrow sign, and the degree sign.
|
Other control
|
:Cc
|
Matches Unicode control characters such as TAB and NEWLINE.
|
Other format
|
:Cf
|
Matches formatting control characters, such as the bidirectional control characters.
|
Surrogate
|
:Cs
|
Matches one half of a surrogate pair.
|
Other private-use
|
:Co
|
Matches any character from the private-use area.
|
Other not assigned
|
:Cn
|
Matches code points that are not assigned to any Unicode character.
|
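
The two-letter codes in the Syntax column correspond to Unicode general categories, so the table can be checked against any Unicode-aware tool. The following is a minimal sketch in Python, not the product's own implementation: it uses the standard `unicodedata` module to look up a character's category, and a hypothetical helper, `matches_category_then`, to imitate an expression such as :Luhe (one character of a category followed by literal text). The helper name and its behavior are assumptions made for illustration only.

```python
import unicodedata


def category(ch: str) -> str:
    """Return the two-letter Unicode general category of one character."""
    return unicodedata.category(ch)


def matches_category_then(text: str, cat: str, rest: str) -> bool:
    """Hypothetical helper imitating an expression like :Luhe.

    Returns True if some character in `text` belongs to category `cat`
    and is immediately followed by the literal string `rest`.
    """
    return any(
        unicodedata.category(text[i]) == cat and text.startswith(rest, i + 1)
        for i in range(len(text))
    )


if __name__ == "__main__":
    # Look up a few characters mentioned in the table.
    for ch in ["T", "t", "\u01c5", "_", "$", "+", "(", "\u2028"]:
        print(f"U+{ord(ch):04X} -> {category(ch)}")

    # Imitate :Luhe, which matches "The" but not "the".
    print(matches_category_then("The", "Lu", "he"))  # True
    print(matches_category_then("the", "Lu", "he"))  # False
```

The exact category reported for a given character depends on the Unicode version bundled with the Python build, so results for rarely used or recently reclassified characters may differ from the table.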