Commit f13c4ba1 authored by Georg Brandl

[ 1243192 ] Incorrect documentation of re.UNICODE

parent 67962ab1
@@ -342,17 +342,33 @@ with Python's string literals.
 at the beginning or end of a word. This is just the opposite of {}\code{\e
 b}, so is also subject to the settings of \code{LOCALE} and \code{UNICODE}.

-\item[\code{\e d}]Matches any decimal digit; this is
-equivalent to the set \regexp{[0-9]}.
+\item[\code{\e d}]When the \constant{UNICODE} flag is not specified, matches
+any decimal digit; this is equivalent to the set \regexp{[0-9]}.
+With \constant{UNICODE}, it will match whatever is classified as a digit
+in the Unicode character properties database.

-\item[\code{\e D}]Matches any non-digit character; this is
-equivalent to the set \regexp{[{\textasciicircum}0-9]}.
+\item[\code{\e D}]When the \constant{UNICODE} flag is not specified, matches
+any non-digit character; this is equivalent to the set
+\regexp{[{\textasciicircum}0-9]}. With \constant{UNICODE}, it will match
+anything other than character marked as digits in the Unicode character
+properties database.

-\item[\code{\e s}]Matches any whitespace character; this is
+\item[\code{\e s}]When the \constant{LOCALE} and \constant{UNICODE}
+flags are not specified, matches any whitespace character; this is
 equivalent to the set \regexp{[ \e t\e n\e r\e f\e v]}.
+With \constant{LOCALE}, it will match this set plus whatever characters
+are defined as space for the current locale. If \constant{UNICODE} is set,
+this will match the characters \regexp{[ \e t\e n\e r\e f\e v]} plus
+whatever is classified as space in the Unicode character properties
+database.

-\item[\code{\e S}]Matches any non-whitespace character; this is
-equivalent to the set \regexp{[\textasciicircum\ \e t\e n\e r\e f\e v]}.
+\item[\code{\e S}]When the \constant{LOCALE} and \constant{UNICODE}
+flags are not specified, matches any non-whitespace character; this is
+equivalent to the set \regexp{[\textasciicircum\ \e t\e n\e r\e f\e v]}
+With \constant{LOCALE}, it will match any character not in this set,
+and not defined as space in the current locale. If \constant{UNICODE}
+is set, this will match anything other than \regexp{[ \e t\e n\e r\e f\e v]}
+and characters marked as space in the Unicode character properties database.

 \item[\code{\e w}]When the \constant{LOCALE} and \constant{UNICODE}
 flags are not specified, matches any alphanumeric character and the
@@ -468,8 +484,8 @@ current locale.

 \begin{datadesc}{L}
 \dataline{LOCALE}
-Make \regexp{\e w}, \regexp{\e W}, \regexp{\e b}, and
-\regexp{\e B} dependent on the current locale.
+Make \regexp{\e w}, \regexp{\e W}, \regexp{\e b}, \regexp{\e B},
+\regexp{\e s} and \regexp{\e S} dependent on the current locale.
 \end{datadesc}

 \begin{datadesc}{M}
@@ -493,8 +509,9 @@ anything \emph{except} a newline.

 \begin{datadesc}{U}
 \dataline{UNICODE}
-Make \regexp{\e w}, \regexp{\e W}, \regexp{\e b}, and
-\regexp{\e B} dependent on the Unicode character properties database.
+Make \regexp{\e w}, \regexp{\e W}, \regexp{\e b}, \regexp{\e B},
+\regexp{\e d}, \regexp{\e D}, \regexp{\e s} and \regexp{\e S}
+dependent on the Unicode character properties database.
 \versionadded{2.0}
 \end{datadesc}
......
@@ -433,6 +433,8 @@ Tests
 Documentation
 -------------

+- Bug #1243192: re.UNICODE and re.LOCALE affect \d, \D, \s and \S.
+
 - Bug #755617: Document the effects of os.chown() on Windows.

 - Patch #1180012: The documentation for modulefinder is now in the library reference.
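
The behaviour this commit documents can be checked with a short script. This is only a sketch and not part of the commit; it assumes a Python 3 interpreter, where str patterns already behave as if re.UNICODE were set, so re.ASCII stands in for the "UNICODE flag is not specified" case described in the patched text. The helper name `matches` and the two sample characters are illustrative choices.

```python
import re

# Sample characters (illustrative choices, not taken from the commit):
# U+0663 ARABIC-INDIC DIGIT THREE is a decimal digit in the Unicode database,
# U+00A0 NO-BREAK SPACE is classified as whitespace.
arabic_three = "\u0663"
nbsp = "\u00a0"


def matches(pattern, text, flags=0):
    """Return True if the whole of `text` matches `pattern`."""
    return re.fullmatch(pattern, text, flags) is not None


# Without Unicode semantics, \d is [0-9] and \s is [ \t\n\r\f\v].
ascii_digit = matches(r"\d", arabic_three, re.ASCII)
ascii_space = matches(r"\s", nbsp, re.ASCII)

# With re.UNICODE, \d and \s consult the Unicode character properties.
unicode_digit = matches(r"\d", arabic_three, re.UNICODE)
unicode_space = matches(r"\s", nbsp, re.UNICODE)

# \D and \S are the complements, so they follow the same flag.
unicode_nondigit = matches(r"\D", arabic_three, re.UNICODE)
unicode_nonspace = matches(r"\S", nbsp, re.UNICODE)

print(ascii_digit, unicode_digit, ascii_space, unicode_space)
```

Under these assumptions the two ASCII-mode checks fail to match and the two Unicode-mode checks succeed, which is the distinction the corrected \d, \D, \s and \S entries spell out.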