Skip to content
Projeler
Gruplar
Parçacıklar
Yardım
Yükleniyor...
Oturum aç / Kaydol
Gezinmeyi değiştir
C
cpython
Proje
Proje
Ayrıntılar
Etkinlik
Cycle Analytics
Depo (repository)
Depo (repository)
Dosyalar
Kayıtlar (commit)
Dallar (branch)
Etiketler
Katkıda bulunanlar
Grafik
Karşılaştır
Grafikler
Konular (issue)
0
Konular (issue)
0
Liste
Pano
Etiketler
Kilometre Taşları
Birleştirme (merge) Talepleri
0
Birleştirme (merge) Talepleri
0
CI / CD
CI / CD
İş akışları (pipeline)
İşler
Zamanlamalar
Grafikler
Paketler
Paketler
Wiki
Wiki
Parçacıklar
Parçacıklar
Üyeler
Üyeler
Collapse sidebar
Close sidebar
Etkinlik
Grafik
Grafikler
Yeni bir konu (issue) oluştur
İşler
Kayıtlar (commit)
Konu (issue) Panoları
Kenar çubuğunu aç
Batuhan Osman TASKAYA
cpython
Commits
582acb75
Kaydet (Commit)
582acb75
authored
Ock 06, 2015
tarafından
Nick Coghlan
Dosyalara gözat
Seçenekler
Dosyalara Gözat
İndir
Sade Fark
Merge issue 19548 changes from 3.4
üst
5d575399
b9fdb7a4
Expand all
Hide whitespace changes
Inline
Side-by-side
Showing
9 changed files
with
90 additions
and
60 deletions
+90
-60
glossary.rst
Doc/glossary.rst
+4
-1
codecs.rst
Doc/library/codecs.rst
+0
-0
functions.rst
Doc/library/functions.rst
+5
-3
stdtypes.rst
Doc/library/stdtypes.rst
+2
-2
tarfile.rst
Doc/library/tarfile.rst
+1
-1
codecs.py
Lib/codecs.py
+30
-41
test_codecs.py
Lib/test/test_codecs.py
+37
-9
NEWS
Misc/NEWS
+8
-0
_codecsmodule.c
Modules/_codecsmodule.c
+3
-3
No files found.
Doc/glossary.rst
Dosyayı görüntüle @
582acb75
...
@@ -834,10 +834,13 @@ Glossary
...
@@ -834,10 +834,13 @@ Glossary
:
meth
:`~
collections
.
somenamedtuple
.
_asdict
`.
Examples
of
struct
sequences
:
meth
:`~
collections
.
somenamedtuple
.
_asdict
`.
Examples
of
struct
sequences
include
:
data
:`
sys
.
float_info
`
and
the
return
value
of
:
func
:`
os
.
stat
`.
include
:
data
:`
sys
.
float_info
`
and
the
return
value
of
:
func
:`
os
.
stat
`.
text
encoding
A
codec
which
encodes
Unicode
strings
to
bytes
.
text
file
text
file
A
:
term
:`
file
object
`
able
to
read
and
write
:
class
:`
str
`
objects
.
A
:
term
:`
file
object
`
able
to
read
and
write
:
class
:`
str
`
objects
.
Often
,
a
text
file
actually
accesses
a
byte
-
oriented
datastream
Often
,
a
text
file
actually
accesses
a
byte
-
oriented
datastream
and
handles
the
text
encoding
automatically
.
and
handles
the
:
term
:`
text
encoding
`
automatically
.
..
seealso
::
..
seealso
::
A
:
term
:`
binary
file
`
reads
and
write
:
class
:`
bytes
`
objects
.
A
:
term
:`
binary
file
`
reads
and
write
:
class
:`
bytes
`
objects
.
...
...
Doc/library/codecs.rst
Dosyayı görüntüle @
582acb75
This diff is collapsed.
Click to expand it.
Doc/library/functions.rst
Dosyayı görüntüle @
582acb75
...
@@ -940,15 +940,17 @@ are always available. They are listed here in alphabetical order.
...
@@ -940,15 +940,17 @@ are always available. They are listed here in alphabetical order.
*encoding* is the name of the encoding used to decode or encode the file.
*encoding* is the name of the encoding used to decode or encode the file.
This should only be used in text mode. The default encoding is platform
This should only be used in text mode. The default encoding is platform
dependent (whatever :func:`locale.getpreferredencoding` returns), but any
dependent (whatever :func:`locale.getpreferredencoding` returns), but any
encoding supported by Python can be used. See the :mod:`codecs` module for
:term:`text encoding` supported by Python
can be used. See the :mod:`codecs` module for
the list of supported encodings.
the list of supported encodings.
*errors* is an optional string that specifies how encoding and decoding
*errors* is an optional string that specifies how encoding and decoding
errors are to be handled--this cannot be used in binary mode.
errors are to be handled--this cannot be used in binary mode.
A variety of standard error handlers are available, though any
A variety of standard error handlers are available
(listed under :ref:`error-handlers`), though any
error handling name that has been registered with
error handling name that has been registered with
:func:`codecs.register_error` is also valid. The standard names
:func:`codecs.register_error` is also valid. The standard names
ar
e:
includ
e:
* ``'strict'`` to raise a :exc:`ValueError` exception if there is
* ``'strict'`` to raise a :exc:`ValueError` exception if there is
an encoding error. The default value of ``None`` has the same
an encoding error. The default value of ``None`` has the same
...
...
Doc/library/stdtypes.rst
Dosyayı görüntüle @
582acb75
...
@@ -1512,7 +1512,7 @@ expression support in the :mod:`re` module).
...
@@ -1512,7 +1512,7 @@ expression support in the :mod:`re` module).
a :exc:`UnicodeError`. Other possible
a :exc:`UnicodeError`. Other possible
values are ``'ignore'``, ``'replace'``, ``'xmlcharrefreplace'``,
values are ``'ignore'``, ``'replace'``, ``'xmlcharrefreplace'``,
``'backslashreplace'`` and any other name registered via
``'backslashreplace'`` and any other name registered via
:func:`codecs.register_error`, see section :ref:`
codec-base-classe
s`. For a
:func:`codecs.register_error`, see section :ref:`
error-handler
s`. For a
list of possible encodings, see section :ref:`standard-encodings`.
list of possible encodings, see section :ref:`standard-encodings`.
.. versionchanged:: 3.1
.. versionchanged:: 3.1
...
@@ -2384,7 +2384,7 @@ arbitrary binary data.
...
@@ -2384,7 +2384,7 @@ arbitrary binary data.
error handling scheme. The default for *errors* is ``'strict'``, meaning
error handling scheme. The default for *errors* is ``'strict'``, meaning
that encoding errors raise a :exc:`UnicodeError`. Other possible values are
that encoding errors raise a :exc:`UnicodeError`. Other possible values are
``'ignore'``, ``'replace'`` and any other name registered via
``'ignore'``, ``'replace'`` and any other name registered via
:func:`codecs.register_error`, see section :ref:`
codec-base-classe
s`. For a
:func:`codecs.register_error`, see section :ref:`
error-handler
s`. For a
list of possible encodings, see section :ref:`standard-encodings`.
list of possible encodings, see section :ref:`standard-encodings`.
.. note::
.. note::
...
...
Doc/library/tarfile.rst
Dosyayı görüntüle @
582acb75
...
@@ -798,7 +798,7 @@ metadata must be either decoded or encoded. If *encoding* is not set
...
@@ -798,7 +798,7 @@ metadata must be either decoded or encoded. If *encoding* is not set
appropriately, this conversion may fail.
appropriately, this conversion may fail.
The *errors* argument defines how characters are treated that cannot be
The *errors* argument defines how characters are treated that cannot be
converted. Possible values are listed in section :ref:`
codec-base-classe
s`.
converted. Possible values are listed in section :ref:`
error-handler
s`.
The default scheme is ``'surrogateescape'`` which Python also uses for its
The default scheme is ``'surrogateescape'`` which Python also uses for its
file system calls, see :ref:`os-filenames`.
file system calls, see :ref:`os-filenames`.
...
...
Lib/codecs.py
Dosyayı görüntüle @
582acb75
...
@@ -347,8 +347,7 @@ class StreamWriter(Codec):
...
@@ -347,8 +347,7 @@ class StreamWriter(Codec):
""" Creates a StreamWriter instance.
""" Creates a StreamWriter instance.
stream must be a file-like object open for writing
stream must be a file-like object open for writing.
(binary) data.
The StreamWriter may use different error handling
The StreamWriter may use different error handling
schemes by providing the errors keyword argument. These
schemes by providing the errors keyword argument. These
...
@@ -422,8 +421,7 @@ class StreamReader(Codec):
...
@@ -422,8 +421,7 @@ class StreamReader(Codec):
""" Creates a StreamReader instance.
""" Creates a StreamReader instance.
stream must be a file-like object open for reading
stream must be a file-like object open for reading.
(binary) data.
The StreamReader may use different error handling
The StreamReader may use different error handling
schemes by providing the errors keyword argument. These
schemes by providing the errors keyword argument. These
...
@@ -451,13 +449,12 @@ class StreamReader(Codec):
...
@@ -451,13 +449,12 @@ class StreamReader(Codec):
""" Decodes data from the stream self.stream and returns the
""" Decodes data from the stream self.stream and returns the
resulting object.
resulting object.
chars indicates the number of characters to read from the
chars indicates the number of decoded code points or bytes to
stream. read() will never return more than chars
return. read() will never return more data than requested,
characters, but it might return less, if there are not enough
but it might return less, if there is not enough available.
characters available.
size indicates the approximate maximum number of
bytes to
size indicates the approximate maximum number of
decoded
read from the stream for decoding purposes
. The decoder
bytes or code points to read for decoding
. The decoder
can modify this setting as appropriate. The default value
can modify this setting as appropriate. The default value
-1 indicates to read and decode as much as possible. size
-1 indicates to read and decode as much as possible. size
is intended to prevent having to decode huge files in one
is intended to prevent having to decode huge files in one
...
@@ -468,7 +465,7 @@ class StreamReader(Codec):
...
@@ -468,7 +465,7 @@ class StreamReader(Codec):
will be returned, the rest of the input will be kept until the
will be returned, the rest of the input will be kept until the
next call to read().
next call to read().
The method should use a greedy read strategy meaning that
The method should use a greedy read strategy
,
meaning that
it should read as much data as is allowed within the
it should read as much data as is allowed within the
definition of the encoding and the given size, e.g. if
definition of the encoding and the given size, e.g. if
optional encoding endings or state markers are available
optional encoding endings or state markers are available
...
@@ -603,7 +600,7 @@ class StreamReader(Codec):
...
@@ -603,7 +600,7 @@ class StreamReader(Codec):
def
readlines
(
self
,
sizehint
=
None
,
keepends
=
True
):
def
readlines
(
self
,
sizehint
=
None
,
keepends
=
True
):
""" Read all lines available on the input stream
""" Read all lines available on the input stream
and return them as
list of lines
.
and return them as
a list
.
Line breaks are implemented using the codec's decoder
Line breaks are implemented using the codec's decoder
method and are included in the list entries.
method and are included in the list entries.
...
@@ -751,19 +748,18 @@ class StreamReaderWriter:
...
@@ -751,19 +748,18 @@ class StreamReaderWriter:
class
StreamRecoder
:
class
StreamRecoder
:
""" StreamRecoder instances provide a frontend - backend
""" StreamRecoder instances translate data from one encoding to another.
view of encoding data.
They use the complete set of APIs returned by the
They use the complete set of APIs returned by the
codecs.lookup() function to implement their task.
codecs.lookup() function to implement their task.
Data written to the
stream
is first decoded into an
Data written to the
StreamRecoder
is first decoded into an
intermediate format (
which is dependent on the given codec
intermediate format (
depending on the "decode" codec) and then
combination) and then written to the stream using an instance
written to the underlying stream using an instance of the provided
of the provided
Writer class.
Writer class.
In the other direction, data is read from the
stream using a
In the other direction, data is read from the
underlying stream using
Reader instance and then return encoded data
to the caller.
a Reader instance and then encoded and returned
to the caller.
"""
"""
# Optional attributes set by the file wrappers below
# Optional attributes set by the file wrappers below
...
@@ -775,22 +771,17 @@ class StreamRecoder:
...
@@ -775,22 +771,17 @@ class StreamRecoder:
""" Creates a StreamRecoder instance which implements a two-way
""" Creates a StreamRecoder instance which implements a two-way
conversion: encode and decode work on the frontend (the
conversion: encode and decode work on the frontend (the
input to .read() and output of .write()) while
data visible to .read() and .write()) while Reader and Writer
Reader and Writer work on the backend (reading and
work on the backend (the data in stream).
writing to the stream).
You can use these objects to do transparent
direct
You can use these objects to do transparent
re
codings from e.g. latin-1 to utf-8 and back.
trans
codings from e.g. latin-1 to utf-8 and back.
stream must be a file-like object.
stream must be a file-like object.
encode
, decode must adhere to the Codec interface, Reader,
encode
and decode must adhere to the Codec interface; Reader and
Writer must be factory functions or classes providing the
Writer must be factory functions or classes providing the
StreamReader, StreamWriter interface resp.
StreamReader and StreamWriter interfaces resp.
encode and decode are needed for the frontend translation,
Reader and Writer for the backend translation. Unicode is
used as intermediate encoding.
Error handling is done in the same way as defined for the
Error handling is done in the same way as defined for the
StreamWriter/Readers.
StreamWriter/Readers.
...
@@ -865,7 +856,7 @@ class StreamRecoder:
...
@@ -865,7 +856,7 @@ class StreamRecoder:
### Shortcuts
### Shortcuts
def
open
(
filename
,
mode
=
'r
b
'
,
encoding
=
None
,
errors
=
'strict'
,
buffering
=
1
):
def
open
(
filename
,
mode
=
'r'
,
encoding
=
None
,
errors
=
'strict'
,
buffering
=
1
):
""" Open an encoded file using the given mode and return
""" Open an encoded file using the given mode and return
a wrapped version providing transparent encoding/decoding.
a wrapped version providing transparent encoding/decoding.
...
@@ -875,10 +866,8 @@ def open(filename, mode='rb', encoding=None, errors='strict', buffering=1):
...
@@ -875,10 +866,8 @@ def open(filename, mode='rb', encoding=None, errors='strict', buffering=1):
codecs. Output is also codec dependent and will usually be
codecs. Output is also codec dependent and will usually be
Unicode as well.
Unicode as well.
Files are always opened in binary mode, even if no binary mode
Underlying encoded files are always opened in binary mode.
was specified. This is done to avoid data loss due to encodings
The default file mode is 'r', meaning to open the file in read mode.
using 8-bit values. The default file mode is 'rb' meaning to
open the file in binary read mode.
encoding specifies the encoding which is to be used for the
encoding specifies the encoding which is to be used for the
file.
file.
...
@@ -914,13 +903,13 @@ def EncodedFile(file, data_encoding, file_encoding=None, errors='strict'):
...
@@ -914,13 +903,13 @@ def EncodedFile(file, data_encoding, file_encoding=None, errors='strict'):
""" Return a wrapped version of file which provides transparent
""" Return a wrapped version of file which provides transparent
encoding translation.
encoding translation.
Strings written to the wrapped file are interpret
ed according
Data written to the wrapped file is decod
ed according
to the given data_encoding and then
written to the original
to the given data_encoding and then
encoded to the underlying
file
as string using file_encoding. The intermediate encoding
file
using file_encoding. The intermediate data type
will usually be Unicode but depends on the specified codecs.
will usually be Unicode but depends on the specified codecs.
Strings are read from the file
using file_encoding and then
Bytes read from the file are decoded
using file_encoding and then
passed back to the caller
as string
using data_encoding.
passed back to the caller
encoded
using data_encoding.
If file_encoding is not given, it defaults to data_encoding.
If file_encoding is not given, it defaults to data_encoding.
...
...
Lib/test/test_codecs.py
Dosyayı görüntüle @
582acb75
...
@@ -1140,6 +1140,8 @@ class RecodingTest(unittest.TestCase):
...
@@ -1140,6 +1140,8 @@ class RecodingTest(unittest.TestCase):
# Python used to crash on this at exit because of a refcount
# Python used to crash on this at exit because of a refcount
# bug in _codecsmodule.c
# bug in _codecsmodule.c
self
.
assertTrue
(
f
.
closed
)
# From RFC 3492
# From RFC 3492
punycode_testcases
=
[
punycode_testcases
=
[
# A Arabic (Egyptian):
# A Arabic (Egyptian):
...
@@ -1592,6 +1594,16 @@ class IDNACodecTest(unittest.TestCase):
...
@@ -1592,6 +1594,16 @@ class IDNACodecTest(unittest.TestCase):
self
.
assertEqual
(
encoder
.
encode
(
"ample.org."
),
b
"xn--xample-9ta.org."
)
self
.
assertEqual
(
encoder
.
encode
(
"ample.org."
),
b
"xn--xample-9ta.org."
)
self
.
assertEqual
(
encoder
.
encode
(
""
,
True
),
b
""
)
self
.
assertEqual
(
encoder
.
encode
(
""
,
True
),
b
""
)
def
test_errors
(
self
):
"""Only supports "strict" error handler"""
"python.org"
.
encode
(
"idna"
,
"strict"
)
b
"python.org"
.
decode
(
"idna"
,
"strict"
)
for
errors
in
(
"ignore"
,
"replace"
,
"backslashreplace"
,
"surrogateescape"
):
self
.
assertRaises
(
Exception
,
"python.org"
.
encode
,
"idna"
,
errors
)
self
.
assertRaises
(
Exception
,
b
"python.org"
.
decode
,
"idna"
,
errors
)
class
CodecsModuleTest
(
unittest
.
TestCase
):
class
CodecsModuleTest
(
unittest
.
TestCase
):
def
test_decode
(
self
):
def
test_decode
(
self
):
...
@@ -1682,6 +1694,24 @@ class CodecsModuleTest(unittest.TestCase):
...
@@ -1682,6 +1694,24 @@ class CodecsModuleTest(unittest.TestCase):
for
api
in
codecs
.
__all__
:
for
api
in
codecs
.
__all__
:
getattr
(
codecs
,
api
)
getattr
(
codecs
,
api
)
def
test_open
(
self
):
self
.
addCleanup
(
support
.
unlink
,
support
.
TESTFN
)
for
mode
in
(
'w'
,
'r'
,
'r+'
,
'w+'
,
'a'
,
'a+'
):
with
self
.
subTest
(
mode
),
\
codecs
.
open
(
support
.
TESTFN
,
mode
,
'ascii'
)
as
file
:
self
.
assertIsInstance
(
file
,
codecs
.
StreamReaderWriter
)
def
test_undefined
(
self
):
self
.
assertRaises
(
UnicodeError
,
codecs
.
encode
,
'abc'
,
'undefined'
)
self
.
assertRaises
(
UnicodeError
,
codecs
.
decode
,
b
'abc'
,
'undefined'
)
self
.
assertRaises
(
UnicodeError
,
codecs
.
encode
,
''
,
'undefined'
)
self
.
assertRaises
(
UnicodeError
,
codecs
.
decode
,
b
''
,
'undefined'
)
for
errors
in
(
'strict'
,
'ignore'
,
'replace'
,
'backslashreplace'
):
self
.
assertRaises
(
UnicodeError
,
codecs
.
encode
,
'abc'
,
'undefined'
,
errors
)
self
.
assertRaises
(
UnicodeError
,
codecs
.
decode
,
b
'abc'
,
'undefined'
,
errors
)
class
StreamReaderTest
(
unittest
.
TestCase
):
class
StreamReaderTest
(
unittest
.
TestCase
):
def
setUp
(
self
):
def
setUp
(
self
):
...
@@ -1815,13 +1845,10 @@ if hasattr(codecs, "mbcs_encode"):
...
@@ -1815,13 +1845,10 @@ if hasattr(codecs, "mbcs_encode"):
# "undefined"
# "undefined"
# The following encodings don't work in stateful mode
# The following encodings don't work in stateful mode
broken_unicode_with_st
reams
=
[
broken_unicode_with_st
ateful
=
[
"punycode"
,
"punycode"
,
"unicode_internal"
"unicode_internal"
]
]
broken_incremental_coders
=
broken_unicode_with_streams
+
[
"idna"
,
]
class
BasicUnicodeTest
(
unittest
.
TestCase
,
MixInCheckStateHandling
):
class
BasicUnicodeTest
(
unittest
.
TestCase
,
MixInCheckStateHandling
):
def
test_basics
(
self
):
def
test_basics
(
self
):
...
@@ -1841,7 +1868,7 @@ class BasicUnicodeTest(unittest.TestCase, MixInCheckStateHandling):
...
@@ -1841,7 +1868,7 @@ class BasicUnicodeTest(unittest.TestCase, MixInCheckStateHandling):
(
chars
,
size
)
=
codecs
.
getdecoder
(
encoding
)(
b
)
(
chars
,
size
)
=
codecs
.
getdecoder
(
encoding
)(
b
)
self
.
assertEqual
(
chars
,
s
,
"encoding=
%
r"
%
encoding
)
self
.
assertEqual
(
chars
,
s
,
"encoding=
%
r"
%
encoding
)
if
encoding
not
in
broken_unicode_with_st
reams
:
if
encoding
not
in
broken_unicode_with_st
ateful
:
# check stream reader/writer
# check stream reader/writer
q
=
Queue
(
b
""
)
q
=
Queue
(
b
""
)
writer
=
codecs
.
getwriter
(
encoding
)(
q
)
writer
=
codecs
.
getwriter
(
encoding
)(
q
)
...
@@ -1859,7 +1886,7 @@ class BasicUnicodeTest(unittest.TestCase, MixInCheckStateHandling):
...
@@ -1859,7 +1886,7 @@ class BasicUnicodeTest(unittest.TestCase, MixInCheckStateHandling):
decodedresult
+=
reader
.
read
()
decodedresult
+=
reader
.
read
()
self
.
assertEqual
(
decodedresult
,
s
,
"encoding=
%
r"
%
encoding
)
self
.
assertEqual
(
decodedresult
,
s
,
"encoding=
%
r"
%
encoding
)
if
encoding
not
in
broken_
incremental_coders
:
if
encoding
not
in
broken_
unicode_with_stateful
:
# check incremental decoder/encoder and iterencode()/iterdecode()
# check incremental decoder/encoder and iterencode()/iterdecode()
try
:
try
:
encoder
=
codecs
.
getincrementalencoder
(
encoding
)()
encoder
=
codecs
.
getincrementalencoder
(
encoding
)()
...
@@ -1908,7 +1935,7 @@ class BasicUnicodeTest(unittest.TestCase, MixInCheckStateHandling):
...
@@ -1908,7 +1935,7 @@ class BasicUnicodeTest(unittest.TestCase, MixInCheckStateHandling):
from
_testcapi
import
codec_incrementalencoder
,
codec_incrementaldecoder
from
_testcapi
import
codec_incrementalencoder
,
codec_incrementaldecoder
s
=
"abc123"
# all codecs should be able to encode these
s
=
"abc123"
# all codecs should be able to encode these
for
encoding
in
all_unicode_encodings
:
for
encoding
in
all_unicode_encodings
:
if
encoding
not
in
broken_
incremental_coders
:
if
encoding
not
in
broken_
unicode_with_stateful
:
# check incremental decoder/encoder (fetched via the C API)
# check incremental decoder/encoder (fetched via the C API)
try
:
try
:
cencoder
=
codec_incrementalencoder
(
encoding
)
cencoder
=
codec_incrementalencoder
(
encoding
)
...
@@ -1948,7 +1975,7 @@ class BasicUnicodeTest(unittest.TestCase, MixInCheckStateHandling):
...
@@ -1948,7 +1975,7 @@ class BasicUnicodeTest(unittest.TestCase, MixInCheckStateHandling):
for
encoding
in
all_unicode_encodings
:
for
encoding
in
all_unicode_encodings
:
if
encoding
==
"idna"
:
# FIXME: See SF bug #1163178
if
encoding
==
"idna"
:
# FIXME: See SF bug #1163178
continue
continue
if
encoding
in
broken_unicode_with_st
reams
:
if
encoding
in
broken_unicode_with_st
ateful
:
continue
continue
reader
=
codecs
.
getreader
(
encoding
)(
io
.
BytesIO
(
s
.
encode
(
encoding
)))
reader
=
codecs
.
getreader
(
encoding
)(
io
.
BytesIO
(
s
.
encode
(
encoding
)))
for
t
in
range
(
5
):
for
t
in
range
(
5
):
...
@@ -1981,7 +2008,7 @@ class BasicUnicodeTest(unittest.TestCase, MixInCheckStateHandling):
...
@@ -1981,7 +2008,7 @@ class BasicUnicodeTest(unittest.TestCase, MixInCheckStateHandling):
# Check that getstate() and setstate() handle the state properly
# Check that getstate() and setstate() handle the state properly
u
=
"abc123"
u
=
"abc123"
for
encoding
in
all_unicode_encodings
:
for
encoding
in
all_unicode_encodings
:
if
encoding
not
in
broken_
incremental_coders
:
if
encoding
not
in
broken_
unicode_with_stateful
:
self
.
check_state_handling_decode
(
encoding
,
u
,
u
.
encode
(
encoding
))
self
.
check_state_handling_decode
(
encoding
,
u
,
u
.
encode
(
encoding
))
self
.
check_state_handling_encode
(
encoding
,
u
,
u
.
encode
(
encoding
))
self
.
check_state_handling_encode
(
encoding
,
u
,
u
.
encode
(
encoding
))
...
@@ -2185,6 +2212,7 @@ class WithStmtTest(unittest.TestCase):
...
@@ -2185,6 +2212,7 @@ class WithStmtTest(unittest.TestCase):
f
=
io
.
BytesIO
(
b
"
\xc3\xbc
"
)
f
=
io
.
BytesIO
(
b
"
\xc3\xbc
"
)
with
codecs
.
EncodedFile
(
f
,
"latin-1"
,
"utf-8"
)
as
ef
:
with
codecs
.
EncodedFile
(
f
,
"latin-1"
,
"utf-8"
)
as
ef
:
self
.
assertEqual
(
ef
.
read
(),
b
"
\xfc
"
)
self
.
assertEqual
(
ef
.
read
(),
b
"
\xfc
"
)
self
.
assertTrue
(
f
.
closed
)
def
test_streamreaderwriter
(
self
):
def
test_streamreaderwriter
(
self
):
f
=
io
.
BytesIO
(
b
"
\xc3\xbc
"
)
f
=
io
.
BytesIO
(
b
"
\xc3\xbc
"
)
...
...
Misc/NEWS
Dosyayı görüntüle @
582acb75
...
@@ -1441,6 +1441,10 @@ C API
...
@@ -1441,6 +1441,10 @@ C API
Documentation
Documentation
-------------
-------------
- Issue #19548: Update the codecs module documentation to better cover the
distinction between text encodings and other codecs, together with other
clarifications. Patch by Martin Panter.
- Issue #22394: Doc/Makefile now supports ``make venv PYTHON=../python`` to
- Issue #22394: Doc/Makefile now supports ``make venv PYTHON=../python`` to
create a venv for generating the documentation, e.g.,
create a venv for generating the documentation, e.g.,
``make html PYTHON=venv/bin/python3``.
``make html PYTHON=venv/bin/python3``.
...
@@ -1477,6 +1481,10 @@ Documentation
...
@@ -1477,6 +1481,10 @@ Documentation
Tests
Tests
-----
-----
- Issue #19548: Added some additional checks to test_codecs to ensure that
statements in the updated documentation remain accurate. Patch by Martin
Panter.
- Issue #22838: All test_re tests now work with unittest test discovery.
- Issue #22838: All test_re tests now work with unittest test discovery.
- Issue #22173: Update lib2to3 tests to use unittest test discovery.
- Issue #22173: Update lib2to3 tests to use unittest test discovery.
...
...
Modules/_codecsmodule.c
Dosyayı görüntüle @
582acb75
...
@@ -54,9 +54,9 @@ PyDoc_STRVAR(register__doc__,
...
@@ -54,9 +54,9 @@ PyDoc_STRVAR(register__doc__,
"register(search_function)
\n
\
"register(search_function)
\n
\
\n
\
\n
\
Register a codec search function. Search functions are expected to take
\n
\
Register a codec search function. Search functions are expected to take
\n
\
one argument, the encoding name in all lower case letters, and
return
\n
\
one argument, the encoding name in all lower case letters, and
either
\n
\
a tuple of functions (encoder, decoder, stream_reader, stream_writer)
\n
\
return None, or a tuple of functions (encoder, decoder, stream_reader,
\n
\
(or a CodecInfo object)."
);
stream_writer)
(or a CodecInfo object)."
);
static
static
PyObject
*
codec_register
(
PyObject
*
self
,
PyObject
*
search_function
)
PyObject
*
codec_register
(
PyObject
*
self
,
PyObject
*
search_function
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment