Kayıtlar (commit) · a6a21fabbcecb163c14110b0d62a3ad053f97f7e · Batuhan Osman TASKAYA / cpython

21 Tem, 2007 1 kayıt (commit)
- PEP 3123: Provide forward compatibility with Python 3.0, while keeping · 6819210b
  Martin v. Löwis 17 years ago yazdı
```
backwards compatibility. Add Py_Refcnt, Py_Type, Py_Size, and
PyVarObject_HEAD_INIT.
```
  6819210b
25 Şub, 2007 1 kayıt (commit)

Variation of patch # 1624059 to speed up checking if an object is a subclass · ee3a1b52

Neal Norwitz 18 years ago yazdı

of some of the common builtin types.

Use a bit in tp_flags for each common builtin type.  Check the bit
to determine if any instance is a subclass of these common types.
The check avoids a function call and O(n) search of the base classes.
The check is done in the various Py*_Check macros rather than calling
PyType_IsSubtype().

All the bits are set in tp_flags when the type is declared
in the Objects/*object.c files because PyType_Ready() is not called
for all the types.  Should PyType_Ready() be called for all types?
If so and the change is made, the changes to the Objects/*object.c files
can be reverted (remove setting the tp_flags).  Objects/typeobject.c
would also have to be modified to add conditions
for Py*_CheckExact() in addition to each the PyType_IsSubtype check.

ee3a1b52

14 Agu, 2006 1 kayıt (commit)

Slightly revised version of patch #1538956: · 040f76b7

Marc-André Lemburg 18 years ago yazdı

Replace UnicodeDecodeErrors raised during == and !=
compares of Unicode and other objects with a new
UnicodeWarning.

All other comparisons continue to raise exceptions.
Exceptions other than UnicodeDecodeErrors are also left
untouched.

040f76b7

14 Haz, 2006 1 kayıt (commit)
- Patch #1455898: Incremental mode for "mbcs" codec. · d825143b
  Martin v. Löwis 18 years ago yazdı
  
  d825143b
04 Haz, 2006 1 kayıt (commit)
- Patch #1359618: Speed-up charmap encoder. · 3f767795
  Martin v. Löwis 18 years ago yazdı
  
  3f767795
28 May, 2006 1 kayıt (commit)
- needforspeed: added Py_MEMCPY macro (currently tuned for Visual C only), · 80f8e80c
  Fredrik Lundh 18 years ago yazdı
```
and use it for string copy operations.  this gives a 20% speedup on some
string benchmarks.
```
  80f8e80c
26 May, 2006 2 kayıt (commit)
- needforspeed: added rpartition implementation · b3167cbc
  Fredrik Lundh 18 years ago yazdı
  
  b3167cbc
- needforspeed: partition implementation, part two. · 06a69dd8
  Fredrik Lundh 18 years ago yazdı
```
feel free to improve the documentation and the docstrings.
```
  06a69dd8
23 May, 2006 1 kayıt (commit)
- needforspeed: check first *and* last character before doing a full memcmp · 3d885e01
  Fredrik Lundh 18 years ago yazdı
  
  3d885e01
22 May, 2006 2 kayıt (commit)
- needforspeed: use memcpy for "long" strings; use a better algorithm · 8a8e05a2
  Fredrik Lundh 18 years ago yazdı
```
for long repeats.
```
  8a8e05a2
- needforspeed: speed up unicode repeat, unicode string copy · f1d60a53
  Fredrik Lundh 18 years ago yazdı
  
  f1d60a53
15 Şub, 2006 1 kayıt (commit)
- Merge ssize_t branch. · 18e16555
  Martin v. Löwis 19 years ago yazdı
  
  18e16555
29 Eki, 2005 1 kayıt (commit)

_PyUnicode_IsWhitespace(), · 2576c97f

Tim Peters 19 years ago yazdı

_PyUnicode_IsLinebreak():
Changed the declarations to match the definitions.

Don't know why they differed; MSVC warned about it;
don't know why only these two functions use "const".
Someone who does may want to do something saner ;-).

2576c97f

30 Agu, 2005 1 kayıt (commit)

SF bug #1251300: On UCS-4 builds the "unicode-internal" codec will now complain · a47d1c08

Walter Dörwald 19 years ago yazdı

about illegal code points. The codec now supports PEP 293 style error handlers.
(This is a variant of the Nik Haldimann's patch that detects truncated data)

a47d1c08

22 Kas, 2004 1 kayıt (commit)

Correct the handling of 0-termination of PyUnicode_AsWideChar() · a9cadcd4

Marc-André Lemburg 20 years ago yazdı

and its usage in PyLocale_strcoll().

Clarify the documentation on this.

Thanks to Andreas Degert for pointing this out.

a9cadcd4

31 Eki, 2004 1 kayıt (commit)
- SF patch #1056231: typo in comment (unicodeobject.h) · 57341c37
  Raymond Hettinger 20 years ago yazdı
  
  57341c37
07 Eyl, 2004 1 kayıt (commit)

SF patch #998993: The UTF-8 and the UTF-16 stateful decoders now support · 69652035

Walter Dörwald 20 years ago yazdı

decoding incomplete input (when the input stream is temporarily exhausted).
codecs.StreamReader now implements buffering, which enables proper
readline support for the UTF-16 decoders. codecs.StreamReader.read()
has a new argument chars which specifies the number of characters to
return. codecs.StreamReader.readline() and codecs.StreamReader.readlines()
have a new argument keepends. Trailing "\n"s will be stripped from the lines
if keepends is false. Added C APIs PyUnicode_DecodeUTF8Stateful and
PyUnicode_DecodeUTF16Stateful.

69652035

04 Agu, 2004 1 kayıt (commit)

SF #989185: Drop unicode.iswide() and unicode.width() and add · e9ddfbb4

Hye-Shik Chang 20 years ago yazdı

unicodedata.east_asian_width().  You can still implement your own
simple width() function using it like this:
    def width(u):
        w = 0
        for c in unicodedata.normalize('NFC', u):
            cwidth = unicodedata.east_asian_width(c)
            if cwidth in ('W', 'F'): w += 2
            else: w += 1
        return w

e9ddfbb4

08 Tem, 2004 1 kayıt (commit)
- Allow string and unicode return types from .encode()/.decode() · d2d4598e
  Marc-André Lemburg 20 years ago yazdı
```
methods on string and unicode objects. Added unicode.decode()
which was missing for no apparent reason.
```
  d2d4598e
02 Haz, 2004 1 kayıt (commit)

- SF #962502: Add two more methods for unicode type; width() and · 974ed7cf

Hye-Shik Chang 20 years ago yazdı

iswide() for east asian width manipulation. (Inspired by David
Goodger, Reviewed by Martin v. Loewis)
- Move _PyUnicode_TypeRecord.flags to the end of the struct so that
no padding is added for UCS-4 builds. (Suggested by Martin v. Loewis)

974ed7cf

15 Ara, 2003 1 kayıt (commit)
- Add rsplit method for str and unicode builtin types. · 3ae811b5
  Hye-Shik Chang 21 years ago yazdı
```
SF feature request #801847.
Original patch is written by Sean Reifschneider.
```
  3ae811b5
12 Agu, 2002 2 kayıt (commit)
- Add name mangling for new PyUnicode_FromOrdinal() and fix declaration · 9c329de4
  Marc-André Lemburg 22 years ago yazdı
```
to use new extern macro.
```
  9c329de4
- Excise DL_EXPORT from Include. · 91a681de
  Mark Hammond 22 years ago yazdı
```
Thanks to Skip Montanaro and Kalle Svensson for the patches.
```
  91a681de
11 Agu, 2002 1 kayıt (commit)

Add C API PyUnicode_FromOrdinal() which exposes unichr() at C level. · cc8764ca

Marc-André Lemburg 22 years ago yazdı

u'%c' will now raise a ValueError in case the argument is an
integer outside the valid range of Unicode code point ordinals.

Closes SF bug #593581.

cc8764ca

29 May, 2002 1 kayıt (commit)
- Fix for bug [ 561796 ] string.find causes lazy error · 4da6fd63
  Marc-André Lemburg 22 years ago yazdı
  
  4da6fd63
22 Nis, 2002 1 kayıt (commit)

Apply patch diff.txt from SF feature request · de02bcb2

Walter Dörwald 22 years ago yazdı

http://www.python.org/sf/444708

This adds the optional argument for str.strip
to unicode.strip too and makes it possible
to call str.strip with a unicode argument
and unicode.strip with a str argument.

de02bcb2

19 Eki, 2001 1 kayıt (commit)

SF patch #470578: Fixes to synchronize unicode() and str() · b8c65bc2

Guido van Rossum 23 years ago yazdı

    This patch implements what we have discussed on python-dev late in
    September: str(obj) and unicode(obj) should behave similar, while
    the old behaviour is retained for unicode(obj, encoding, errors).

    The patch also adds a new feature with which objects can provide
    unicode(obj) with input data: the __unicode__ method. Currently no
    new tp_unicode slot is implemented; this is left as option for the
    future.

    Note that PyUnicode_FromEncodedObject() no longer accepts Unicode
    objects as input. The API name already suggests that Unicode
    objects do not belong in the list of acceptable objects and the
    functionality was only needed because
    PyUnicode_FromEncodedObject() was being used directly by
    unicode(). The latter was changed in the discussed way:

    * unicode(obj) calls PyObject_Unicode()
    * unicode(obj, encoding, errors) calls PyUnicode_FromEncodedObject()

    One thing left open to discussion is whether to leave the
    PyUnicode_FromObject() API as a thin API extension on top of
    PyUnicode_FromEncodedObject() or to turn it into a (macro) alias
    for PyObject_Unicode() and deprecate it. Doing so would have some
    surprising consequences though, e.g.  u"abc" + 123 would turn out
    as u"abc123"...

[Marc-Andre didn't have time to check this in before the deadline.  I
hope this is OK, Marc-Andre!  You can still make changes and commit
them on the trunk after the branch has been made, but then please mail
Barry a context diff if you want the change to be merged into the
2.2b1 release branch.  GvR]

b8c65bc2

20 Eyl, 2001 1 kayıt (commit)
- Patch #435971: UTF-7 codec by Brian Quinlan. · c60e6f77
  Marc-André Lemburg 23 years ago yazdı
  
  c60e6f77
19 Eyl, 2001 1 kayıt (commit)
- Fix for bug #462737. · 5e6007c5
  Marc-André Lemburg 23 years ago yazdı
  
  5e6007c5
11 Eyl, 2001 1 kayıt (commit)

Possibly the end of SF [#460020] bug or feature: unicode() and subclasses. · 78e0fc74

Tim Peters 23 years ago yazdı

Changed unicode(i) to return a true Unicode object when i is an instance of
a unicode subclass.  Added PyUnicode_CheckExact macro.

78e0fc74

30 Agu, 2001 1 kayıt (commit)
- Make the Py<type>_Check() macro use PyObject_TypeCheck(). · 5eef77a2
  Guido van Rossum 23 years ago yazdı
  
  5eef77a2
17 Agu, 2001 1 kayıt (commit)

Patch #445762: Support --disable-unicode · 339d0f72

Martin v. Löwis 23 years ago yazdı

- Do not compile unicodeobject, unicodectype, and unicodedata if Unicode is disabled
- check for Py_USING_UNICODE in all places that use Unicode functions
- disables unicode literals, and the builtin functions
- add the types.StringTypes list
- remove Unicode literals from most tests.

339d0f72

09 Agu, 2001 1 kayıt (commit)

SF patch #438013 Remove 2-byte Py_UCS2 assumptions · 772747b3

Tim Peters 23 years ago yazdı

Removed all instances of Py_UCS2 from the codebase, and so also (I hope)
the last remaining reliance on the platform having an integral type
with exactly 16 bits.
PyUnicode_DecodeUTF16() and PyUnicode_EncodeUTF16() now read and write
one byte at a time.

772747b3

31 Tem, 2001 1 kayıt (commit)
- As discussed on python-dev: this patch adds name mangling to · b5ac6f62
  Marc-André Lemburg 23 years ago yazdı
```
assure that extensions and interpreters using the Unicode APIs
were compiled using the same Unicode width.
```
  b5ac6f62
30 Tem, 2001 1 kayıt (commit)

Add _PyUnicode_AsDefaultEncodedString to unicodeobject.h. · 3ce45389

Jeremy Hylton 23 years ago yazdı

And remove all the extern decls in the middle of .c files.
Apparently, it was excluded from the header file because it is
intended for internal use by the interpreter. It's still intended for
internal use and documented as such in the header file.

3ce45389

27 Haz, 2001 3 kayıt (commit)
- removed "register const" from scalar arguments to the unicode · 72b06856
  Fredrik Lundh 23 years ago yazdı
```
predicates
```
  72b06856
- use Py_UNICODE_WIDE instead of USE_UCS4_STORAGE and Py_UNICODE_SIZE · 8f455858
  Fredrik Lundh 23 years ago yazdı
```
tests.
```
  8f455858
- Encode surrogates in UTF-8 even for a wide Py_UNICODE. · ce9b5a55
  Martin v. Löwis 23 years ago yazdı
```
Implement sys.maxunicode.
Explicitly wrap around upper/lower computations for wide Py_UNICODE.
When decoding large characters with UTF-8, represent expected test
results using the \U notation.
```
  ce9b5a55
26 Haz, 2001 2 kayıt (commit)

Make Unicode work a bit better on Windows... · 9b14ab36
Fredrik Lundh 23 years ago yazdı

9b14ab36

Support using UCS-4 as the Py_UNICODE type: · 0ba70cc3

Martin v. Löwis 23 years ago yazdı

Add configure option --enable-unicode.
Add config.h macros Py_USING_UNICODE, PY_UNICODE_TYPE, Py_UNICODE_SIZE,
                    SIZEOF_WCHAR_T.
Define Py_UCS2.
Encode and decode large UTF-8 characters into single Py_UNICODE values
for wide Unicode types; likewise for UTF-16.
Remove test whether sizeof Py_UNICODE is two.

0ba70cc3