Issue #6697: _lsprof: normalizeUserObj() doesn't encode/decode (UTF-8) the

module name anymore, only work on unicode strings. Therefore it doesn't truncate module names with embedded NUL characters, or fail if the module name contains surrogate characters (UTF-8 encoder fails on a surrogate character). Patch written by Alexander Belopolsky.

Issue #6697: _lsprof: normalizeUserObj() doesn't encode/decode (UTF-8) the
module name anymore, only work on unicode strings. Therefore it doesn't truncate module names with embedded NUL characters, or fail if the module name contains surrogate characters (UTF-8 encoder fails on a surrogate character). Patch written by Alexander Belopolsky.
7edb5dfc · Victor Stinner · 99563b1d · 7edb5dfc
Kaydet (Commit) 7edb5dfc authored Haz 20, 2011 tarafından Victor Stinner
Hide whitespace changes
Inline Side-by-side

Showing with 19 additions and 26 deletions

_lsprof.c Modules/_lsprof.c +19 -26

No files found.
--- a/Modules/_lsprof.c
+++ b/Modules/_lsprof.c
@@ -176,36 +176,29 @@ normalizeUserObj(PyObject *obj)
    if (fn->m_self == NULL) {
        /* built-in function: look up the module name */
        PyObject *mod = fn->m_module;
-        const char *modname;
+        PyObject *modname = NULL;
-        if (mod && PyUnicode_Check(mod)) {
+        if (mod != NULL) {
-            /* XXX: The following will truncate module names with embedded
+            if (PyUnicode_Check(mod)) {
-             * null-characters.  It is unlikely that this can happen in
+                modname = mod;
-             * practice and the concequences are not serious enough to
+                Py_INCREF(modname);
-             * introduce extra checks here.
-             */
-            modname = _PyUnicode_AsString(mod);
-            if (modname == NULL) {
-                modname = "<encoding error>";
-                PyErr_Clear();
            }
-        }
+            else if (PyModule_Check(mod)) {
-        else if (mod && PyModule_Check(mod)) {
+                modname = PyModule_GetNameObject(mod);
-            modname = PyModule_GetName(mod);
+                if (modname == NULL)
-            if (modname == NULL) {
+                    PyErr_Clear();
-                PyErr_Clear();
-                modname = "builtins";
            }
        }
-        else {
+        if (modname != NULL) {
-            modname = "builtins";
+            if (PyUnicode_CompareWithASCIIString(modname, "builtins") != 0) {
+                PyObject *result;
+                result = PyUnicode_FromFormat("<%U.%s>", modname,
+                                              fn->m_ml->ml_name);
+                Py_DECREF(modname);
+                return result;
+            }
+            Py_DECREF(modname);
        }
-        if (strcmp(modname, "builtins") != 0)
+        return PyUnicode_FromFormat("<%s>", fn->m_ml->ml_name);
-            return PyUnicode_FromFormat("<%s.%s>",
-                                        modname,
-                                        fn->m_ml->ml_name);
-        else
-            return PyUnicode_FromFormat("<%s>",
-                                        fn->m_ml->ml_name);
    }
    else {
        /* built-in method: try to return