Batuhan Osman TASKAYA / cpython
Commit 3df02dbc
Authored Nov 23, 2017 by Berker Peksag
Committed by Raymond Hettinger, Nov 23, 2017
bpo-31325: Fix usage of namedtuple in RobotFileParser.parse() (#4529)
Parent: 0858495a
Showing 4 changed files with 19 additions and 12 deletions
Doc/library/urllib.robotparser.rst (+4 -4)
Lib/test/test_robotparser.py (+6 -3)
Lib/urllib/robotparser.py (+4 -5)
...S.d/next/Library/2017-11-23-22-12-11.bpo-31325.8jAUxN.rst (+5 -0)
Doc/library/urllib.robotparser.rst
@@ -69,10 +69,10 @@ structure of :file:`robots.txt` files, see http://www.robotstxt.org/orig.html.
    .. method:: request_rate(useragent)

       Returns the contents of the ``Request-rate`` parameter from
-      ``robots.txt`` in the form of a :func:`~collections.namedtuple`
-      ``(requests, seconds)``.  If there is no such parameter or it doesn't
-      apply to the *useragent* specified or the ``robots.txt`` entry for this
-      parameter has invalid syntax, return ``None``.
+      ``robots.txt`` as a :term:`named tuple` ``RequestRate(requests, seconds)``.
+      If there is no such parameter or it doesn't apply to the *useragent*
+      specified or the ``robots.txt`` entry for this parameter has invalid
+      syntax, return ``None``.

       .. versionadded:: 3.6
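As a rough illustration of the documented behaviour, the sketch below parses an in-memory robots.txt and reads the rate back through `request_rate()`. The file contents and the agent name `otherbot` are made up for the example (`figtree` is borrowed from the test data in this commit), and it assumes a Python version that already includes this fix.

```python
import urllib.robotparser

# Hypothetical robots.txt content, used only for illustration.
ROBOTS_TXT = """\
User-agent: figtree
Crawl-delay: 3
Request-rate: 9/30
Disallow: /tmp

User-agent: *
Disallow: /private
"""

parser = urllib.robotparser.RobotFileParser()
# parse() accepts an iterable of lines, so no network access is needed here.
parser.parse(ROBOTS_TXT.splitlines())

# The matching entry has a Request-rate line, so a RequestRate is returned.
rate = parser.request_rate("figtree")
print(rate.requests, rate.seconds)

# A named tuple still behaves like a plain tuple, so it can be unpacked.
requests, seconds = rate

# The catch-all entry has no Request-rate line, so None comes back.
missing = parser.request_rate("otherbot")
print(missing)
```

Callers that only need the two numbers can unpack the result directly, but should check for `None` first, since a missing or malformed `Request-rate` line is not an error.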
Lib/test/test_robotparser.py
@@ -3,7 +3,6 @@ import os
 import threading
 import unittest
 import urllib.robotparser
-from collections import namedtuple
 from test import support
 from http.server import BaseHTTPRequestHandler, HTTPServer
@@ -87,6 +86,10 @@ class BaseRequestRateTest(BaseRobotTest):
                         self.parser.crawl_delay(agent), self.crawl_delay
                     )
                 if self.request_rate:
+                    self.assertIsInstance(
+                        self.parser.request_rate(agent),
+                        urllib.robotparser.RequestRate
+                    )
                     self.assertEqual(
                         self.parser.request_rate(agent).requests,
                         self.request_rate.requests
@@ -108,7 +111,7 @@ Disallow: /a%2fb.html
 Disallow: /%7ejoe/index.html
     """
     agent = 'figtree'
-    request_rate = namedtuple('req_rate', 'requests seconds')(9, 30)
+    request_rate = urllib.robotparser.RequestRate(9, 30)
     crawl_delay = 3
     good = [('figtree', '/foo.html')]
     bad = ['/tmp', '/tmp.html', '/tmp/a.html', '/a%3cd.html', '/a%3Cd.html',
@@ -237,7 +240,7 @@ Crawl-delay: 1
 Request-rate: 3/15
 Disallow: /cyberworld/map/
     """
-    request_rate = namedtuple('req_rate', 'requests seconds')(3, 15)
+    request_rate = urllib.robotparser.RequestRate(3, 15)
     crawl_delay = 1
     good = ['/', '/test.html']
     bad = ['/cyberworld/map/index.html']
Lib/urllib/robotparser.py
@@ -16,6 +16,9 @@ import urllib.request

 __all__ = ["RobotFileParser"]

+RequestRate = collections.namedtuple("RequestRate", "requests seconds")
+
+
 class RobotFileParser:
     """ This class provides a set of methods to read, parse and answer
     questions about a single robots.txt file.
@@ -136,11 +139,7 @@ class RobotFileParser:
                         # check if all values are sane
                         if (len(numbers) == 2 and numbers[0].strip().isdigit()
                             and numbers[1].strip().isdigit()):
-                            req_rate = collections.namedtuple('req_rate',
-                                                              'requests seconds')
-                            entry.req_rate = req_rate
-                            entry.req_rate.requests = int(numbers[0])
-                            entry.req_rate.seconds = int(numbers[1])
+                            entry.req_rate = RequestRate(int(numbers[0]), int(numbers[1]))
                         state = 2
         if state == 2:
             self._add_entry(entry)
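To see why the removed lines were a problem, here is a small standalone sketch that does not touch `urllib.robotparser` itself. The names `old_rate` and `new_rate` are illustrative only: the first half mimics the removed pattern, the second half mimics the replacement.

```python
import collections

# Pre-fix pattern: the namedtuple *class* itself is stored, and its
# attributes are then overwritten with plain ints. No instance is created.
old_rate = collections.namedtuple("req_rate", "requests seconds")
old_rate.requests = 3
old_rate.seconds = 15

# Post-fix pattern: one shared class, instantiated once per parsed entry.
RequestRate = collections.namedtuple("RequestRate", "requests seconds")
new_rate = RequestRate(3, 15)

# Attribute access happens to work in both cases...
print(old_rate.requests, new_rate.requests)

# ...but only the instance is actually a tuple.
print(isinstance(old_rate, tuple), isinstance(new_rate, tuple))

# Unpacking works for the instance and fails for the class.
requests, seconds = new_rate
try:
    requests_old, seconds_old = old_rate
    unpack_failed = False
except TypeError:
    unpack_failed = True
print(unpack_failed)
```

This is also why the updated test adds an `assertIsInstance` check: comparing only `.requests` and `.seconds` would pass against the old, class-based value.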
Misc/NEWS.d/next/Library/2017-11-23-22-12-11.bpo-31325.8jAUxN.rst
New file (mode 0 → 100644)
Fix wrong usage of :func:`collections.namedtuple` in
the :meth:`RobotFileParser.parse() <urllib.robotparser.RobotFileParser.parse>`
method.
Initial patch by Robin Wellner.