1. 24 Mar, 1999 1 commit
    • Added Samuel Bayer's new webchecker. · 54500625
      Guido van Rossum authored
      Unfortunately his code breaks wcgui.py in a way that's not easy
      to fix.  I expect that this is a temporary situation --
      eventually Sam's changes will be merged back in.
      (The changes add a -t option to specify exceptions to the -x
      option, and explicit checking for #foo style fragment ids.)
  2. 06 Aug, 1998 1 commit
  3. 08 Jul, 1998 1 commit
  4. 15 Jun, 1998 1 commit
  5. 27 Apr, 1998 1 commit
  6. 06 Apr, 1998 1 commit
  7. 21 Feb, 1998 1 commit
  8. 06 Oct, 1997 1 commit
    • Several changes: · 2237b73b
      Guido van Rossum authored
      - Change the code that looks for robots.txt to always look in /, even
      if the "root" path is somewhere deep down below.
      
      - Add link processing in <AREA> tags.
      
      - Change safeclose() to avoid crashing when the file has no geturl()
      method.
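The robots.txt change above can be illustrated with a minimal sketch. It assumes modern Python 3 and `urllib.parse.urljoin`, not the code from this commit; the helper name `robots_url` and the example host are hypothetical.

```python
from urllib.parse import urljoin


def robots_url(root: str) -> str:
    """Return the robots.txt URL for the server that hosts `root`.

    Joining with an absolute path ("/robots.txt") replaces the whole
    path of `root`, so the result points at the server's top level
    even when `root` is deep inside the site.
    """
    return urljoin(root, "/robots.txt")


# Hypothetical root URL several directories below the top of the site.
print(robots_url("http://www.example.com/docs/lib/index.html"))
```

Joining against a relative name such as `"robots.txt"` would instead resolve next to the root page, which is the behaviour the commit message describes moving away from.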
  9. 07 May, 1997 1 commit
  10. 02 Feb, 1997 1 commit
    • Restructured Checker class to get rid of 'ext' table. · af310c1d
      Guido van Rossum authored
      Links are now either in 'todo' or 'done', and ext links
      are handled more like local links except that no further
      links are gathered (and sometimes they aren't checked,
      e.g. for mailto and news URLs).  The -x option reverses
      its meaning: it disables checking of ext links (they are
      moved to 'done' without checking).  A new 'errors' table
      collects pages with bad links as we go -- redundant,
      but useful for the GUI version which needs to report
      this as we go.  Some new methods, including reset().
      New checkpoint format.
      
      Adapted the GUI to the changes in the Checker class.
      Added Quit and "Start over" buttons, and a checkbox
      to disable checking external links.  The details
      window now also shows bad links emanating from the
      selected page.  Miscellaneous small changes.
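The todo/done/errors bookkeeping described in this commit message might look roughly like the sketch below. This is an illustration under stated assumptions, not the actual Checker class: the class name `MiniChecker`, the method names `new_link` and `mark_done`, and the `check_external` flag are all hypothetical.

```python
class MiniChecker:
    """Toy link tracker: every known link is in 'todo' or in 'done'."""

    def __init__(self, check_external=True):
        # Assumed flag standing in for the effect of the -x option.
        self.check_external = check_external
        self.reset()

    def reset(self):
        """Forget all state so a run can start over."""
        self.todo = {}    # link -> pages that reference it, not yet checked
        self.done = {}    # link -> pages that reference it, already handled
        self.errors = {}  # page -> list of (bad link, message), filled as we go

    def new_link(self, link, origin, external=False):
        """Record that page `origin` refers to `link`."""
        if link in self.done:
            self.done[link].append(origin)
        elif link in self.todo:
            self.todo[link].append(origin)
        elif external and not self.check_external:
            # Unchecked external links are moved to 'done' directly.
            self.done[link] = [origin]
        else:
            self.todo[link] = [origin]

    def mark_done(self, link, error=None):
        """Move `link` from 'todo' to 'done', recording any error."""
        origins = self.todo.pop(link)
        self.done[link] = origins
        if error is not None:
            for page in origins:
                self.errors.setdefault(page, []).append((link, error))
```

Keeping a per-page `errors` table duplicates information that could be derived from `done`, but it lets a front end report bad links while the run is still in progress, which is the motivation the message gives.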
  11. 01 Feb, 1997 1 commit
  12. 31 Jan, 1997 2 commits
    • Spin off checking of external page in a subroutine. · 0b0b5f02
      Guido van Rossum authored
      Increase MAXPAGE to 150K.
      Add back printing of __doc__ for usage message.
    • Many misc changes. · e5605ba3
      Guido van Rossum authored
      - Faster HTML parser derived from SGMLparser (Fred Gansevles).
      
      - All manipulations of todo, done, ext, bad are done via methods, so a
      derived class can override.  Also moved the 'done' marking to
      dopage(), so run() is much simpler.
      
      - Added a method status() which returns a string containing the
      summary counts; added a "total" count.
      
      - Drop the guessing of the file type before opening the document -- we
      still need to check those links for validity!
      
      - Added a subroutine to close a connection which first slurps up the
      remaining data when it's an ftp URL -- apparently closing an ftp
      connection without reading till the end makes it hang.
      
      - Added -n option to skip running (only useful with -R).
      
      - The Checker object now has an instance variable which is set to 1
      when it is changed.  This is not pickled.
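The close-after-draining idea mentioned in this message can be sketched as follows. This is a hedged illustration in modern Python 3, not the subroutine from the commit: the chunk size, the defensive `getattr` lookup of `geturl()`, and the `FakeFtpResponse` stand-in class are assumptions made for the example.

```python
import io


def safeclose(f):
    """Close `f`, first reading any remaining data if it is an ftp URL."""
    # Some file-like objects have no geturl(); treat those as non-ftp.
    geturl = getattr(f, "geturl", None)
    url = geturl() if geturl is not None else ""
    if url.startswith("ftp:"):
        # Read and discard the rest so the transfer can finish cleanly.
        while f.read(8192):
            pass
    f.close()


class FakeFtpResponse(io.BytesIO):
    """Hypothetical stand-in for an open ftp response object."""

    def geturl(self):
        return "ftp://ftp.example.com/pub/file"


response = FakeFtpResponse(b"x" * 20000)
response.read(10)    # a caller that stops reading early
safeclose(response)  # drains the remainder, then closes
```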
  13. 30 Jan, 1997 6 commits