Gitiles
Code Review
Sign In
gerrit-public.fairphone.software
/
platform
/
external
/
python
/
cpython3
/
602426e3cf4a65fe37fbe17a3e3a106d0a998811
/
Tools
/
webchecker
/
webchecker.py
182b5ac
Whitespace normalization, via reindent.py.
by Tim Peters
· 20 years ago
a982c44
[Patch #918212] Support XHTML's 'id' attribute, which can be on any element.
by Andrew M. Kuchling
· 21 years ago
ce56c37
When bad HTML is encountered, ignore the page rather than failing with
by Mark Hammond
· 22 years ago
0b9e3f7
Handle the Content-Type header a little more appropriately: if it
by Fred Drake
· 22 years ago
aaab30e
Apply diff2.txt from SF patch http://www.python.org/sf/572113
by Walter Dörwald
· 22 years ago
88a20ba
Apply diff.txt from SF patch http://www.python.org/sf/561478
by Walter Dörwald
· 22 years ago
566c0c7
[Bug #512799] urllib.splittype() returns a 2-tuple. (Reported by seb bacon)
by Andrew M. Kuchling
· 23 years ago
f0953b9
Fix SF bug #482171: webchecker dies on file: URLs w/o robots.txt
by Guido van Rossum
· 23 years ago
d34a9c9
Added more link attributes based on additonal information from Chris
by Fred Drake
· 24 years ago
f3186e8
A number of improvements based on a discussion with Chris McCafferty
by Fred Drake
· 24 years ago
8430624
Fix suggested by Magnus Kessler: in class Page, it is possible for
by Guido van Rossum
· 25 years ago
e284b21
Integrated Sam Bayer's wcnew.py code. It seems silly to keep two
by Guido van Rossum
· 25 years ago
dbd5c3e
Samuel L. Bayer:
by Guido van Rossum
· 25 years ago
0ec1493
Some changes (maybe not enough?) to make it work on Windows with local
by Guido van Rossum
· 26 years ago
a42c1ee
Added note() message to Page class -- this was used but didn't exist.
by Guido van Rossum
· 26 years ago
125700a
Instead of printint, use self.message() or self.note().
by Guido van Rossum
· 26 years ago
6eb9d32
sort the urls in the todo list
by Guido van Rossum
· 26 years ago
bee6453
Use a try-except so that the pickle file is written even when we die
by Guido van Rossum
· 27 years ago
986abac
Give in to tabnanny
by Guido van Rossum
· 27 years ago
00756bd
Major overhaul. Don't use global variable (e.g. verbose); use
by Guido van Rossum
· 27 years ago
2237b73
Several changes:
by Guido van Rossum
· 27 years ago
89efda3
Avoid the fancy handler for error 401 (request authentication).
by Guido van Rossum
· 28 years ago
af310c1
Restructured Checker class to get rid of 'ext' table.
by Guido van Rossum
· 28 years ago
6133ec6
Process <img> and <frame> tags. Don't bother skipping second href.
by Guido van Rossum
· 28 years ago
0b0b5f0
Spin off checking of external page in a subroutine.
by Guido van Rossum
· 28 years ago
e5605ba
Many misc changes.
by Guido van Rossum
· 28 years ago
c59a5d4
Set proper User-agent header (Python-webchecker/<version>).
by Guido van Rossum
· 28 years ago
2739cd7
Some refinements of the external-link checking code: insert the errors
by Guido van Rossum
· 28 years ago
de66268
Added -x option to check external links. Slooooow!
by Guido van Rossum
· 28 years ago
325a64f
Catch I/O errors when parsing robots.txt file.
by Guido van Rossum
· 28 years ago
3edbb35
Added robots.txt support, using Skip Montanaro's parser.
by Guido van Rossum
· 28 years ago
272b37d
web tree checker
by Guido van Rossum
· 28 years ago