PyKHTML

 - Scraping made easy

Todo List

While not an exhaustive list, below are features that are planned or bugs that are to be squashed in the near future.

  • More extensive use has made it clear that cancelling is important in real life examples. I think onNextLoad has to be updated accordingly so that it can accept either a URL as it has so far or a simple struct of the form (successCallback, timeLimit, timeoutCallback)
  • Tutorials
  • More unit tests: KWallet/cookies and 'the rest'
  • Inheritance diagrams in the documentation (JS based?)

Bugs:

  • Screenshots only work when there is a window being shown. I've looked at how Konqueror's HTML thumbnail previews are implemented but still amn't too sure where my code is going wrong. Anyone ;)?
Get PyKHTML 0.2 Linux Archive, 26.6kb