Q: how to get updates from web pages w/o RSS feed? A: XPath + cron or Jenkins job

  • sadly enough even now in 2016 a lot of web pages are not XHTML conform, but getting them fairly conform is not that expensive: use “xmlstarlet fo –html –recover
  • get the (cron or) Jenkins job to save the current page content in the job’s workspace
  • let the Jenkins job compare the current to the last state …
  • … and message you through XMPP, if there’s a change

Comments

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.