{"id":6033,"date":"2016-08-06T12:49:40","date_gmt":"2016-08-06T10:49:40","guid":{"rendered":"http:\/\/www.b.shuttle.de\/hayek\/hayek\/jochen\/wp\/blog-de\/?p=6033"},"modified":"2016-08-06T12:49:40","modified_gmt":"2016-08-06T10:49:40","slug":"offnungszeiten-des-kofferhaus-witt-berlin-mit-xpath-extrahiert","status":"publish","type":"post","link":"https:\/\/wp.jochen.hayek.name\/blog-de\/2016\/08\/06\/offnungszeiten-des-kofferhaus-witt-berlin-mit-xpath-extrahiert\/","title":{"rendered":"Opening hours of Kofferhaus Witt (Berlin) \u2013 extracted with XPath"},"content":{"rendered":"<ul>\n<li><a href=\"http:\/\/www.kofferhaus-witt.de\">http:\/\/www.kofferhaus-witt.de<\/a><\/li>\n<\/ul>\n<pre>$ curl --location http:\/\/www.kofferhaus-witt.de &gt; kofferhaus-witt.html\n\n# alas, this HTML is not well-formed XML either,\n# so we have to tidy it up first:\n\n$ xml fo --recover --html kofferhaus-witt.html &gt; kofferhaus-witt.html.xml\n\n# a bit of searching \u2026,\n# and here is the matching XPath:\n\n$ xml sel -t -c \"html\/body\/footer\/div\/div\/div[@class='col-md-3']\/p[3]\" --nl kofferhaus-witt.html.xml<\/pre>\n<p>And this is the result:<\/p>\n<pre>&lt;p&gt; \u00d6FFNUNGSZEITEN:&lt;br\/&gt; Montag\u2013Freitag 9.00 bis 18.00 Uhr&lt;br\/&gt; Samstag 9.00 bis 14.00 Uhr&lt;\/p&gt;<\/pre>\n<p>Of course, I would much prefer a <strong>public<\/strong>, direct URL for querying the opening hours.<\/p>\n<p>Once again, a nice little finger exercise for a Saturday morning\u00a0\ud83d\ude06 .\t\t\t\t<\/p>\n","protected":false},"excerpt":{"rendered":"<p>http:\/\/www.kofferhaus-witt.de $ curl &#8211;location http:\/\/www.kofferhaus-witt.de &gt; kofferhaus-witt.html # alas, this HTML is not well-formed XML either, # so we have to tidy it up first: $ xml fo &#8211;recover &#8211;html kofferhaus-witt.html &gt; kofferhaus-witt.html.xml # a bit of searching 
\u2026, # and here is the matching XPath: $ xml sel -t -c &#8220;html\/body\/footer\/div\/div\/div[@class=&#8217;col-md-3&#8242;]\/p[3]&#8221; &#8211;nl kofferhaus-witt.html.xml [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_crdt_document":"","advanced_seo_description":"","jetpack_seo_html_title":"","jetpack_seo_noindex":false,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2},"_share_on_mastodon":"0"},"categories":[352],"tags":[1149,1365],"class_list":["post-6033","post","type-post","status-publish","format-standard","hentry","category-nicht-zugeordnet","tag-offnungszeiten","tag-xpath"],"share_on_mastodon":{"url":"","error":""},"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/paO0l8-1zj","jetpack_likes_enabled":true,"amp_enabled":true,"_links":{"self":[{"href":"https:\/\/wp.jochen.hayek.name\/blog-de\/wp-json\/wp\/v2\/posts\/6033","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wp.jochen.hayek.name\/blog-de\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wp.jochen.hayek.name\/blog-de\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/wp.jochen.hayek.name\/blog-de\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/wp.jochen.hayek.name\/blog-de\/wp-json\/wp\/v2\/comments?post=6033"}],"version-history":[{"count":0,"href
":"https:\/\/wp.jochen.hayek.name\/blog-de\/wp-json\/wp\/v2\/posts\/6033\/revisions"}],"wp:attachment":[{"href":"https:\/\/wp.jochen.hayek.name\/blog-de\/wp-json\/wp\/v2\/media?parent=6033"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wp.jochen.hayek.name\/blog-de\/wp-json\/wp\/v2\/categories?post=6033"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wp.jochen.hayek.name\/blog-de\/wp-json\/wp\/v2\/tags?post=6033"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}