wp.jochen.hayek.name/blog-en

Blog

BorgBackup: you cannot remove single files from an “archive”, but you can remove entire “archives” from a repository
- https://borgbackup.readthedocs.io/en/stable/usage/delete.html
Why would I do that? Because that archive accidentally contains a huge ISO file. And there are enough archives around, so that deleting that single archive does not hurt.

I extended the “–exclude” list for my “borg create” accordingly.

I will go and create a list of the fattest files on the repo in the future. That will help will adapting the “–exclude” list even more.
2021-03-06
my way of studying Kubernetes: minikube on macOS
macOS is my OS platform, VirtualBox my virtualisation platform – I tried to follow the instructions (I found in my online courses by Sander von Fugt and Mumshad Mannambeth) and set up minikube within a VM. But I did not succeed setting up minikube in a VirtualBox VM. I attempted using a free VMware Fusion Player – but macOSwise I am not really up-to-date – I am still (now in March 2021) on Mojave (10.14), and VMware Fusion Player 12 does not run on Mojave. So finally I deviated from those advises, and followed the advised here below – and so far successfully.

I would have preferred to use my Synology NAS for my Kubernetes experiments, as I did for Docker. But as I am too anxious spoiling the NAS terribly, so I refrained from doing so. The NAS offers Docker itself, so that’s not such a risk at all. I would have liked to couple minikube and kubectl with that NAS as container environment – but I don’t feel fit enough for that.

So here are the links, that led me to my little success:
- https://kubernetes.io/docs/tasks/tools/install-kubectl/#install-kubectl-on-macos
- https://minikube.sigs.k8s.io/docs/start/ – here I found instructions for macOS
2021-03-05
“Miller” is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
- https://github.com/johnkerl/miller
- https://csvkit.readthedocs.io – Miller is quite similar to csvkit
2021-03-02
FLOSS Weekly is a free/libre and open-source software (FLOSS) themed netcast from the TWiT Network
2021-02-26
Tom HaCohen’s home page and blog
- https://stosb.com
- https://stosb.com/tag/explaining-configs/ – “Explaining My Configs: …” – ssh client, sshd_config, OpenVPN, nftables
2021-02-26
The Jewish Story Behind a Towering Statue of JFK (Tablet Magazine)
- https://www.tabletmag.com/sections/community/articles/jfk-statue-and-the-rabbi
2021-02-13
CPAN mirror network vs CDN
- https://perl.topicbox.com/groups/infrastructure/Tfb538341a77530c9
2021-02-02
“pdftohtml -xml” – only the poppler suite supports “-xml”
- https://forum.xpdfreader.com/viewtopic.php?f=3&t=41211
- only the poppler toolset (the xpdf-related toolset) has “pdftohtml -xml“
- https://en.wikipedia.org/wiki/Poppler_(software)
- https://poppler.freedesktop.org
- https://anongit.freedesktop.org/git/poppler/poppler.git
One of my most favourite tools.

I have been using it for years now – on a daily basis. (I came across it in my local Ruby user group many years ago.)

Of course it only works on PDF with text.

Luckily enough there are tools resp. services, that “OCR” your “image PDF”, just in case your PDF file does not include the text it shows as text.

I am editing the XML result in Emacs with nXML mode, and I developed a RELAX-NG grammar for context sensitive editing of such XML files.
I am annotating these XML files using specific XML comments.
For PDF files from several providers I created scripts for automated annotation. (Best case: find lvalue and rvalue together. Most of the time I find at least lvalue.)
I created scripts to extract the details from those annotations. And they create text, that resembles by (personal home-made / home-maintained) bank statements – so I can “reconcile” them.

I am processing every bill PDF like that.

I am processing every contract PDF like that. I guess you understand, how much better it is to read and annotate a text file in place instead of keeping notes outside the source. Yes, that’s of course like inline documentation within programming language source files.

Just in case anybody reads this and finds it useful: Of course I am able and most willing to provide far more details.
2021-02-02
The basics of GitLab CI: How to run jobs sequentially, in parallel, or out of order
- https://about.gitlab.com/blog/2020/12/10/basics-of-gitlab-ci-updated/
2021-02-02
the Bitwarden command-line tool
- https://bitwarden.com/help/article/cli/
- https://stedolan.github.io/jq/ – “bw” (the CLI utility) works well with jq
2021-01-28