XML Shell: xmlsh: csv2xml

http://www.xmlsh.org https://github.com/xmlsh/xmlsh1_3 they also provide a Java .jar file and a shell script etc around it, so you can run it on your platform http://www.xmlsh.org/CommandCsv2xml – builtin csv2xml A couple of command line options we are using: -delim ‘;’ – German excel files get exported to CSV files with ‘;’ as column separator / delimiter, so when we… Continue reading XML Shell: xmlsh: csv2xml

Published
Categorised as CSV Tagged

XML Shell: xmlsh

http://www.xmlsh.org https://github.com/xmlsh/xmlsh1_3 they also provide a Java .jar file and a shell script etc around it, so you can run it on your platform http://www.xmlsh.org/CommandCsv2xml – builtin csv2xml, funny! http://www.xmlsh.org/CommandRngconvert – that’s trang, dealing with “Relax NG“ examples for csv2xml: $ csv2xml -header my_file.csv $ csv2xml -header -attr my_file.csv # for CSV files created by Excel… Continue reading XML Shell: xmlsh

O’Reilly Media book: “Data Science at the Command Line” – installing the tools …

http://shop.oreilly.com/product/0636920032823.do excercising the tools within the Vagrant VM is fun … and certainly supplying the VM is a good and rather low-trouble start for the studies … but installing the tools in your real-life environment is the “challenge” … Appendix A lists the tools and their home pages http://csvkit.readthedocs.org – I just installed the csvkit,… Continue reading O’Reilly Media book: “Data Science at the Command Line” – installing the tools …

Published
Categorised as CSV

O’Reilly Media book: Data Science at the Command Line

http://shop.oreilly.com/product/0636920032823.do http://datascienceatthecommandline.com http://datasciencetoolbox.org http://jeroenjanssens.com/2013/09/19/seven-command-line-tools-for-data-science.html https://github.com/jeroenjanssens/data-science-at-the-command-line/issues http://csvkit.readthedocs.org Become a more efficient and productive data scientist by leveraging the power of the command line. With this practical guide, you’ll learn seven command-line tools for obtaining, scrubbing, and exploring data—including three tools developed by the author. If you rely on one-time scripts for working with data, this book… Continue reading O’Reilly Media book: Data Science at the Command Line

Published
Categorised as CSV

CSV Kit — commandline tools for working with CSV files

Nat Torkington’s Four short links: 17 January 2012:   http://csvkit.readthedocs.io — commandline tools for working with CSV files. (via Hadley Wickham)[…] A little longer: csvkit (CSV kit) is a suite of utilities for converting to and working with CSV.

Published
Categorised as CSV

table_pdf2csv.pl : extracting tables from PDF, saving them as CSV

I leave the PDF extraction bit to “pdftohtml -xml“. My perl scripts tells you, at what “physical columns” text gets found within the PDF file. You choose, which “physical columns” really makes sense to you as logical column starters. Now you run my perl script with those few serious physical columns specified, and it creates… Continue reading table_pdf2csv.pl : extracting tables from PDF, saving them as CSV

Published
Categorised as CSV