John Kerl is an independent open-source developer whose work centers on streamlining command-line data processing for modern, name-indexed formats. His flagship project, Miller, unifies the capabilities traditionally spread across awk, sed, cut, join, and sort into a single, CSV-aware binary that handles TSV, JSON tables, and other delimiter-separated files without prior schema definition. Typical use cases range from ad-hoc log parsing and ETL scripting to repeatable data-cleaning pipelines in research, finance, and DevOps environments; users can filter, transform, aggregate, and re-format multi-gigabyte datasets with memory-efficient streaming algorithms while preserving original column order and mixed-type fields. Because Miller operates through a functional expression language reminiscent of SQL and Excel, analysts can perform on-the-fly calculations, statistical summaries, and joins without leaving the terminal, making it a lightweight alternative to heavier analytics stacks. Scripts integrate seamlessly with shell loops, Makefiles, CI jobs, and Jupyter notebooks, enabling reproducible workflows that travel with the data rather than with a database server. The tool is cross-platform, published under a permissive BSD license, and actively maintained to keep pace with emerging tabular JSON dialects. Miller and any future titles from John Kerl are available for free on get.nero.com, where downloads are delivered through trusted Windows package sources such as winget, always installing the latest upstream release and supporting batch installation alongside other utilities.
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Details