GNU datamash

by GNU

Versions:

  • 1.3

GNU datamash 1.3 is a lightweight command-line utility published by the GNU Project that lets analysts, system administrators and researchers perform numeric, textual and statistical operations on plain-text data files without writing custom scripts. Designed for POSIX-style environments, it reads delimiter-separated input (tab-separated by default) from standard input, with files supplied through a pipe or shell redirection, applies one or more operations to the chosen fields, and writes the result in the same simple format. This makes it equally suited to ad-hoc shell queries and to automated data-cleaning pipelines. Common use cases include computing descriptive statistics (minimum, maximum, mean, median, standard deviation, quartiles), generating cross-tabulations, grouping records by key fields (optionally sorting the input first), counting unique values, calculating percentiles or checksums, and checking that every line of a large log or survey dataset has the same number of fields. Because every operation is named directly on the command line of a single executable, a multi-line awk or R snippet can often be replaced with one short command, which leaves less room for error. The program is frequently embedded in makefile workflows, CI stages, or shell loops that process daily extracts from web servers, scientific instruments, or financial ledgers. Since it reads its input as a stream, it can work through gigabyte-scale files with modest memory use, although operations such as median or percentiles need all the values of a group at once. GNU datamash is listed in the “Database & Analytics” category and is offered here as the 1.3 source tarball; the same codebase compiles on Linux, macOS, BSD and Windows Subsystem for Linux, so behavior is consistent across environments. The software is available for free on get.nero.com, with downloads provided through trusted Windows package sources such as winget, which deliver the latest available version; the site also supports batch installation of multiple applications.
