Explore the anatomy of your columnar data files
Datanomy is a terminal-based tool for inspecting and understanding data files. It provides an interactive view of your data's structure, metadata, and internal organization.
- Parquet (
.parquet,.parq) - Arrow IPC (
.arrow,.feather,.ipc)
File-level layout showing header, record batches, and footer.
Arrow schema with per-column type and nullability details.
Preview of the first 50 rows.
File and schema-level metadata.
Physical buffer layout for each column — validity bitmap bits (color-coded valid/null), hex preview of values, offsets, and data buffers. For nested types (list, struct, map, dictionary) child array buffers are shown recursively.
# From PyPI
uv tool install datanomy
## with pip
pip install datanomy
# From source
uv tool install "datanomy @ git+https://github.com/raulcd/datanomy.git"
## cloning the repo
git clone https://github.com/raulcd/datanomy.git
cd datanomy
uv sync# Run without installing using uvx
uvx datanomy data.parquet
# Inspect a Parquet file
datanomy data.parquet
# Inspect an Arrow IPC file
datanomy data.arrowYou can also use from source using uvx. This uses the development version:
uvx "git+https://github.com/raulcd/datanomy.git" data.parquet
uvx "git+https://github.com/raulcd/datanomy.git" data.arrowq- Quit the application
# Install dependencies
uv sync
# Run from source
uv run datanomy path/to/file.parquet
uv run datanomy path/to/file.arrow# Install dev dependencies
uv sync --extra dev
# Run tests
uv run pytest
# Format code
uv run ruff format .
# Lint
uv run ruff check .
# Lint
uv run mypy .Apache License 2.0
Contributions welcome! Please open an issue or PR.




