Datanomy

Explore the anatomy of your columnar data files

Datanomy is a terminal-based tool for inspecting and understanding data files. It provides an interactive view of your data's structure, metadata, and internal organization.

Supported formats

Parquet (.parquet, .parq)
Arrow IPC (.arrow, .feather, .ipc)

Features for Parquet view

General Structure

Schema

Data

Metadata

Stats

Features for Arrow IPC view

Structure

File-level layout showing header, record batches, and footer.

Schema

Arrow schema with per-column type and nullability details.

Data

Preview of the first 50 rows.

Metadata

File and schema-level metadata.

Buffers

Physical buffer layout for each column — validity bitmap bits (color-coded valid/null), hex preview of values, offsets, and data buffers. For nested types (list, struct, map, dictionary) child array buffers are shown recursively.

Installation

# From PyPI
uv tool install datanomy
## with pip
pip install datanomy

# From source
uv tool install "datanomy @ git+https://github.com/raulcd/datanomy.git"
## cloning the repo 
git clone https://github.com/raulcd/datanomy.git
cd datanomy
uv sync

Usage

# Run without installing using uvx
uvx datanomy data.parquet

# Inspect a Parquet file
datanomy data.parquet

# Inspect an Arrow IPC file
datanomy data.arrow

You can also use from source using uvx. This uses the development version:

uvx "git+https://github.com/raulcd/datanomy.git" data.parquet
uvx "git+https://github.com/raulcd/datanomy.git" data.arrow

Keyboard Shortcuts

q - Quit the application

Development

# Install dependencies
uv sync

# Run from source
uv run datanomy path/to/file.parquet
uv run datanomy path/to/file.arrow

# Install dev dependencies
uv sync --extra dev

# Run tests
uv run pytest

# Format code
uv run ruff format .

# Lint
uv run ruff check .

# Lint
uv run mypy .

License

Apache License 2.0

Contributing

Contributions welcome! Please open an issue or PR.

Built with Textual and PyArrow

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
.github/workflows		.github/workflows
docs		docs
src/datanomy		src/datanomy
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Datanomy

Supported formats

Features for Parquet view

General Structure

Schema

Data

Metadata

Stats

Features for Arrow IPC view

Structure

Schema

Data

Metadata

Buffers

Installation

Usage

Keyboard Shortcuts

Development

License

Contributing

About

Uh oh!

Releases 4

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Datanomy

Supported formats

Features for Parquet view

General Structure

Schema

Data

Metadata

Stats

Features for Arrow IPC view

Structure

Schema

Data

Metadata

Buffers

Installation

Usage

Keyboard Shortcuts

Development

License

Contributing

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages