ketab-online-sdk

SDK to access the public APIs exposed by ketabonline.com. The library provides helpers to download raw book data, inspect authors and categories, retrieve table of contents, and search for titles without having to reverse engineer the HTTP endpoints yourself.

🚀 Try the Live Demo — Browse categories, books, and authors with search and pagination. Built with TanStack Start and deployed to Cloudflare Pages. See demo/README.md for details.

Browser-compatible: The main entry point uses the Fetch API and works in browsers, Node.js 18+, Deno, and other modern JavaScript runtimes. Node.js-specific features (like downloadBook) are available via a separate import path.

Installation

Install the package with the package manager of your choice. The project is developed with Bun, but the published package works from any JavaScript runtime that supports the Fetch API.

# bun
bun add ketab-online-sdk

# npm
npm install ketab-online-sdk

# yarn
yarn add ketab-online-sdk

Usage

Browser & Universal (works everywhere)

import {
    getAuthorInfo,
    getBookContents,
    getBookIndex,
    getBookInfo,
    getBooks,
    getCategoryInfo,
} from 'ketab-online-sdk';

// Retrieve summary information about a book
const book = await getBookInfo(123);

// Search for books with pagination helpers
const books = await getBooks({ query: 'الفقه', page: 1, limit: 10 });

// Get the table of contents for a book
const index = await getBookIndex(67768, { isRecursive: true, part: 1 });

// Get full book contents as JSON (works in browser!)
const contents = await getBookContents(123);

Node.js Only

// For filesystem operations, import from the /node subpath
import { downloadBook } from 'ketab-online-sdk/node';

// Download a book bundle to disk for offline reading
const outputPath = await downloadBook(123, './book.json');

API surface

Function	Import Path	Description
`getAuthorInfo(id)`	`ketab-online-sdk`	Returns sanitized author metadata, removing falsy fields the API sometimes includes.
`getAuthors(options)`	`ketab-online-sdk`	Lists authors with optional pagination, sorting and search query parameters.
`getBookContents(id)`	`ketab-online-sdk`	Fetches, extracts and parses the JSON payload that contains full book contents.
`getBookIndex(id, options)`	`ketab-online-sdk`	Retrieves the table of contents index for a specific book with optional hierarchical structure.
`getBookInfo(id)`	`ketab-online-sdk`	Retrieves summary information for a book, including publication metadata.
`getBooks(options)`	`ketab-online-sdk`	Lists books with optional pagination, sorting and search query parameters.
`getCategories(options)`	`ketab-online-sdk`	Lists categories with optional pagination and filtering.
`getCategoryInfo(id)`	`ketab-online-sdk`	Fetches metadata for a category, including book counters.
`downloadBook(id, outputFile)`	`ketab-online-sdk/node`	Node.js only. Downloads the book JSON and saves it to disk.

Content Transformation Utilities

The SDK includes utilities for transforming HTML page content to Markdown:

Convert entire book to clean Markdown (no footnotes)

import {
    getBookContents,
    splitPageFootnotes,
    removeFootnoteReferences,
    htmlToMarkdown,
} from 'ketab-online-sdk';

const book = await getBookContents(62952);

// Convert all pages to clean markdown without footnotes
const cleanMarkdown = book.pages.map((page) => {
    // Split off the footer (footnotes section)
    const [body] = splitPageFootnotes(page.content);
    // Remove inline footnote references like (١)
    const cleanHtml = removeFootnoteReferences(body);
    // Convert to markdown
    return htmlToMarkdown(cleanHtml);
}).join('\n\n---\n\n');

Extract table of contents with titles and page numbers

import { getBookContents, flattenIndex } from 'ketab-online-sdk';

const book = await getBookContents(62952);

// Get all TOC entries as a flat list
const toc = flattenIndex(book.index).map((entry) => ({
    title: entry.title,
    page: entry.page,
    part: entry.part_name,
    level: entry.title_level,
}));
// [{ title: 'تمهيد', page: 5, part: '1', level: 1 }, ...]

Utility	Description
`htmlToMarkdown(html)`	Converts ketabonline HTML to Markdown, detecting headers.
`splitPageFootnotes(html)`	Splits page content into `[body, footer]` tuple.
`extractFootnotes(footer)`	Parses footer HTML into structured `{ number, text }[]`.
`stripFootnoteLinks(html)`	Removes footnote `<a>` tags but keeps the `(١)` text.
`removeFootnoteReferences(html)`	Removes all footnote references completely.
`flattenIndex(index)`	Converts hierarchical TOC to flat array.
`findIndexEntry(index, id)`	Finds a TOC entry by its ID.
`getIndexBreadcrumb(index, id)`	Builds breadcrumb path from entry to root.

Book Index Options

The getBookIndex function supports two modes:

Flat mode (default):

const index = await getBookIndex(67768);
// Returns flat array of index entries

Hierarchical mode:

const index = await getBookIndex(67768, { isRecursive: true });
// Returns nested structure with children property

You can also specify the part number:

const index = await getBookIndex(67768, { part: 2, isRecursive: true });

Every helper throws an error when the upstream API answers with an error status, so you can rely on normal try { ... } catch (error) { ... } flow control.

Development

The repository is managed with Bun and uses Biome and tsdown.

bun install            # install dependencies
bun run build          # build the library with tsdown
bun test               # run the bun:test based unit suite
bun run lint           # type-check, lint and format with biome

To release new functionality run the build, lint and test commands locally before opening a pull request.

Project structure

src/index.ts – browser-compatible entry point exporting SDK helpers (getBooks, getBookIndex, etc.).
src/node.ts – Node.js-only entry point that adds downloadBook and re-exports everything from index.ts.
src/types.ts – TypeScript type definitions for all API requests and responses.
src/utils/common.ts – shared transforms such as removeFalsyValues.
src/utils/io.ts – in-memory ZIP extraction using fflate (browser-compatible).
src/utils/network.ts – HTTP helpers using the Fetch API (httpsGet, buildUrl).
testing/e2e.test.ts – optional integration smoke test that hits the real API (skipped in CI by default).
tsdown.config.ts – the bundler config with dual entry points (index.ts, node.ts).

Keep documentation and tests close to the source files. Any new public helper should include:

JSDoc explaining the parameters and return values.
A bun:test unit suite demonstrating success and failure conditions.
README updates so downstream consumers know how to use the addition.

Testing

Unit tests use the it('should...') convention for descriptive test names. Run tests with:

bun test

For end-to-end testing against the live API:

bun run e2e

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
AGENTS.MD		AGENTS.MD
LICENSE.MD		LICENSE.MD
README.md		README.md
biome.json		biome.json
bun.lock		bun.lock
package.json		package.json
release.config.mjs		release.config.mjs
tsconfig.json		tsconfig.json
tsdown.config.ts		tsdown.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ketab-online-sdk

Installation

Usage

Browser & Universal (works everywhere)

Node.js Only

API surface

Content Transformation Utilities

Convert entire book to clean Markdown (no footnotes)

Extract table of contents with titles and page numbers

Book Index Options

Development

Project structure

Testing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ketab-online-sdk

Installation

Usage

Browser & Universal (works everywhere)

Node.js Only

API surface

Content Transformation Utilities

Convert entire book to clean Markdown (no footnotes)

Extract table of contents with titles and page numbers

Book Index Options

Development

Project structure

Testing

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages