docs(schema): document HNSW int8 quantization option by kriszyp · Pull Request #508 · HarperFast/documentation

kriszyp · 2026-06-01T12:28:23Z

Summary

Documents the new quantization: "int8" option for HNSW vector indexes — adds a row to the HNSW parameter table and a short example.

Context

Pairs with HarperFast/harper#894 (optional int8 vector quantization for the HNSW index): ~5× smaller index, substantially faster search, ~1% recall cost; opt-in, the record's full-precision vector is unchanged. Making it the default is tracked separately (HarperFast/harper#932).

Generated with the assistance of an LLM (Claude Opus 4.8).

gemini-code-assist

Code Review

This pull request updates the database schema documentation to include details and an example for the new quantization parameter (specifically "int8") for HNSW indexes. As there are no review comments, I have no feedback to provide.

github-actions · 2026-06-01T12:32:10Z

🚀 Preview Deployment

Your preview deployment is ready!

🔗 Preview URL: https://preview.harper-documentation.harperfabric.com/pr-508

This preview will update automatically when you push new commits.

Ethan-Arrowood · 2026-06-01T15:31:46Z

 | `optimizeRouting`      | `0.5`             | Heuristic aggressiveness for omitting redundant connections (0 = off, 1 = most aggressive)          |
 | `mL`                   | computed from `M` | Normalization factor for level generation                                                           |
 | `efSearchConstruction` | `50`              | Max nodes explored during search                                                                    |
+| `quantization`         | _(full precision)_ | `"int8"` stores each indexed vector as 8-bit scalar-quantized values plus a per-vector scale instead of float32 — roughly a 5× smaller index and substantially faster search, at a small recall cost (~1%). Omit for full-precision float32. Only the index is quantized; the full-precision vector on the record is unchanged. |


I think default should be "float32" for this option to make more sense. _(full precision)_ is not a value. I believe right now this option only supports two values, but will there be more in the future? If not, and this is truly a binary configuration, should it not reflect that better? Like maybe making it a toggleable int8Quantization: boolean ?

@embed

* docs(v5.1): release notes, deployment tracking ops, deploy_component updates - Add 5.1.md release notes covering: models/AI, @embed directive, MCP server, deployment tracking, HNSW int8 quantization, and replication improvements - Update deploy_component docs: urlPath, install_allow_scripts params, deployment_id response - Document new deployment operations: list_deployments, get_deployment, get_deployment_payload, delete_deployment_payload - Document hdb_deployment record schema (fields, phases, peer_results) Note: models/AI detail, MCP reference, and HNSW quantization have separate PRs (#523, #507/#516, #508) — this PR adds the release notes overview and the deployment tracking operations which had no coverage. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * style: run prettier on changed files Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: remove cross-plugin MCP link that breaks Docusaurus build The release-notes and reference doc plugins are separate; relative .md links between them resolve incorrectly. Removing until PR #507 (MCP reference section) merges and can be linked with an absolute path. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * docs(5.1): expand release notes — middleware/routing, caching, LOCAL_ONLY, HARPER_CONFIG, RocksDB, migrateOnStart, upgrade improvements --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>

github-actions · 2026-06-13T00:32:20Z

🚀 Preview Deployment

Your preview deployment is ready!

🔗 Preview URL: https://preview.harper-documentation.harperfabric.com/pr-508

This preview will update automatically when you push new commits.

kriszyp requested a review from a team as a code owner June 1, 2026 12:28

gemini-code-assist Bot reviewed Jun 1, 2026

View reviewed changes

github-actions Bot temporarily deployed to pr-508 June 1, 2026 12:32 Inactive

Ethan-Arrowood reviewed Jun 1, 2026

View reviewed changes

kriszyp mentioned this pull request Jun 10, 2026

docs(v5.1): release notes, deployment tracking operations #524

Merged

4 tasks

kriszyp closed this Jun 13, 2026

kriszyp force-pushed the kris/hnsw-int8-quantization-docs branch from 7da20a0 to dcd5dc2 Compare June 13, 2026 00:29

github-actions Bot deployed to pr-508 June 13, 2026 00:32 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs(schema): document HNSW int8 quantization option#508

docs(schema): document HNSW int8 quantization option#508
kriszyp wants to merge 0 commit into
mainfrom
kris/hnsw-int8-quantization-docs

kriszyp commented Jun 1, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

github-actions Bot commented Jun 1, 2026

Uh oh!

Ethan-Arrowood Jun 1, 2026

Uh oh!

github-actions Bot commented Jun 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

kriszyp commented Jun 1, 2026

Summary

Context

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

github-actions Bot commented Jun 1, 2026

🚀 Preview Deployment

Uh oh!

Ethan-Arrowood Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Jun 13, 2026

🚀 Preview Deployment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants