feat(collections): concurrent upload_documents with bounded parallelism by ryanda9910 · Pull Request #166 · xai-org/xai-sdk-python

ryanda9910 · 2026-06-16T06:02:09Z

Problem

Uploading many documents to a collection is currently a serial loop of upload_document calls (issue #77). Each upload is an independent, blocking sequence of gRPC round-trips, so the wall-clock time grows linearly with the number of documents — a scaling bottleneck for collections-management workflows.

Approach

Add upload_documents(...) to both the sync (Client) and async (AsyncClient) collections clients. It's a thin convenience wrapper that overlaps the network latency of independent uploads with bounded concurrency:

A ThreadPoolExecutor (sync) / asyncio gather with a semaphore (async), capped by a configurable max_workers with a conservative default to respect server-side rate limits.
Backward compatible — upload_document is unchanged; this is purely additive.
Result order preserved (output matches input order).
Errors are not swallowed — the first error is re-raised after cancelling not-yet-started uploads; other errors remain reachable via __context__ (avoids the silent error-swallowing anti-pattern).
Input validation (ValueError on empty input / non-positive max_workers).

Benchmark (mocked 100ms/upload latency, N=40)

mode	time	speedup
sequential (status quo)	4.14s	1x
parallel, max_workers=4	1.05s	3.9x
parallel, max_workers=8	0.52s	8.0x

Real-world speedup depends on per-upload latency and server limits; methodology is mocked I/O latency so the numbers isolate the concurrency win.

Tests

Sync suite 98 passed, async suite 91 passed (incl. new cases covering parallel success, order preservation, and error propagation). ruff check clean.

Question for maintainers

What default max_workers do you want? I set a conservative default to avoid tripping server-side rate limits — happy to adjust to whatever the service comfortably accommodates.

Idea originally suggested by @nicsins in the issue thread. Fixes #77.

This is a draft for review — feedback on the concurrency default and async approach welcome.

@nicsins

…lelism Sequential document uploads were a scaling bottleneck (issue xai-org#77). Add upload_documents() to sync + async clients: a bounded ThreadPoolExecutor (configurable max_workers, conservative default) that overlaps upload latency, preserves result order, and re-raises the first error without swallowing others. Backward compatible (upload_document unchanged). Idea suggested by @nicsins in xai-org#77.

nicsins · 2026-06-16T12:42:22Z

@ryanda9910 Solid PR — LGTM from a rogue contributor! Bounded concurrency with order preservation and clean error handling is spot on. In Dragonscale we use persistent shared knowledge on a dedicated server with unicast patterns — lets us often push higher effective concurrency without the usual bottlenecks.

Quick question: default max_workers=4 feels pretty conservative. Any chance we could bump the default to 8 (or let maintainers decide after some load testing)? Thanks for tackling #77 and the shoutout! 🚀

ryanda9910 · 2026-06-16T15:03:19Z

Thanks for taking a look! The max_workers=4 default is intentionally conservative — it keeps the client from hammering the API / hitting rate limits out of the box, while callers who've measured their own throughput can raise it via the parameter. Happy to bump the default to a higher baseline if the maintainers prefer one after some load testing. 🙏

nicsins · 2026-06-18T04:02:15Z

refined_xai_collections_batch.py

try this it prevents throwing errors using some slickness and cunning😎 lol !hopefully it might be a much better solution than i offered earlier as looked through it again and added some things i saw missing good luck

ryanda9910 marked this pull request as ready for review June 16, 2026 06:06

ryanda9910 requested a review from a team as a code owner June 16, 2026 06:06

nicsins approved these changes Jun 21, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(collections): concurrent upload_documents with bounded parallelism#166

feat(collections): concurrent upload_documents with bounded parallelism#166
ryanda9910 wants to merge 1 commit into
xai-org:mainfrom
ryanda9910:feature/parallel-collection-document-uploads

ryanda9910 commented Jun 16, 2026

Uh oh!

nicsins commented Jun 16, 2026

Uh oh!

ryanda9910 commented Jun 16, 2026

Uh oh!

nicsins commented Jun 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ryanda9910 commented Jun 16, 2026

Problem

Approach

Benchmark (mocked 100ms/upload latency, N=40)

Tests

Question for maintainers

Uh oh!

nicsins commented Jun 16, 2026

Uh oh!

ryanda9910 commented Jun 16, 2026

Uh oh!

nicsins commented Jun 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants