The roadmap.

Our vision for a high-throughput future.
Focusing on optimization, resilience, and scale.

Next 2-4 weeks

Phase 1: Storage Optimization & Idempotency

Solving the "Small Files Problem"

Currently, logs are batched by count. At scale, this generates millions of tiny files in S3, causing high API costs and slow operations.

Shifting to size-based batching (5MB - 10MB). A single GZIP file can hold tens of thousands of logs, reducing S3 costs by 99%.

Solving "Double Processing"

If the Syncer crashes before deleting a processed file from S3, the next cycle will duplicate those logs.

Updating to ClickHouse ReplacingMergeTree. By generating unique log fingerprints, ClickHouse will automatically deduplicate data.

1-3 months

Solving "Explosive Backlogs"

During database downtime, massive S3 backlogs accumulate. A single-threaded syncer might take days to catch up.

Implement a multi-worker pool in Go with backpressure control to parallelize downloads and inserts, clearing backlogs in minutes.

Solving "Near Real-Time Latency"

The S3-staging introduces a 30-60s delay, which is too slow for live incident response or debugging.

Dual-path routing: ERROR logs bypass S3 and stream directly into ClickHouse or UI for instant visibility.

3-6 months

BYOC & Managed

Managing ClickHouse and S3 permissions requires dedicated DevOps time.

Launching a Management API and Dashboard to deploy Bring Your Own Cloud setups in clicks with a SQL-powered UI.

Remote Configuration

SDKs currently rely on static variables and cannot be throttled during massive traffic spikes.

Remote configuration allowing Edge functions to fetch batching limits dynamically, enabling server-side throttling.