Skip to main content

March 2026

  • Features: Added support for tuning BM25. Configure your own b and k1 parameters to adjust ranking behavior and refine result relevance.
  • Blog: Published a post about SMVE — an algorithm for scaling multi-vector retrieval, focusing on improved efficiency and performance for large-scale workloads.

February 2026

  • Features: Enabled native support for state-of-the-art retrieval models via multi-vector retrieval. ~5.5x faster than PLAID, ~7.5 faster than MUVERA with 1-bit, 2-bit and scalar quantization to optimize storage efficiency.
  • Features: Added native support for f8 and f16 vector types to optimize embedding storage size and improve efficiency when storing large document collections.

January 2026

  • Features: Added support for multivec indexing and querying, enabling more advanced multi-embedding retrieval use cases.
  • Features: Introduced the list<str>.starts_with operator in our query language, enabling efficient prefix filtering on array string fields.

November 2025

  • Features: Added support for regular-expression filtering in our query language, enabling more flexible querying logic.
  • Benchmarks: Published the new TopK Benchmarks results, showcasing performance and scalability across production-like workloads for multiple providers.
  • Billing: Shipped billing, unlocking end-to-end usage tracking and payment workflows.
  • Performance: Shipped distributed compaction to optimize cost and improve indexing throughput per collection.

October 2025

  • Features: Added update()API to simplify partial record updates.
  • Features: Shipped support for delete-by-filter operations for more flexible bulk deletions.
  • Docs & SDKs: Added support for Cohere’s embed-v4 model in semantic_index() and exposed organization limits in ddb-management-plane for improved visibility and management.

September 2025

  • Docs & SDKs: Published SDK reference for Python and JavaScript clients, and added detailed docstrings for a smoother developer experience in both SDKs
  • Python SDK: Released AsyncClient for easier async workflows.
  • String Operators: Added lt, lte, gt, gte, min, and max.
  • Query Helpers: Introduced any and all helpers.
  • List Support: Added contains and in operators for list fields.
  • Observability: Added Query Latency and Write Latency charts to the Usage Metrics in console.
  • New Data Type: Added int8 vector support.
  • Eager Caching: Added support for eager caching to mitigate tail latencies for concurrent read/write workloads.
  • Faster GETs: Added option to cache raw document data and improved performance with zero-copy design.
  • Performance: Improved performance for queries with default & strong consistency level.

August 2025

  • Lists Data Type: Native support for list fields.
  • Performance: Added skip_refine query option to improve speed when reranking isn’t needed.
  • Monitoring: Usage Metrics now available directly in the console.

July 2025

June 2025

  • Sparse Vectors: Added support for f32 and u8 sparse vector fields.
  • Benchmarks: Published billion-scale benchmarks for dense & sparse vector search with filtering.