Release version 0.10.0: November 2024

Apache Pinot updates since the last StarTree release

For details on Pinot changes, see Releases (opens in a new tab).

StarTree Cloud

StarTree extensions for Apache Pinot

  • Added flexibility for users to provide the number of retries in case of failures when atomic sync is configured. This allows users to also upgrade their StarTree Cloud environment while data is being ingested.
  • Improved performance while ingesting data from Delta Lake or using SegmentImportTask by changing the default value of parameter "push.mode" to "metadata".
  • Added several improvements to Delta Lake 3.0 connector to support Delta Protocol Reader version 3 and Writer version 7.
  • Added support for ingesting data from DynamoDB CDC streams using the DynamoDB message decoder. [link] (opens in a new tab)
  • Added native support for ingesting Prometheus-formatted metrics data into tables in StarTree Cloud. Users can now leverage the price/performance of StarTree Cloud for their metrics solution built on Prometheus. [link] (opens in a new tab)
  • Added the ability to merge smaller segments into large segments to improve performance in an upsert enabled table leveraging SegmentRefreshTask. [link] (opens in a new tab)
  • Added TTL for metadata and deleted keys for upsert-enabled tables using Offheap upsert. This will improve scalability and manageability by reducing the size of managed keys and metadata.
  • Added data consistency guarantees when running queries while upserts are being processed. Sometimes the result set would not be consistent without this guarantee. [link] (opens in a new tab)
  • Improved the server restart time, when needed, by preloading a snapshot of primary keys in an upsert enabled table. In absence of this feature, the primary keys will be built during the server restart, resulting in long restart times. [link] (opens in a new tab)
  • Improved scalability and reliability for Dedup by moving the metadata from on-heap implementation to off-heap implementation, similar to off-heap upsert.
  • Added several health checks to ensure tables in StarTree Cloud are always optimized for best performance. The list of health checks includes a check to ensure no table in production is running with a single replica of data. [link] (opens in a new tab)

Data Manager

  • Added the ability for users to modify the schema and table configuration even after a table has been created, enabling greater flexibility. Users can optimize their table for better performance using Data Manager.
  • Added enhanced validation to ensure accurate field type and data type configurations during table creation, reducing errors and improving data integrity.

ThirdEye

  • ThirdEye is now available in the StarTree Cloud Free Tier (opens in a new tab).
  • Improved onboarding with the new alert creation flow. Creating alerts is now simpler and faster. Creating dimension exploration alerts is now possible in no-code.
  • Added new Impact dashboard. This dashboard provides managers and alert owners a clear and intuitive understanding of the health and performance of all monitored metrics.