docs/en/release_notes/release-2.4.md
Release date: April 21, 2023
storage_medium. When BEs use both SSD and HDD as storage devices, if the property storage_cooldown_time is specified, StarRocks sets storage_medium to SSD. Otherwise, StarRocks sets storage_medium to HDD. #18649The following bugs are fixed:
InsertOverwriteJob state change log. #19061bad by using statements like ADMIN SET REPLICA STATUS PROPERTIES ("tablet_id" = "10003", "backend_id" = "10001", "status" = "bad");. If the number of BEs is less than or equal to the number of replicas, the corrupted replica cannot be repaired. #17876Release date: February 22, 2023
The following bugs are fixed:
USE <catalog_name>.<database_name>, the non-Leader FE node forwards the SQL statement, with <catalog_name> excluded, to the Leader FE node. As a result, the Leader FE node chooses to use the default_catalog and eventually fails to find the specified database. #17302get_applied_rowsets failed for queries on newly cloned tablets on a follower FE. #17192SET variable = default on a follower FE. #17549Release date: January 19, 2023
The following bugs are fixed:
limit are inconsistent. #13574nullable information is inferred incorrectly in the view of a table. #157441024. #13911FORBID_INVALID_DATES. This SQL mode is disabled by default. When it is enabled, StarRocks verifies the input of the DATE type, and returns an error when the input is invalid. #14143Release date: December 14, 2022
The following bugs are fixed:
SHOW FULL TABLES. #13954statistic_collect_parallel does not take effect. #14352query_timeout with an upper limit of 259200 and a lower limit of 1.default_storage_medium. The storage medium of a table is automatically inferred by the system. #14394Release date: November 14, 2022
Supports property aliveStatus in HeartbeatResponse. aliveStatus indicates if a node is alive in the cluster. Mechanisms that judge the aliveStatus are further optimized. #12713
Optimized the error message of Routine Load. #12155
BE crashes after being upgraded from v2.4.0RC to v2.4.0. #13128
Late materialization causes incorrect results to queries on data lakes. #13133
The get_json_int function throws exceptions. #12997
Data may be inconsistent after deletion from a PRIMARY KEY table with a persistent index.#12719
BE may crash during compaction on a PRIMARY KEY table. #12914
The json_object function returns incorrect results when its input contains an empty string. #13030
BE crashes due to RuntimeFilter. #12807
FE hangs due to excessive recursive computations in CBO. #12788
BE may crash or report an error when exiting gracefully. #12852
Compaction crashes after data is deleted from a table with new columns added to it. #12907
Data may be inconsistent due to incorrect mechanisms in OLAP external table metadata synchronization. #12368
When one BE crashes, the other BEs may execute relevant queries till timeout. #12954
Release date: October 20, 2022
Supports creating asynchronous materialized views based on multiple base tables to accelerate queries with JOIN operations. Asynchronous materialized views support all table types. For more information, see Materialized View.
Supports overwriting data via INSERT OVERWRITE. For more information, see Load data using INSERT.
[Preview] Provides stateless Compute Nodes (CN) that can be horizontally scaled. You can use StarRocks Operator to deploy CN into your Kubernetes (K8s) cluster to achieve automatic horizontal scaling. For more information, see Deploy and manage CN on Kubernetes with StarRocks Operator.
Outer Join supports non-equi joins in which join items are related by comparison operators including <, <=, >, >=, and <>. For more information, see SELECT.
Supports creating Iceberg catalogs and Hudi catalogs, which allow direct queries on data from Apache Iceberg and Apache Hudi. For more information, see Iceberg catalog and Hudi catalog.
Supports querying ARRAY-type columns from Apache Hive™ tables in CSV format. For more information, see External table.
Supports viewing the schema of external data via DESC. For more information, see DESC.
Supports granting a specific role or IMPERSONATE permission to a user via GRANT and revoking them via REVOKE, and supports executing an SQL statement with IMPERSONATE permission via EXECUTE AS. For more information, see GRANT, REVOKE, and EXECUTE AS.
Supports FDQN access: now you can use domain name or the combination of hostname and port as the unique identification of a BE or an FE node. This prevents access failures caused by changing IP addresses. For more information, see Enable FQDN Access.
flink-connector-starrocks supports Primary Key table partial update. For more information, see Load data by using flink-connector-starrocks.
Provides the following new functions:
The Primary Key table supports flushing VARCHAR-type primary key indexes to disks. From version 2.4.0, the Primary Key table supports the same data types for primary key indexes regardless of whether persistent primary key index is turned on or not.
Optimized the query performance on external tables.
Optimized the performance of window functions.
Optimized the performance of Cross Join by supporting predicate pushdown.
Histograms are added to CBO statistics. Full statistics collection is further optimized. For more information, see Gather CBO statistics.
Adaptive multi-threading is enabled for tablet scanning to reduce the dependency of scanning performance on the tablet number. As a result, you can set the number of buckets more easily. For more information, see Determine the number of buckets.
Supports querying compressed TXT files in Apache Hive.
Adjusted the mechanisms of default PageCache size calculation and memory consistency check to avoid OOM issues during multi-instance deployments.
Improved the performance of large-size batch load on Primary Key tables up to two times by removing final_merge operations.
Supports a Stream Load transaction interface to implement two-phase commit (2PC) for transactions that are run to load data from external systems such as Apache Flink® and Apache Kafka®, improving the performance of highly concurrent stream loads.
Functions:
The following bugs are fixed:
DECIMAL data types returned by DESC are different from those specified in the CREATE TABLE statement. #7309
FE metadata management issues that affect the stability of FEs. #6685 #9445 #7974 #7455
Data load-related issues:
Data Lake analytics-related issues:
Metadata is inconsistent between the Leader FE and Follower FE nodes. #11215
BE crashes when the size of BITMAP data exceeds 2 GB. #11178
storage_page_cache_limit) is 20% of the system memory.enable_cbo.vectorized_engine_enable.