Back to Clickhouse

mergeTreeTextIndex

docs/en/sql-reference/table-functions/mergeTreeTextIndex.md

26.4.1.1-new2.5 KB
Original Source

mergeTreeTextIndex Table Function

Represents the dictionary of a text index in MergeTree tables. Returns tokens with their posting list metadata. It can be used for introspection.

Syntax {#syntax}

sql
mergeTreeTextIndex(database, table, index_name)

Arguments {#arguments}

ArgumentDescription
databaseThe database name to read text index from.
tableThe table name to read text index from.
index_nameThe text index to read from.

Returned value {#returned_value}

A table object with tokens and their posting list metadata.

Usage Example {#usage-example}

sql
CREATE TABLE tab
(
    id UInt64,
    s String,
    INDEX idx_s (s) TYPE text(tokenizer = splitByNonAlpha)
)
ENGINE = MergeTree
ORDER BY id;

INSERT INTO tab SELECT number, concatWithSeparator(' ', 'apple', 'banana') FROM numbers(500);
INSERT INTO tab SELECT 500 + number, concatWithSeparator(' ', 'cherry', 'date') FROM numbers(500);

SELECT * FROM mergeTreeTextIndex(currentDatabase(), tab, idx_s);

Result:

text
   ┌─part_name─┬─token──┬─dictionary_compression─┬─cardinality─┬─num_posting_blocks─┬─has_embedded_postings─┬─has_raw_postings─┬─has_compressed_postings─┐
1. │ all_1_1_0 │ apple  │ front_coded            │         500 │                  1 │                     0 │                0 │                       0 │
2. │ all_1_1_0 │ banana │ front_coded            │         500 │                  1 │                     0 │                0 │                       0 │
3. │ all_2_2_0 │ cherry │ front_coded            │         500 │                  1 │                     0 │                0 │                       0 │
4. │ all_2_2_0 │ date   │ front_coded            │         500 │                  1 │                     0 │                0 │                       0 │
   └───────────┴────────┴────────────────────────┴─────────────┴────────────────────┴───────────────────────┴──────────────────┴─────────────────────────┘