content/influxdb3/clustered/admin/databases/create.md
Use the influxctl database create command
to create a database in your {{< product-name omit=" Clustered" >}} cluster.
If you haven't already, download and install the influxctl CLI.
Run the influxctl database create command and provide the following:
%Y-%m-%d)[!Note] {{< product-name >}} supports up to 7 total tags or tag buckets in the partition template.
{{% code-placeholders "DATABASE_NAME|30d|500|200" %}}
influxctl database create \
--retention-period 30d \
--max-tables 500 \
--max-columns 250 \
--template-tag tag1 \
--template-tag tag2 \
--template-tag-bucket tag3,100 \
--template-tag-bucket tag4,300 \
--template-timeformat '%Y-%m-%d' \
DATABASE_NAME
{{% /code-placeholders %}}
Use the --retention-period flag to define a specific
retention period
for the database.
The retention period value is a time duration value made up of a numeric value
plus a duration unit.
For example, 30d means 30 days.
A zero duration (0d) retention period is infinite and data won't expire.
The retention period value cannot be negative or contain whitespace.
{{< flex >}} {{% flex-content "half" %}}
{{% /flex-content %}} {{% flex-content "half" %}}
0d: infinite/none3d: 3 days6w: 6 weeks1mo: 1 month (30 days)1y: 1 year30d30d: 60 days2.5d: 60 hours{{% /flex-content %}} {{< /flex >}}
Database names must adhere to the following naming restrictions:
_), dash (-), and forward-slash (/)_)[!Caution]
Underscore prefix reserved for system use
Names starting with an underscore (
_) may be reserved for InfluxDB system use. While {{% product-name %}} might not explicitly reject these names, using them risks conflicts with current or future system features and may result in unexpected behavior or data loss.
mydb
sensor_data
prod-metrics
logs/application
webserver123
my database # Contains whitespace
sensor.data # Contains period
app@server # Contains special character
_internal # Starts with underscore (reserved)
very_long_database_name_that_exceeds_sixty_four_character_limit # Too long
In InfluxDB 1.x, data is stored in databases and retention policies. In {{% product-name %}}, databases and retention policies have been merged into databases, where databases have a retention period, but retention policies are no longer part of the data model. Because InfluxQL uses the 1.x data model, a database must be mapped to a v1 database and retention policy (DBRP) to be queryable with InfluxQL.
When naming a database that you want to query with InfluxQL, use the following naming convention to automatically map v1 DBRP combinations to an {{% product-name %}} database:
database_name/retention_policy_name
| v1 Database name | v1 Retention Policy name | New database name |
|---|---|---|
| db | rp | db/rp |
| telegraf | autogen | telegraf/autogen |
| webmetrics | 1w-downsampled | webmetrics/1w-downsampled |
In {{< product-name >}}, table (measurement) and column limits can be
configured using the --max-tables and --max-columns flags.
Default maximum number of tables: 500
Each measurement is represented by a table in a database. Your database's table limit can be raised beyond the default limit of 500. InfluxData has production examples of clusters with 20,000+ active tables across multiple databases.
[!Warning]
Excessive table counts can impact performance and stability
High table counts, especially those concurrently receiving writes and queries, can increase catalog overhead which can affect performance and stability. What constitutes "excessive" depends on multiple factors such as query latency requirements, write bandwidth, and cluster capacity to handle rapid backfills. If you're considering more than doubling the default limit, test your configuration thoroughly.
Increasing your table limit affects your {{% product-name omit=" Clustered" %}} cluster in the following ways:
{{< expand-wrapper >}} {{% expand "May improve query performance <em style='opacity:.5;font-weight:normal;'>View more info</em>" %}}
Schemas with many measurements that contain focused sets of tags and fields can make it easier for the query engine to identify what partitions contain the queried data, resulting in better query performance.
{{% /expand %}} {{% expand "More PUTs into object storage <em style='opacity:.5;font-weight:normal;'>View more info</em>" %}}
By default, {{< product-name >}} partitions
data by measurement and time range and stores each partition as a Parquet
file in your cluster's object store. By increasing the number of measurements
(tables) you can store in your database, you also increase the potential for
more PUT requests into your object store as InfluxDB creates more partitions.
Each PUT request incurs a monetary cost and will increase the operating cost of
your cluster.
{{% /expand %}} {{% expand "More work for the compactor <em style='opacity:.5;font-weight:normal;'>View more info</em>" %}}
To optimize storage over time, your {{< product-name omit=" Clustered" >}} cluster contains a compactor that routinely compacts Parquet files in object storage. With more tables and partitions to compact, the compactor may need to be scaled (either vertically or horizontally) to keep up with demand, adding to the operating cost of your cluster.
{{% /expand %}} {{< /expand-wrapper >}}
Default maximum number of columns: 250
Time, fields, and tags are each represented by a column in a table. Increasing your column limit affects your {{% product-name omit=" Clustered" %}} cluster in the following ways:
{{< expand-wrapper >}} {{% expand "May adversely affect query performance" %}}
At query time, the InfluxDB query engine identifies what table contains the queried data and then evaluates each row in the table to match the conditions of the query. The more columns that are in each row, the longer it takes to evaluate each row.
Through performance testing, InfluxData has identified 250 columns as the threshold beyond which query performance may be affected (depending on the shape of and data types in your schema).
{{% /expand %}} {{< /expand-wrapper >}}
{{< product-name >}} lets you define a custom partitioning strategy for each database. A partition is a logical grouping of data stored in Apache Parquet format in the InfluxDB 3 storage engine. By default, data is partitioned by day, but, depending on your schema and workload, customizing the partitioning strategy can improve query performance.
Use the --template-tag, --template-tag-bucket, and --template-timeformat`
flags to define partition template parts used to generate partition keys for the database.
For more information, see Manage data partitioning.
[!Warning]
Partition templates can only be applied on create
You can only apply a partition template when creating a database. You can't update a partition template on an existing database.