Add a space-partitioning dimension to a hypertable
Deprecated since v2.13.0. Use, add_dimension().Add an additional partitioning dimension to a .
The column selected as the dimension can either use interval
partitioning (for example, for a second time partition) or hash partitioning.
The add_dimension command can only be executed after a table has been
converted to a (via create_hypertable), but must similarly
be run only on an empty .
Space partitions: Using space partitions is highly recommended
for distributed s to achieve
efficient scale-out performance. For regular s
that exist only on a single node, additional partitioning can be used
for specialized use cases and not recommended for most users.Space partitions use hashing: Every distinct item is hashed to one of
N buckets. Remember that we are already using (flexible) time
intervals to manage sizes; the main purpose of space
partitioning is to enable parallelization across multiple
data nodes (in the case of distributed s) or
across multiple disks within the same time interval
(in the case of single-node deployments).
Now in a multi-node example for distributed s with a cluster
of one access node and two data nodes, configure the access node for
access to the two data nodes. Then, convert table conditions to
a distributed with just time partitioning on column time,
and finally add a space partitioning dimension on location
with two partitions (as the number of the attached data nodes).
In a distributed , space partitioning enables inserts to be
parallelized across data nodes, even while the inserted rows share
timestamps from the same time interval, and thus increases the ingest rate.
Query performance also benefits by being able to parallelize queries
across nodes, particularly when full or partial aggregations can be
“pushed down” to data nodes (for example, as in the query
avg(temperature) FROM conditions GROUP BY hour, location
when using location as a space partition).
Parallel I/O can benefit in two scenarios: (a) two or more concurrent
queries should be able to read from different disks in parallel, or
(b) a single query should be able to use query parallelization to read
from multiple disks in parallel.Thus, users looking for parallel I/O have two options:
Use a RAID setup across multiple physical disks, and expose a
single logical disk to the (that is, via a single tablespace).
For each physical disk, add a separate tablespace to the
database. allows you to actually add multiple tablespaces
to a single (although under the covers, a ‘s
s are spread across the tablespaces associated with that ).
We recommend a RAID setup when possible, as it supports both forms of
parallelization described above (that is, separate queries to separate
disks, single query to multiple disks in parallel). The multiple
tablespace approach only supports the former. With a RAID setup,
no spatial partitioning is required.That said, when using space partitions, we recommend using 1
space partition per disk. does not benefit from a very large number of space
partitions (such as the number of unique items you expect in partition
field). A very large number of such partitions leads both to poorer
per-partition load balancing (the mapping of items to partitions using
hashing), as well as much increased planning latency for some types of
queries.
True if the dimension was added, false when if_not_exists is true and no dimension was added
When executing this function, either number_partitions or
chunk_time_interval must be supplied, which dictates if the
dimension uses hash or interval partitioning.The chunk_time_interval should be specified as follows:
If the column to be partitioned is a TIMESTAMP, TIMESTAMPTZ, or
DATE, this length should be specified either as an INTERVAL type or
an integer value in microseconds.
If the column is some other integer type, this length
should be an integer that reflects
the column’s underlying semantics (for example, the
chunk_time_interval should be given in milliseconds if this column
is the number of milliseconds since the UNIX epoch).
Supporting more than one additional dimension is currently
experimental. For any production environments, users are recommended
to use at most one “space” dimension.