There are several key parameters that should be adjusted for optimal performance, security, and stability. The default settings are usually designed for small systems and need tuning based on your hardware, workload, and specific use cases. Here's a breakdown of important parameters to adjust:
1. Memory Settings
- shared_buffers
- The amount of memory the database server uses for shared memory buffers.
- A common starting point is 25% of physical RAM on systems with more than 1 GB of RAM.
- Larger shared_buffers settings usually require a corresponding increase in max_wal_size and benefit from enabling huge_pages, as in the sketch below.
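A minimal postgresql.conf sketch of that starting point, assuming a hypothetical dedicated server with 16 GB of RAM (the RAM figure is illustrative, not from any specific recommendation):

```
# postgresql.conf -- assumed dedicated server with 16 GB RAM
shared_buffers = 4GB   # 25% of physical RAM
max_wal_size = 4GB     # raised alongside shared_buffers (default is 1 GB)
huge_pages = try       # the default; uses huge pages when the kernel provides them
```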
- wal_buffers
- The amount of shared memory used for WAL data that has not yet been written to disk
- Default value: -1, which auto-tunes to about 3% (1/32) of shared_buffers, but not less than 64kB nor more than the size of one WAL segment (typically 16MB)
- The contents of the WAL buffers are written out to disk at every transaction commit, so extremely large values are unlikely to provide a significant benefit. However, setting this value to at least a few megabytes can improve write performance on a busy server where many clients are committing at once. The auto-tuning selected by the default setting of -1 should give reasonable results in most cases
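In most cases the default is best left alone; if you do set it explicitly, a sketch might look like this (the 16MB figure is the common "one WAL segment" ceiling, shown only for illustration):

```
# postgresql.conf
#wal_buffers = -1    # default: auto-tune to ~3% of shared_buffers, capped at one WAL segment
wal_buffers = 16MB   # an explicit value sometimes used on busy, commit-heavy servers
```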
- work_mem
- Used for sorting tuples by ORDER BY and DISTINCT operations, and for joining tables by merge-join and hash-join operations.
- Be careful with high values: the limit applies per sort or hash operation, not per query, so a complex query running several such operations can consume many multiples of work_mem during heavy workloads
- A common sizing formula is (Total RAM * 0.25) / max_connections, worked through below
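Working that formula through for the same assumed 16 GB server, with a hypothetical max_connections of 200:

```
# work_mem = (Total RAM * 0.25) / max_connections
# (16 GB * 0.25) / 200 connections = ~20 MB
work_mem = 20MB   # per sort/hash operation, not per query, so size conservatively
```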
- temp_buffers
- Used for temporary buffers within each database session and for storing temporary tables.
- This setting can be changed within individual sessions, but only before the first use of temporary tables within the session; subsequent attempts to change the value will have no effect on that session
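A short SQL sketch of that session-scoped behavior:

```sql
-- temp_buffers must be set before the session first touches a temporary table
SET temp_buffers = '256MB';            -- takes effect: no temp tables used yet
CREATE TEMP TABLE staging (id int);    -- first temp-table use locks the setting in
SET temp_buffers = '512MB';            -- too late: the change is rejected for this session
```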
- maintenance_work_mem
- Specifies the maximum amount of memory to be used by maintenance operations, such as VACUUM, CREATE INDEX, and ALTER TABLE ADD FOREIGN KEY.
- Larger settings might improve performance for vacuuming and for restoring database dumps.
- It’s recommended to set this value higher than work_mem, which can improve performance for vacuuming; a common rule of thumb is Total RAM * 0.05 (see the sketch below)
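On the same assumed 16 GB server, the 5% rule works out to roughly:

```
# maintenance_work_mem = Total RAM * 0.05
# 16 GB * 0.05 = ~800 MB
maintenance_work_mem = 800MB   # only one maintenance operation runs per session
```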
- effective_cache_size
- An estimate of how much RAM is available for caching data pages, counting both the shared buffers and the OS cache.
- Used by the PostgreSQL query optimizer to determine whether indexes fit in RAM; it is a planner hint and does not allocate any memory itself.
- A higher value makes index scans more likely; a lower value makes sequential scans more likely
- Set the value between 50% and 75% of the available RAM, as sketched below
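For the assumed 16 GB dedicated host, the 75% end of that range gives:

```
# postgresql.conf
effective_cache_size = 12GB   # ~75% of RAM; a planner hint only, no memory is allocated
```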
- huge_pages
- Controls whether the server requests huge pages for its main shared memory area (try, on, or off). At the OS level, turn off Transparent Huge Pages (THP) and reserve explicit huge pages instead.
- Leave huge_page_size = 0 (the default value) to use the system's default huge page size
- Steps to enable huge pages (sketched below):
- Set the shared_buffers and huge_pages (on) parameters
- Set the kernel shmmax and shmall parameters:
- shmmax = shared_buffers + 1 GB of headroom
- shmall = the sum of shmmax across all PostgreSQL instances on the host
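A sketch of those steps on Linux, assuming shared_buffers = 4GB and the default 2 MB huge page size (the page count is illustrative; derive yours from the actual shared memory size):

```bash
# Disable Transparent Huge Pages (THP)
echo never | sudo tee /sys/kernel/mm/transparent_hugepage/enabled

# PostgreSQL 15+ can report exactly how many huge pages it needs (run with the server stopped):
postgres -D "$PGDATA" -C shared_memory_size_in_huge_pages

# Reserve that many pages in the kernel (~2200 pages of 2 MB covers 4 GB plus overhead)
sudo sysctl -w vm.nr_hugepages=2200
echo 'vm.nr_hugepages = 2200' | sudo tee /etc/sysctl.d/10-pg-hugepages.conf

# Then in postgresql.conf, and restart the server:
#   shared_buffers = 4GB
#   huge_pages = on
```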
2. Connection Settings
- max_connections
- Determines the maximum number of concurrent connections to the database server.
- When running a standby server, you must set this parameter to the same or higher value than on the primary server
- Consider a connection pooler such as PgBouncer or Pgpool-II to handle large numbers of client connections instead of raising this value.
- superuser_reserved_connections
- Determines the number of connection “slots” that are reserved for connections by PostgreSQL superusers.
- The default value is three connections.
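A sketch of the connection-related settings (the max_connections value is illustrative):

```
# postgresql.conf
max_connections = 200                # on a standby, must be >= the primary's value
superuser_reserved_connections = 3   # the default; keeps emergency slots for superusers
```

Beyond a few hundred clients, a transaction-pooling PgBouncer in front of the server is usually cheaper than raising max_connections further.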
3. WAL and Checkpoint Settings
- wal_level
- Determines how much information is written to the WAL.
- The default value is replica, which writes enough data to support WAL archiving and replication, including running read-only queries on a standby server
- wal_compression
- This parameter enables compression of WAL using the specified compression method. When enabled, the PostgreSQL server compresses full page images written to WAL when full_page_writes is on or during a base backup
- Enabling compression can reduce the WAL volume without increasing the risk of unrecoverable data corruption, but at the cost of some extra CPU spent on the compression during WAL logging and on the decompression during WAL replay
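A sketch of these two settings (lz4 and zstd are only available if the server was built with their support; 'on' selects pglz):

```
# postgresql.conf
wal_level = replica    # the default; supports archiving, replication, and standby reads
wal_compression = lz4  # pglz is always available; lz4 needs PG 14+, zstd needs PG 15+
```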
- max_wal_size
- Maximum size to let the WAL grow during automatic checkpoints
- The default is 1 GB.
- Increasing this parameter can increase the amount of time needed for crash recovery
- min_wal_size
- If WAL disk usage stays below the value specified by min_wal_size, old WAL files are recycled for future use at a checkpoint rather than removed.
- This feature ensures that enough WAL space is reserved to handle spikes in WAL usage, like when running large batch jobs.
- Default value: 80 MB
- checkpoint_timeout
- The maximum amount of time between two WAL checkpoints.
- A checkpoint is begun every checkpoint_timeout seconds, or if max_wal_size is about to be exceeded, whichever comes first. The default settings are 5 minutes and 1 GB, respectively.
- Reducing checkpoint_timeout and/or max_wal_size causes checkpoints to occur more often. This allows faster after-crash recovery, since less work will need to be redone. However, one must balance this against the increased cost of flushing dirty data pages more often
- checkpoint_completion_target
- To avoid flooding the I/O system with a burst of page writes, writing dirty buffers during a checkpoint is spread over a period of time.
- Specifies the target of checkpoint completion as a fraction of the checkpoint interval: the write rate is adjusted so that the checkpoint finishes when the given fraction of checkpoint_timeout seconds has elapsed, or before max_wal_size is exceeded, whichever is sooner.
- If checkpoint_completion_target equals 0.8, for example, PostgreSQL writes checkpoint pages slowly enough that the current checkpoint is done when 80% of the estimated time to the start of the next checkpoint has passed.
- Typically recommended to set it to no higher than 0.9 (the default) since checkpoints include some other activities besides writing dirty buffers
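Putting the checkpoint settings together, a hypothetical sketch for a write-heavy server with disk space to spare:

```
# postgresql.conf
max_wal_size = 4GB                   # default 1 GB; fewer size-triggered checkpoints
min_wal_size = 1GB                   # default 80 MB; recycles segments for WAL spikes
checkpoint_timeout = 15min           # default 5 min; longer means slower crash recovery
checkpoint_completion_target = 0.9   # default; spread writes across 90% of the interval
```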
4. Autovacuum Settings
- autovacuum_max_workers
- Specifies the maximum number of autovacuum processes (other than the autovacuum launcher) that may be running at any one time.
- Increase the number of autovacuum workers for large databases to maintain table health.
- autovacuum_work_mem
- Specifies the maximum amount of memory to be used by each autovacuum worker process.
- It defaults to -1, indicating that the value of maintenance_work_mem should be used instead
- autovacuum_vacuum_cost_limit
- Specifies the cost limit value that will be used in automatic VACUUM operations
- If -1 is specified (which is the default), the regular vacuum_cost_limit value will be used.
- On big tables, raising this limit lets autovacuum do more work between its cost-based sleeps, reducing the time it takes a vacuum to complete
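A hypothetical sketch for a large, write-heavy database:

```
# postgresql.conf
autovacuum_max_workers = 6            # default 3; changing it requires a server restart
autovacuum_work_mem = 1GB             # default -1 (falls back to maintenance_work_mem)
autovacuum_vacuum_cost_limit = 2000   # default -1 (falls back to vacuum_cost_limit, 200)
```

Note that the cost limit is shared proportionally among the running autovacuum workers, so raising autovacuum_max_workers alone does not make each individual worker faster.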
5. Logging Settings
- log_min_duration_statement
- Logs each statement that runs for at least the specified number of milliseconds, helping identify slow queries (-1, the default, disables it).
- When used together with log_statement, the text of statements already logged by log_statement is not repeated in the duration log message
- log_statement
- Controls which SQL statements are logged.
- Valid values are none (off), ddl, mod, and all (all statements)
- mod logs all DDL statements, plus data-modifying statements such as INSERT, UPDATE, DELETE, TRUNCATE, and COPY FROM.
- log_checkpoints
- Causes checkpoints and restartpoints to be logged. The default is on (since PostgreSQL 15; earlier releases default to off)
- log_connections & log_disconnections
- Enable both in production; the resulting audit trail of session starts and ends is well worth the small logging overhead.
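A sketch of a production-leaning logging configuration (the 500 ms threshold is an arbitrary example; pick one that matches your latency expectations):

```
# postgresql.conf
log_min_duration_statement = 500ms   # -1 disables; 0 would log every statement's duration
log_statement = ddl                  # none | ddl | mod | all
log_checkpoints = on
log_connections = on
log_disconnections = on
```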
6. Disk and I/O Settings
- random_page_cost
- Estimate of the cost of a non-sequentially-fetched disk page.
- This value can be overridden for tables and indexes in a particular tablespace by setting the parameter of the same name on that tablespace (ALTER TABLESPACE ... SET (random_page_cost = ...)).
- The default value for random_page_cost is 4.0. This assumes random I/O is 4 times more expensive than sequential I/O, because moving a disk's read/write head to non-adjacent locations takes longer than reading sequentially stored data. Values that are too low or too high skew the query planner toward or away from index scans.
- Set this parameter low or high based on the storage type (HDD, SSD, ...), available RAM, and database size; for SSDs, values around 1.1 are common, as in the sketch at the end of this section.
- seq_page_cost
- Sets the planner's estimate of the cost of a disk page fetch that is part of a series of sequential fetches.
- If using a system with faster storage (like SSDs) or if most data is cached in memory, you might want to reduce both seq_page_cost and random_page_cost to reflect the lower cost of I/O
- effective_io_concurrency
- Sets the number of concurrent disk I/O operations that PostgreSQL expects can be executed simultaneously.
- SSDs and other memory-based storage can often process many concurrent requests, so the best value might be in the hundreds.
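A sketch of these settings for SSD-backed storage (the values reflect common community guidance, not universal rules):

```
# postgresql.conf
random_page_cost = 1.1           # default 4.0 assumes spinning disks
seq_page_cost = 1.0              # the default; lower both costs only if data is fully cached
effective_io_concurrency = 200   # default 1; SSDs/NVMe handle many concurrent reads
```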
Ref:
- Tuning PostgreSQL performance [most important settings] | Uptrace
- PostgreSQL Performance Tuning: How to Size Your Database | Timescale
- PostgreSQL Performance Tuning and Optimization Guide | Sematext
- PostgreSQL Performance Tuning Settings | Vlad Mihalcea
- How to Get the Most out of Postgres Memory Settings | Tembo
- Tune Linux Kernel Parameters For PostgreSQL Optimization | Percona
- Tuning shared_buffers and wal_buffers | EDB
- How to tune PostgreSQL for memory | EDB
- Improving PostgreSQL performance without making changes to PostgreSQL | EDB