Postgres DB growth

Hi.
We’re using version 20.7.2, and we’ve run into a situation where the Postgres database grows without bound.
The cleanup cron is set up to run every 24h and SENTRY_EVENT_RETENTION_DAYS=30, but the nodestore_node table keeps growing by about 9 GB a day.
Here is the nodestore_node table size compared with the other tables, along with the volume sizes of the other services:

    table_schema    |               table_name                | row_estimate |   total    |   index    |   toast    |   table
--------------------+-----------------------------------------+--------------+------------+------------+------------+------------
 public             | nodestore_node                          |  4.35109e+07 | 213 GB     | 7849 MB    | 202 GB     | 4107 MB
 public             | sentry_releasefile                      |       825410 | 370 MB     | 239 MB     | 8192 bytes | 131 MB
 public             | sentry_grouphash                        |        12398 | 357 MB     | 225 MB     |            | 132 MB
 public             | sentry_file                             |       884136 | 332 MB     | 201 MB     | 8192 bytes | 131 MB
 public             | sentry_fileblobindex                    |       933114 | 158 MB     | 102 MB     |            | 55 MB
 public             | sentry_commitfilechange                 |       119278 | 41 MB      | 24 MB      |            | 17 MB
 public             | sentry_fileblob                         |        76091 | 32 MB      | 21 MB      | 8192 bytes | 11 MB

sentry-postgres                                                    1                   231.5GB
sentry-zookeeper                                                   1                   406kB
sentry-kafka                                                       1                   45.41GB
sentry-redis                                                       1                   3.187MB
sentry-clickhouse                                                  1                   27.58GB
sentry-data                                                        6                   40.58GB
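
For reference, a size breakdown like the one above can be obtained with a query roughly along these lines (a sketch using the standard catalogs; column aliases are renamed slightly to avoid reserved words):

    -- Per-table size breakdown (sketch): total, index, TOAST and heap sizes
    SELECT n.nspname                                      AS table_schema,
           c.relname                                      AS table_name,
           c.reltuples                                    AS row_estimate,
           pg_size_pretty(pg_total_relation_size(c.oid))  AS total,
           pg_size_pretty(pg_indexes_size(c.oid))         AS index_size,
           pg_size_pretty(pg_total_relation_size(NULLIF(c.reltoastrelid, 0))) AS toast,
           pg_size_pretty(pg_relation_size(c.oid))        AS table_size
    FROM pg_class c
    JOIN pg_namespace n ON n.oid = c.relnamespace
    WHERE c.relkind = 'r'
      AND n.nspname = 'public'
    ORDER BY pg_total_relation_size(c.oid) DESC;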

The last autovacuum of nodestore_node was on 2020-10-23 (before the upgrade to 20.7.2).

         relname                 | last_vacuum |        last_autovacuum        |         last_analyze          |       last_autoanalyze
---------------------------------+-------------+-------------------------------+-------------------------------+-------------------------------
 nodestore_node                  |             | 2020-10-23 20:27:16.052523+00 | 2020-11-02 16:55:38.190337+00 | 2020-11-18 08:08:10.848068+00
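
(The timestamps above come from the cumulative statistics views; something like this, assuming the standard pg_stat_user_tables view:)

    -- Last manual/auto vacuum and analyze times for the nodestore table
    SELECT relname, last_vacuum, last_autovacuum, last_analyze, last_autoanalyze
    FROM pg_stat_user_tables
    WHERE relname = 'nodestore_node';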

The last time we ran VACUUM FULL it freed up about 70 GB, when the whole PG volume was about 170 GB. But since then, as you can see, it has grown even bigger.
Could you please tell us how to determine whether this is normal behaviour or not? And what can we do if it isn’t? I’ve read that vacuuming the DB periodically is not really a good option.

P.S. In the Nginx logs we see this:

{ "@timestamp": "2020-11-18T12:00:43+00:00", "remote_addr": "IP_ADDRESS", "body_bytes_sent": "41", "server_name": "SERVER_ADDRESS", "dest_port": "PORT", "host": "SERVER_ADDRESS", "http_x_header": "", "query_string": "", "status": "200", "uri_path": "/api/12/envelope/", "request_method": "POST", "http_referrer": "", "content_type": "application/json", "http_x_forwarded_for": "", "upstream_response_time": "0.002", "upstream_connect_time": "0.001", "upstream_header_time": "0.002", "request_time": "0.006", "request_id": "2b295c63627d3e77d963a978460ab47e", "http_user_agent": "" }

These requests come in about 20 times a second (with different project IDs). Before we upgraded to 20.7.2 and added the Relay service, they were denied (though we had no problems receiving events/issues). Could this be related to our problem?

I think you have Performance enabled, which collects a lot of run-time data. Will defer to @matt and @markstory on this though.

By the way, sharing your event rate would help us make more educated guesses on this.

Regarding disk space of Postgres…

Nodestore is extremely expensive in SQL (especially because of TOAST tables), which is why we don’t run it in Postgres ourselves. I’m not sure which backends are still supported, but I’d start by moving that data to a simple key/value store.

You can also use pg_repack to avoid the downtime that a VACUUM FULL incurs. Eventually you have to do one of these, because Postgres can’t consistently reclaim disk space on its own.
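
A rough sketch of both options, assuming the pg_repack extension is available for your Postgres version (the table name is taken from the output above):

    -- pg_repack needs its extension installed in the target database first;
    -- the actual repack is then driven by the pg_repack command-line client,
    -- e.g. something like: pg_repack --table=nodestore_node <dbname>
    CREATE EXTENSION IF NOT EXISTS pg_repack;

    -- The blocking alternative: rewrites the table under an exclusive lock
    VACUUM FULL nodestore_node;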

Regarding the envelope endpoint: that’s our new ingestion URL, and envelopes are the new consolidated format for events.

Thank you for the quick response.
According to the Performance page, yes, it’s enabled. There have been 33,806,540 events in total since upgrading to 20.7.2 (Oct 28). What’s the way to disable it? Just removing "organizations:performance-view" from sentry.conf.py? Will that clean up its data?
Event rate looks like this:
[image: event rate graph]
Regarding moving to a key/value store, is there any guide for this?

We’ve removed "organizations:performance-view" from sentry.conf.py, but that only removed the Performance page from the web UI; we still receive millions of transaction events, so the DB keeps growing.


Is there a way to disable it completely?