Data volumes persistence

IlyaKochnev · May 14, 2020, 8:31am

Hello!

I’m planning to migrate from Sentry version 9 to 10.
In Sentry v9 all data stored in the Postgres database. So it was the only data source I need to worry about to keep my installation state and data.
Now in version 10, there are a lot of new components, that require their own storage such as Redis, Kafka, Clickhouse, Symbolicator, Zookeeper.
Which of them require persistence volumes in order to keep Sentry state and data save over restarts, migrations, etc?
Which of them require ephemeral storage to process the runtime data and it’s safe to drop/lose such storage without data corruption or loss?
I plan to run the installation in K8s and have all components clustered/duplicated anyway, so there should be no single point of failure. I’m OK to lose a portion of incoming data in the case of a single component failure. However, I’d like to preserve the already collected data.
Please advise, what consequences will volume data loss have of these components for Sentry?

BYK · May 14, 2020, 10:25am

The essential bits for storing historical data would be Postgres and Clickhouse data volumes. All others are less important. That said Redis may have in-flight or pending jobs, Kafka (and Zookeeper) may have events yet to be processed or post-processed, and Symbolicator would hold processed minidumps or native crashes which may get lost or take extra resources to recreate.

IlyaKochnev · May 14, 2020, 1:15pm

@BYK thanks for the reply!
Is there documentation describing relations between microservices and their purpose in the Sentry landscape and what are they responsible for?
There was a doc for on-prime v9, but I can’t see it anymore. Will there be a doc for v10?

BYK · May 15, 2020, 6:36pm

The relations are described by the docker-compose.yml file and their service dependencies (for this one, I believe yaml code is pretty readable and doesn’t need extra docs). Regarding the purposes, they are mostly about the new architecture and the following posts should shed some light:

We may create an overall architecture doc in the near future (noting this request down) but it doesn’t exist right now.

Topic		Replies	Views
Backup of Sentry components On-Premise	3	2332	April 9, 2021
Persistence requirements for Sentry On-Premise	1	1455	July 11, 2018
Is sentry deployed by docker store data outside the docker container? On-Premise	3	1257	January 22, 2021
Deploying on-premise Sentry 10 on AWS On-Premise	3	2891	May 8, 2020
Sentry and Docker volumes On-Premise	5	2727	April 14, 2021

Data volumes persistence

Related topics