How many containers with consumers i should run for Sentry?

ingtarius · September 21, 2021, 3:14pm

Hello!
I run 21.3.0 Sentry in production and follow this onpremise/docker-compose.yml at 48c855aa3def4557ef799d878c75832662b5c67d · getsentry/onpremise · GitHub file for deploy different containers for Sentry.
In this file, all containers is uniq and have only 1 copy.
I see slow updating my events in Sentry, redis cache is growning very fast. I try to tune it and face this issue:
how many different containers i can run?
For example, i run 3 snuba-transactions-consumer containers and see, that only one is working (suppose, any other "consumer"containers also work only in single mode?)
sentry-web and worker is clear, as many as posible
what about symbolicator? snuba? relay? sentry consumers?

Thanks for any support. It will be great to find this info in docs

BYK · September 21, 2021, 4:52pm

I think this may help you: https://develop.sentry.dev/self-hosted/troubleshooting/#workers

ingtarius · September 21, 2021, 8:14pm

This link is very interesting. I will try tomorrow run this type of workers.
But my question is about another components.
How many consumer containers i can run? How many snuba?

ingtarius · September 21, 2021, 8:24pm

For example, i have 3 container with snuba_transactions_consumer, but only one of them write log about processing
Like this:

Sep 21 23:22:14 snuba-transactions-consumer03 [792]: 2021-09-21 20:22:14,178 Completed processing <Batch: 20 messages, open for 2.38 seconds>.

BYK · September 22, 2021, 11:36am

I don’t know why you are focusing on Snuba consumers. They already have built-in multi-process support and are very unlikely to be your bottlenecks. If you are having issues with a large redis, that is a clear indication of the need for more and dedicated workers as redis is used as the job pool for them.

ingtarius · September 22, 2021, 12:14pm

I focused on this consumers becouse first my affort was increase all components for speed up sentry. I add more sentry-workers, consumers etc and find, that increasing consumers have no influence on queue. I re-check logs and find, that only one consumer work like expected.
Now my redis is fine, but i still want to know - how many copy of sentry components i can run?
Many times ago sentry has only worker, web and cron.
Now it have 18 different containers ( relay, sentry-ingest-consumer, sentry-post-process-forwarder, sentry-subscription-consumer-events, sentry-subscription-consumer-transactions, sentry-web, sentry_cron, sentry_worker, snuba-consumer, snuba-outcomes-consumer, snuba-replacer, snuba-sessions-consumer, snuba-subscription-consumer-events, snuba-subscription-consumer-transactions, snuba-transactions-consumer, snuba, symbolicator etc)

As i see, i can run consumers (all containers communicated with kafka) in one copy. Two containers doesnt work faster? Also snuba-replacer? sentry-post-process-forwarder?

BYK · September 22, 2021, 1:05pm

You can scale many components, including Snuba. You just need to adjust some settings accordingly but I don’t know enough about those. I think @fpacifici can help.

fpacifici · September 24, 2021, 4:46pm

Hi,
most Snuba consumers (transactions-consumer, errors-consumer, outcomes-consumer, sessions-consumer) can be scaled out either by adding more containers or by adding processes to a single container.

Adding containers
First you need to increase the number of partitions in Kafka for the topic relevant to them. Each Kafka partition can only be consumed by one consumer at a time in order to preserve Kafka in order delivery. So you need to increase the partitions and then increase the number of consumers otherwise only one will do the work.
Errors and Transactions are semantically partitioned by project id so a single Sentry project id will always be in the same partition. Scaling out by adding consumers helps when you have a lot of projects it does not help for a single project. (outcomes and sessions do not have this constraint).

Adding processes
In order to increase capacity for a single partition (also works when you have many projects to an extent) you can increase the number of processes that are consuming messages by setting these three parameters in the consumer CLI:

--processes
--input-block-size
--output-block-size

block sizes are in bytes and contain the size of the shared memory area the consumer will use. The right size depends on your load and event size.

Though I would start scaling the consumer only when you experience backlog in Kafka, which means the consumer cannot keep up. Redis, as mentioned above, is unrelated.

Best
Filippo

system · December 13, 2021, 2:27pm

This topic was automatically closed 15 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Where is bottlenect in sentry onpremise? On-Premise	6	1525	March 22, 2021
Tuning workers performance On-Premise	2	1767	July 5, 2018
Sentry 10 reports ghost issues number in release On-Premise	4	1864	March 16, 2020
Adding more servers with worker role On-Premise	2	2826	August 14, 2018
How to snuba install on sentry or sentry pod	7	3115	June 22, 2020

How many containers with consumers i should run for Sentry?

Related topics