Is the Kafka key necessary?


When a project reports a large number of errors, all of its data lands on the same partition because project_id is used as the Kafka message key, which causes the ClickHouse consumer to lag.
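For context, here is a minimal sketch of why a keyed producer pins all of a project's traffic to one partition. The hash below is only a stand-in, not Kafka's actual murmur2 partitioner, and the partition count of 8 is made up:

```python
import hashlib

def partition_for(key: bytes, num_partitions: int) -> int:
    """Map a message key to a partition, the way Kafka's default
    partitioner does (hash of the key modulo the partition count).
    md5 here is only a stand-in for Kafka's murmur2 hash."""
    digest = int.from_bytes(hashlib.md5(key).digest()[:4], "big")
    return digest % num_partitions

# Every event keyed by project_id 42 maps to the same partition,
# no matter how many events that project produces:
partitions = {partition_for(b"42", 8) for _ in range(1000)}
assert len(partitions) == 1
```

So a burst from one project cannot be spread across partitions, and only the single consumer assigned to that partition can drain it.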
Question:
Can I delete the key in the producer?

@fpacifici thoughts on this?

Hi,

unfortunately, semantic partitioning (by project id) of the main events topic is a functional requirement, so you cannot remove the key there.
This requirement preserves sequential consistency between the order of a project's events and several pieces of functionality downstream of Kafka. Specifically: alerts require this partitioning to produce correct results (which is why you cannot create a metric alert across projects); event mutability (delete/merge/unmerge) requires the same guarantee; and several post-processing actions, such as external integrations, must run only after the events are already stored.
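As a small illustration of why that ordering matters, consider a hypothetical insert followed by a delete of the same event. Because both carry the same project_id key, they land on the same partition and one consumer applies them in produced order; if they could land on different partitions, the delete might be processed before the insert and the deleted event would reappear:

```python
# Hypothetical event stream for one project: an insert followed by a
# delete of the same event. Keyed by project_id, both land on the same
# partition, so a single consumer applies them in produced order.
events = [("insert", "evt1"), ("delete", "evt1")]

state = set()  # events currently stored
for op, event_id in events:
    if op == "insert":
        state.add(event_id)
    elif op == "delete":
        state.discard(event_id)

# In-order processing leaves no resurrected event behind.
assert state == set()
```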

There is a way to speed up the consumer, though, by processing events on multiple cores.
The Snuba consumers take three parameters:

  • processes: the number of processes that handle events concurrently (defaults to 1)
  • input-block-size: the size in bytes of the input buffer dispatched to the worker processes; make sure it is large enough to hold many events (more than 50 MB)
  • output-block-size: the size in bytes of the buffer used to reassemble the messages; it should be larger than the input block size

You can set them wherever the Snuba consumer commands are defined (for self-hosted deployments, in the docker-compose service definitions for the Snuba consumers).

If you set all three, your consumer should run on multiple cores and throughput should increase significantly (at the cost of more system resources).
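As a sketch, assuming a typical self-hosted docker-compose layout (the service name, storage name, and block sizes below are illustrative, not taken from this thread), the three flags could be wired in like this:

```yaml
# Illustrative docker-compose service override; adjust the service
# and storage names to match your deployment. 64 MiB input block
# (comfortably above 50 MB) and a larger 128 MiB output block.
snuba-consumer:
  command: >
    consumer --storage events --auto-offset-reset=latest
    --processes 4
    --input-block-size 67108864
    --output-block-size 134217728
```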

Hope this helps, and I'm happy to answer any further questions.

Filippo

That works for the ClickHouse consumer, but the post-process-forwarder has the same problem, and it has no processes parameter.

How can I improve the post-process-forwarder's processing speed?

Hi,

sorry, there is no immediate solution yet.
The multiprocess consumer relies on Python shared-memory support, which was introduced in Python 3.8. Sentry is still being migrated to 3.8, and we are actively working on porting the multiprocess consumer once that migration lands.

However, this won’t be available before the end of July.

Best
Filippo

Is it possible to increase the number of post-process-forwarder instances?

You should be able to start multiple forwarders, but they consume from the same topic as the Snuba consumer, which is partitioned the same way. If you receive too many events for a single project, they will all be on one partition and thus processed by the same post-process-forwarder instance.

Still, you should be able to start multiple instances simply by scaling out the post-process-forwarder service with docker-compose. This is not Sentry-specific; it is just docker-compose's scale option: docker-compose up --scale post-process-forwarder=NUMBER_OF_REPLICAS
You can also increase the number of messages processed before committing, which reduces commit overhead and can give a performance gain: add --commit-batch-size 500 in the docker-compose file.
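Putting both suggestions together, a sketch (the exact service name and base command depend on your docker-compose file, so treat these as placeholders):

```yaml
# docker-compose.yml: add the commit batch size to the forwarder's command
post-process-forwarder:
  command: >
    run post-process-forwarder --commit-batch-size 500
```

Then scale it out with: docker-compose up -d --scale post-process-forwarder=3. Keep in mind that, because of the partitioning discussed above, extra replicas help overall throughput but not a single hot project.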

Best
Filippo