ClickHouse - How to recover data from the Sentry DB

Hi There,

I run Sentry on-premises in Kubernetes.
I recently ran into a disk space issue with ClickHouse, which runs inside the Kubernetes cluster. To expand the volume I ended up removing the persistent volume claim (PVC). The volume did expand, but I then noticed the old volumes had been removed, and Sentry events and performance data from before the expansion are no longer showing up in Sentry.

I’m wondering if there is a way to get the data back from the sentry database, which runs externally.

Can someone please assist?

Thanks!

@fpacifici @lynnagara is there any way to backfill Clickhouse from node store/postgres?

The only way I can think of to recover the events is if you still have them in Kafka. What is the retention policy of the Snuba-related Kafka topics? If you have the events in Kafka, you can do something like this:

kafka-consumer-groups --bootstrap-server <kafkahost:port> --group <group_id> --topic <topic_name> --reset-offsets --to-earliest --execute

This will execute the reset, moving the consumer group’s offset for the specified topic back to the earliest Kafka message you still have. You’ll need to do this for all Snuba-related topics.

This way the Snuba consumers will re-read those events and start inserting them into ClickHouse.
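
If you need to rewind several topics, a small loop saves typing. This is a minimal sketch: the broker address, group ID and topic names below are placeholders and will differ per deployment (list yours first with kafka-topics --list):

```shell
# Placeholder values -- substitute your own broker, group and topics.
BOOTSTRAP="kafka:9092"
GROUP="snuba-consumers"
# Verify the topic list against your deployment with:
#   kafka-topics --bootstrap-server "$BOOTSTRAP" --list
TOPICS="events transactions outcomes"

for topic in $TOPICS; do
  echo "resetting $GROUP on $topic to earliest"
  # Only invoke the Kafka CLI if it is actually present on this host
  if command -v kafka-consumer-groups >/dev/null 2>&1; then
    kafka-consumer-groups --bootstrap-server "$BOOTSTRAP" \
      --group "$GROUP" --topic "$topic" \
      --reset-offsets --to-earliest --execute
  fi
done
```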

@chhetripradeep @BYK thanks for your responses.

I reset the offsets on all of the Kafka consumer groups below to the previous month, which is within the retention policy, using the --to-datetime parameter of the kafka-consumer-groups command.

snuba-post-processor
snuba-consumers
ingest-consumer
transactions_group
snuba-replacers
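
For reference, a --to-datetime reset looks like the following. This is a sketch with placeholder broker, group, topic and timestamp; the Kafka CLI expects the datetime in ISO-8601 form with millisecond precision (YYYY-MM-DDTHH:mm:ss.sss):

```shell
# Placeholder timestamp -- pick a point inside your retention window.
RESET_TO="2021-09-01T00:00:00.000"
if command -v kafka-consumer-groups >/dev/null 2>&1; then
  # Moves the group to the first offset at or after RESET_TO
  kafka-consumer-groups --bootstrap-server kafka:9092 \
    --group snuba-consumers --topic events \
    --reset-offsets --to-datetime "$RESET_TO" --execute
else
  echo "kafka-consumer-groups not on PATH; command shown for reference"
fi
```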

The offsets were reset successfully on the events topic. I noticed the LAG (the difference between CURRENT-OFFSET and LOG-END-OFFSET) increased after executing the reset, which was expected, and I then restarted the ClickHouse pods/statefulset. However, the performance data from before the date the volume was expanded is still not showing up, so it doesn’t look like it worked.
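
One note on the step above: per the earlier reply, it is the Snuba consumers that replay events into ClickHouse, so restarting the ClickHouse pods alone does not trigger re-ingestion; the consumer deployments are the ones to restart and watch. To check whether a reset actually took effect, describing the group shows CURRENT-OFFSET, LOG-END-OFFSET and LAG per partition (broker address and group name are placeholders):

```shell
# Placeholders -- substitute your broker address and consumer group.
GROUP="snuba-consumers"
if command -v kafka-consumer-groups >/dev/null 2>&1; then
  # Prints CURRENT-OFFSET, LOG-END-OFFSET and LAG for each partition
  kafka-consumer-groups --bootstrap-server kafka:9092 \
    --group "$GROUP" --describe
else
  echo "kafka-consumer-groups not on PATH; run this where the Kafka CLI lives"
fi
```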

Any other assistance would be appreciated.

Then my guess is there’s an issue with the Kafka topic creation or partition setup. If you can afford data loss, you can try deleting and recreating the Kafka and ZooKeeper volumes (the nuclear option).
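
For completeness, here is the nuclear option sketched with kubectl. Every resource name below is an assumption (statefulset names, PVC names and labels vary by chart and deployment; check yours first with kubectl get statefulsets,pvc), and deleting these PVCs destroys all unprocessed Kafka data:

```shell
# Hypothetical resource names -- verify with: kubectl get statefulsets,pvc
KAFKA_STS="sentry-kafka"
ZK_STS="sentry-zookeeper"
if command -v kubectl >/dev/null 2>&1; then
  # Stop the pods so the volumes are released
  kubectl scale statefulset "$KAFKA_STS" "$ZK_STS" --replicas=0
  # Delete the backing volumes (destroys all in-flight Kafka data);
  # the label selector is an assumption -- check your PVC labels
  kubectl delete pvc -l app="$KAFKA_STS"
  kubectl delete pvc -l app="$ZK_STS"
  # Scale back up; fresh volumes are provisioned for the new pods
  kubectl scale statefulset "$KAFKA_STS" "$ZK_STS" --replicas=1
else
  echo "kubectl not on PATH; commands shown for reference"
fi
```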

@BYK Can you please tell me what data would be lost if the Kafka and ZooKeeper volumes are recreated?

We already can’t see the performance data prior to the start of October.

Thanks!

You’d lose any in-flight data that has not yet been processed. That means you won’t lose anything you can already see in the UI, but any events still waiting to be processed will be gone.