Standard docker setup.
After upgrade events were accepted for a few hours and then stopped. Started again after server restart and then stooped again. Server restart does not help anymore.
I have kafka related errors in consumer logs:
+ '[' c = - ']'
+ snuba consumer --help
+ set -- snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
+ set gosu snuba snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
+ exec gosu snuba snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
2020-08-10 19:05:49,717 New partitions assigned: {Partition(topic=Topic(name='events'), index=0): 16227028}
2020-08-10 19:05:54,980 Partitions revoked: [Partition(topic=Topic(name='events'), index=0)]
+ '[' c = - ']'
+ snuba consumer --help
+ set -- snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
+ set gosu snuba snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
+ exec gosu snuba snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
%3|1597086403.040|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.10:9092 failed: Connection refused (after 0ms in state CONNECT)
%3|1597086403.051|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.10:9092 failed: Connection refused (after 12ms in state CONNECT)
%3|1597086404.037|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.10:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
%3|1597086404.037|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.10:9092 failed: Connection refused (after 0ms in state CONNECT, 1 identical error(s) suppressed)
Traceback (most recent call last):
File "/usr/local/bin/snuba", line 33, in <module>
sys.exit(load_entry_point('snuba', 'console_scripts', 'snuba')())
File "/usr/local/lib/python3.7/site-packages/click/core.py", line 722, in __call__
return self.main(*args, **kwargs)
File "/usr/local/lib/python3.7/site-packages/click/core.py", line 697, in main
rv = self.invoke(ctx)
File "/usr/local/lib/python3.7/site-packages/click/core.py", line 1066, in invoke
return _process_result(sub_ctx.command.invoke(sub_ctx))
File "/usr/local/lib/python3.7/site-packages/click/core.py", line 895, in invoke
return ctx.invoke(self.callback, **ctx.params)
File "/usr/local/lib/python3.7/site-packages/click/core.py", line 535, in invoke
return callback(*args, **kwargs)
File "/usr/src/snuba/snuba/cli/consumer.py", line 146, in consumer
consumer.run()
File "/usr/src/snuba/snuba/utils/streams/processing.py", line 132, in run
self._run_once()
File "/usr/src/snuba/snuba/utils/streams/processing.py", line 138, in _run_once
msg = self.__consumer.poll(timeout=1.0)
File "/usr/src/snuba/snuba/utils/streams/kafka.py", line 674, in poll
return super().poll(timeout)
File "/usr/src/snuba/snuba/utils/streams/kafka.py", line 412, in poll
raise ConsumerError(str(error))
snuba.utils.streams.consumer.ConsumerError: KafkaError{code=COORDINATOR_LOAD_IN_PROGRESS,val=14,str="JoinGroup failed: Broker: Coordinator load in progress"}
+ '[' c = - ']'
+ snuba consumer --help
+ set -- snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
+ set gosu snuba snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
+ exec gosu snuba snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
2020-08-10 19:07:12,593 New partitions assigned: {Partition(topic=Topic(name='events'), index=0): 16227028}
2020-08-10 20:17:24,210 Partitions revoked: [Partition(topic=Topic(name='events'), index=0)]
+ '[' c = - ']'
+ snuba consumer --help
+ set -- snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
+ set gosu snuba snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
+ exec gosu snuba snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
%3|1597090682.504|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.7:9092 failed: Connection refused (after 1ms in state CONNECT)
%3|1597090682.510|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.7:9092 failed: Connection refused (after 2ms in state CONNECT)
%3|1597090690.505|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.7:9092 failed: Connection refused (after 0ms in state CONNECT, 8 identical error(s) suppressed)
%3|1597090690.508|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.7:9092 failed: Connection refused (after 0ms in state CONNECT, 8 identical error(s) suppressed)
%3|1597090720.508|FAIL|rdkafka#producer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.7:9092 failed: Connection refused (after 0ms in state CONNECT, 29 identical error(s) suppressed)
%3|1597090720.511|FAIL|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.18.0.7:9092 failed: Connection refused (after 0ms in state CONNECT, 29 identical error(s) suppressed)
Traceback (most recent call last):
File "/usr/local/bin/snuba", line 33, in <module>
sys.exit(load_entry_point('snuba', 'console_scripts', 'snuba')())
File "/usr/local/lib/python3.7/site-packages/click/core.py", line 722, in __call__
return self.main(*args, **kwargs)
File "/usr/local/lib/python3.7/site-packages/click/core.py", line 697, in main
rv = self.invoke(ctx)
File "/usr/local/lib/python3.7/site-packages/click/core.py", line 1066, in invoke
return _process_result(sub_ctx.command.invoke(sub_ctx))
File "/usr/local/lib/python3.7/site-packages/click/core.py", line 895, in invoke
return ctx.invoke(self.callback, **ctx.params)
File "/usr/local/lib/python3.7/site-packages/click/core.py", line 535, in invoke
return callback(*args, **kwargs)
File "/usr/src/snuba/snuba/cli/consumer.py", line 146, in consumer
consumer.run()
File "/usr/src/snuba/snuba/utils/streams/processing.py", line 132, in run
self._run_once()
File "/usr/src/snuba/snuba/utils/streams/processing.py", line 138, in _run_once
msg = self.__consumer.poll(timeout=1.0)
File "/usr/src/snuba/snuba/utils/streams/kafka.py", line 674, in poll
return super().poll(timeout)
File "/usr/src/snuba/snuba/utils/streams/kafka.py", line 412, in poll
raise ConsumerError(str(error))
snuba.utils.streams.consumer.ConsumerError: KafkaError{code=COORDINATOR_LOAD_IN_PROGRESS,val=14,str="JoinGroup failed: Broker: Coordinator load in progress"}
+ '[' c = - ']'
+ snuba consumer --help
+ set -- snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
+ set gosu snuba snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
+ exec gosu snuba snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
2020-08-10 20:19:00,698 New partitions assigned: {Partition(topic=Topic(name='events'), index=0): 16227028}
2020-08-10 22:05:42,556 Partitions revoked: [Partition(topic=Topic(name='events'), index=0)]
+ '[' c = - ']'
+ snuba consumer --help
+ set -- snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
+ set gosu snuba snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
+ exec gosu snuba snuba consumer --storage events --auto-offset-reset=latest --max-batch-time-ms 750
2020-08-10 22:05:46,575 New partitions assigned: {Partition(topic=Topic(name='events'), index=0): 16227028}
But kafka looks to be working fine.