Since yesterday our instance of Sentry doesn’t work. In fact the web app is up but no events are catched.
I’ve run the ./install.sh to upgrade to latest version.
I’ve also run the cleanup command: /usr/bin/docker-compose --file /home/sentry/onpremise/docker-compose.yml exec worker sentry cleanup --days 30
My server has it’s own nginx instance that listen on 443 and use our ssl certificates to pass the request to Sentry on port 9000. Maybe it’s now useless since an nginx container exists (but i don’t know how to configure it to listen on our host:443 + use certificates
But i don’t think it’s the origin of the problem.
Here is the logs:
sentry@vps560644:~/onpremise$ docker-compose logs -f | grep error -i
clickhouse_1 | Logging errors to /var/log/clickhouse-server/clickhouse-server.err.log
clickhouse_1 | Logging errors to /var/log/clickhouse-server/clickhouse-server.err.log
kafka_1 | [main-SendThread(zookeeper:2181)] INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server zookeeper/172.24.0.2:2181. Will not attempt to authenticate using SASL (unknown error)
kafka_1 | [main-SendThread(zookeeper:2181)] INFO org.apache.zookeeper.ClientCnxn - Socket error occurred: zookeeper/172.24.0.2:2181: Connection refused
kafka_1 | [main-SendThread(zookeeper:2181)] INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server zookeeper/172.24.0.2:2181. Will not attempt to authenticate using SASL (unknown error)
kafka_1 | [2020-07-22 14:20:35,165] ERROR [Controller id=1002 epoch=20] Controller 1002 epoch 20 failed to change state for partition __consumer_offsets-22 from OfflinePartition to OnlinePartition (state.change.logger)
postgres_1 | ERROR: relation "south_migrationhistory" does not exist at character 15
kafka_1 | [2020-07-22 14:20:35,185] ERROR [Controller id=1002 epoch=20] Controller 1002 epoch 20 failed to change state for partition __consumer_offsets-30 from OfflinePartition to OnlinePartition (state.change.logger)
kafka_1 | [2020-07-22 14:20:35,188] ERROR [Controller id=1002 epoch=20] Controller 1002 epoch 20 failed to change state for partition __consumer_offsets-8 from OfflinePartition to OnlinePartition (state.change.logger)
kafka_1 | [2020-07-22 14:20:35,189] ERROR [Controller id=1002 epoch=20] Controller 1002 epoch 20 failed to change state for partition __consumer_offsets-21 from OfflinePartition to OnlinePartition (state.change.logger)
kafka_1 | [2020-07-22 14:20:35,190] ERROR [Controller id=1002 epoch=20] Controller 1002 epoch 20 failed to change state for partition __consumer_offsets-4 from OfflinePartition to OnlinePartition (state.change.logger)
kafka_1 | [2020-07-22 14:20:35,191] ERROR [Controller id=1002 epoch=20] Controller 1002 epoch 20 failed to change state for partition outcomes-0 from OfflinePartition to OnlinePartition (state.change.logger)
kafka_1 | [2020-07-22 14:20:35,191] ERROR [Controller id=1002 epoch=20] Controller 1002 epoch 20 failed to change state for partition __consumer_offsets-27 from OfflinePartition to OnlinePartition (state.change.logger)
relay_1 | 2020-07-22T14:57:26Z [rdkafka::client] ERROR: librdkafka: Global error: BrokerTransportFailure (Local: Broker transport failure): kafka:9092/bootstrap: Connect to ipv4#172.24.0.10:9092 failed: Connection refused (after 36ms in state CONNECT)
relay_1 | 2020-07-22T14:57:26Z [rdkafka::client] ERROR: librdkafka: Global error: AllBrokersDown (Local: All broker connections are down): 1/1 brokers are down
relay_1 | 2020-07-22T14:57:27Z [rdkafka::client] ERROR: librdkafka: Global error: BrokerTransportFailure (Local: Broker transport failure): kafka:9092/bootstrap: Connect to ipv4#172.24.0.10:9092 failed: Connection refused (after 0ms in state CONNECT)
relay_1 | 2020-07-22T14:57:27Z [rdkafka::client] ERROR: librdkafka: Global error: AllBrokersDown (Local: All broker connections are down): 1/1 brokers are down
relay_1 | 2020-07-22T14:57:27Z [relay_server::actors::upstream] ERROR: authentication encountered error: could not send request to upstream
relay_1 | 2020-07-22T14:57:27Z [relay_server::actors::upstream] ERROR: authentication encountered error: could not send request to upstream
relay_1 | 2020-07-22T14:57:29Z [relay_server::actors::upstream] ERROR: authentication encountered error: could not send request to upstream
relay_1 | 2020-07-22T14:57:31Z [relay_server::actors::upstream] ERROR: authentication encountered error: could not send request to upstream
relay_1 | caused by: Failed to connect to host: No route to host (os error 113)
relay_1 | caused by: No route to host (os error 113)
relay_1 | caused by: No route to host (os error 113)
relay_1 | 2020-07-22T14:57:33Z [relay_server::actors::upstream] ERROR: authentication encountered error: could not send request to upstream
relay_1 | caused by: Failed to connect to host: Connection refused (os error 111)
relay_1 | caused by: Connection refused (os error 111)
relay_1 | caused by: Connection refused (os error 111)
relay_1 | 2020-07-22T14:57:36Z [relay_server::actors::upstream] ERROR: authentication encountered error: could not send request to upstream
relay_1 | caused by: Failed to connect to host: Connection refused (os error 111)
relay_1 | caused by: Connection refused (os error 111)
relay_1 | caused by: Connection refused (os error 111)
relay_1 | 2020-07-22T14:57:41Z [relay_server::actors::upstream] ERROR: authentication encountered error: could not send request to upstream
relay_1 | caused by: Failed to connect to host: Connection refused (os error 111)
relay_1 | caused by: Connection refused (os error 111)
relay_1 | caused by: Connection refused (os error 111)
kafka_1 | [2020-07-22 14:20:35,192] ERROR [Controller id=1002 epoch=20] Controller 1002 epoch 20 failed to change state for partition __consumer_offsets-7 from OfflinePartition to OnlinePartition (state.change.logger)
...
kafka_1 | [2020-07-22 14:20:35,220] ERROR [Controller id=1002 epoch=20] Controller 1002 epoch 20 failed to change state for partition __consumer_offsets-2 from OfflinePartition to OnlinePartition (state.change.logger)
kafka_1 | [2020-07-22 14:20:35,221] ERROR [Controller id=1002 epoch=20] Controller 1002 epoch 20 failed to change state for partition errors-replacements-0 from OfflinePartition to OnlinePartition (state.change.logger)
kafka_1 | kafka.common.StateChangeFailedException: Failed to elect leader for partition errors-replacements-0 under strategy OfflinePartitionLeaderElectionStrategy(false)
kafka_1 | [2020-07-22 14:20:35,221] ERROR [Controller id=1002 epoch=20] Controller 1002 epoch 20 failed to change state for partition __consumer_offsets-43 from OfflinePartition to OnlinePartition (state.change.logger)
...
relay_1 | 2020-07-22T14:57:49Z [relay_server::actors::project_upstream] ERROR: error fetching project states: attempted to send request while not yet authenticated
relay_1 | 2020-07-22T14:57:49Z [relay_server::actors::upstream] ERROR: authentication encountered error: could not send request to upstream
relay_1 | caused by: Failed to connect to host: Connection refused (os error 111)
relay_1 | caused by: Connection refused (os error 111)
relay_1 | caused by: Connection refused (os error 111)
nginx_1 | 2020/07/22 14:57:49 [error] 6#6: *5 connect() failed (111: Connection refused) while connecting to upstream, client: 172.24.0.1, server: , request: "GET /organizations/sentry/projects/ HTTP/1.0", upstream: "http://172.24.0.21:9000/organizations/sentry/projects/", host: "log.tomhealth.fr", referrer: "https://log.tomhealth.fr/organizations/sentry/issues/?project=10&query=is%3Aunresolved&statsPeriod=14d"
kafka_1 | [main-SendThread(zookeeper:2181)] INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server zookeeper/172.24.0.6:2181. Will not attempt to authenticate using SASL (unknown error)
nginx_1 | 2020/07/22 14:57:50 [error] 6#6: *7 connect() failed (111: Connection refused) while connecting to upstream, client: 172.24.0.1, server: , request: "GET /favicon.ico HTTP/1.0", upstream: "http://172.24.0.21:9000/favicon.ico", host: "log.tomhealth.fr", referrer: "https://log.tomhealth.fr/organizations/sentry/projects/"
relay_1 | 2020-07-22T14:57:50Z [relay_server::actors::project_upstream] ERROR: error fetching project states: attempted to send request while not yet authenticated
nginx_1 | 2020/07/22 14:57:51 [error] 6#6: *9 connect() failed (111: Connection refused) while connecting to upstream, client: 172.24.0.1, server: , request: "GET /organizations/sentry/projects/ HTTP/1.0", upstream: "http://172.24.0.21:9000/organizations/sentry/projects/", host: "log.tomhealth.fr", referrer: "https://log.tomhealth.fr/organizations/sentry/issues/?project=10&query=is%3Aunresolved&statsPeriod=14d"
nginx_1 | 2020/07/22 14:57:51 [error] 6#6: *11 connect() failed (111: Connection refused) while connecting to upstream, client: 172.24.0.1, server: , request: "GET /favicon.ico HTTP/1.0", upstream: "http://172.24.0.21:9000/favicon.ico", host: "log.tomhealth.fr", referrer: "https://log.tomhealth.fr/organizations/sentry/projects/"
nginx_1 | 2020/07/22 14:57:52 [error] 6#6: *15 connect() failed (111: Connection refused) while connecting to upstream, client: 172.24.0.1, server: , request: "GET /organizations/sentry/projects/ HTTP/1.0", upstream: "http://172.24.0.21:9000/organizations/sentry/projects/", host: "log.tomhealth.fr", referrer: "https://log.tomhealth.fr/organizations/sentry/issues/?project=10&query=is%3Aunresolved&statsPeriod=14d"
relay_1 | 2020-07-22T14:57:52Z [relay_server::actors::project_upstream] ERROR: error fetching project states: attempted to send request while not yet authenticated
nginx_1 | 2020/07/22 14:57:52 [error] 6#6: *17 connect() failed (111: Connection refused) while connecting to upstream, client: 172.24.0.1, server: , request: "GET /favicon.ico HTTP/1.0", upstream: "http://172.24.0.21:9000/favicon.ico", host: "log.tomhealth.fr", referrer: "https://log.tomhealth.fr/organizations/sentry/projects/"
nginx_1 | 2020/07/22 14:57:52 [error] 6#6: *19 connect() failed (111: Connection refused) while connecting to upstream, client: 172.24.0.1, server: , request: "GET /organizations/sentry/projects/ HTTP/1.0", upstream: "http://172.24.0.21:9000/organizations/sentry/projects/", host: "log.tomhealth.fr", referrer: "https://log.tomhealth.fr/organizations/sentry/issues/?project=10&query=is%3Aunresolved&statsPeriod=14d"
nginx_1 | 2020/07/22 14:57:53 [error] 6#6: *21 connect() failed (111: Connection refused) while connecting to upstream, client: 172.24.0.1, server: , request: "GET /favicon.ico HTTP/1.0", upstream: "http://172.24.0.21:9000/favicon.ico", host: "log.tomhealth.fr", referrer: "https://log.tomhealth.fr/organizations/sentry/projects/"
nginx_1 | 2020/07/22 14:57:53 [error] 6#6: *23 connect() failed (111: Connection refused) while connecting to upstream, client: 172.24.0.1, server: , request: "GET /organizations/sentry/projects/ HTTP/1.0", upstream: "http://172.24.0.21:9000/organizations/sentry/projects/", host: "log.tomhealth.fr", referrer: "https://log.tomhealth.fr/organizations/sentry/issues/?project=10&query=is%3Aunresolved&statsPeriod=14d"
nginx_1 | 2020/07/22 14:57:53 [error] 6#6: *25 connect() failed (111: Connection refused) while connecting to upstream, client: 172.24.0.1, server: , request: "GET /favicon.ico HTTP/1.0", upstream: "http://172.24.0.21:9000/favicon.ico", host: "log.tomhealth.fr", referrer: "https://log.tomhealth.fr/organizations/sentry/projects/"
relay_1 | 2020-07-22T14:57:54Z [relay_server::actors::project_upstream] ERROR: error fetching project states: attempted to send request while not yet authenticated
post-process-forwarder_1 | %3|1595429876.376|ERROR|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.24.0.10:9092 failed: Connection refused
post-process-forwarder_1 | %3|1595429876.377|ERROR|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: 1/1 brokers are down
post-process-forwarder_1 | %3|1595429876.378|ERROR|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.24.0.10:9092 failed: Connection refused
post-process-forwarder_1 | %3|1595429876.378|ERROR|rdkafka#consumer-2| [thrd:kafka:9092/bootstrap]: 1/1 brokers are down
ingest-consumer_1 | %3|1595429876.427|ERROR|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: kafka:9092/bootstrap: Connect to ipv4#172.24.0.10:9092 failed: Connection refused
ingest-consumer_1 | %3|1595429876.427|ERROR|rdkafka#consumer-1| [thrd:kafka:9092/bootstrap]: 1/1 brokers are down
relay_1 | 2020-07-22T14:57:58Z [relay_server::actors::project_upstream] ERROR: error fetching project states: attempted to send request while not yet authenticated
relay_1 | 2020-07-22T14:58:03Z [relay_server::actors::project_upstream] ERROR: error fetching project states: attempted to send request while not yet authenticated
relay_1 | 2020-07-22T14:58:05Z [relay_server::actors::upstream] ERROR: authentication encountered error: could not send request to upstream
kafka_1 | [2020-07-22 14:58:06,615] ERROR [Controller id=1002 epoch=21] Controller 1002 epoch 21 failed to change state for partition __consumer_offsets-22 from OfflinePartition to OnlinePartition (state.change.logger)
...
kafka_1 | [2020-07-22 14:58:06,766] ERROR [Controller id=1002 epoch=21] Controller 1002 epoch 21 failed to change state for partition __consumer_offsets-24 from OfflinePartition to OnlinePartition (state.change.logger)
kafka_1 | [2020-07-22 14:58:06,770] ERROR [Controller id=1002 epoch=21] Controller 1002 epoch 21 failed to change state for partition cdc-0 from OfflinePartition to OnlinePartition (state.change.logger)
...
kafka_1 | [2020-07-22 14:58:06,832] ERROR [Controller id=1002 epoch=21] Controller 1002 epoch 21 failed to change state for partition errors-replacements-0 from OfflinePartition to OnlinePartition (state.change.logger)
kafka_1 | kafka.common.StateChangeFailedException: Failed to elect leader for partition errors-replacements-0 under strategy OfflinePartitionLeaderElectionStrategy(false)
kafka_1 | [2020-07-22 14:58:06,832] ERROR [Controller id=1002 epoch=21] Controller 1002 epoch 21 failed to change state for partition __consumer_offsets-43 from OfflinePartition to OnlinePartition (state.change.logger)
...
relay_1 | 2020-07-22T14:58:10Z [relay_server::actors::project_upstream] ERROR: error fetching project states: attempted to send request while not yet authenticated
relay_1 | 2020-07-22T14:58:22Z [relay_server::actors::project_upstream] ERROR: error fetching project states: attempted to send request while not yet authenticated
relay_1 | 2020-07-22T14:58:22Z [relay_server::actors::events] ERROR: error processing event: failed to resolve project information
...