It's connecting with hostname “kafka” as per the default docker compose file. I was using the same hostname when testing with ping and kafkacat.
I will attach Kafka and ZooKeeper logs shortly. The Kafka logs seemed fine when I looked at them; I don't know about ZooKeeper, since I wasn’t aware it had much to do with this (I've never used Kafka).
Can you then check whether the IP address resolved from the kafka hostname in your trials matches the one in the error messages? If not, this looks like a bad DNS cache somewhere.
I can ping “kafka” in the Relay container and get the same IP as the logs show.
Sadly I couldn’t figure out a way for kafkacat to tell me the IP it's connecting to. Here's the kafkacat output anyway: https://pastebin.com/CtDMTRbk
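For anyone following along: one way to see which IP a client ends up talking to is a few lines of Python. This is only a rough sketch, not something from the Sentry setup itself. The function names are made up for illustration, and port 9092 in the usage note is an assumption (the usual Kafka default). It also only covers the first (bootstrap) connection; a Kafka client then connects to whatever address the broker advertises, which may be different.

```python
import socket


def resolve_ipv4(hostname):
    """Return the sorted, de-duplicated IPv4 addresses the local resolver gives for hostname."""
    infos = socket.getaddrinfo(
        hostname, None, family=socket.AF_INET, type=socket.SOCK_STREAM
    )
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr[0] is the IP.
    return sorted({info[4][0] for info in infos})


def connected_peer(hostname, port, timeout=5.0):
    """Open a plain TCP connection and return the (ip, port) it actually reached."""
    with socket.create_connection((hostname, port), timeout=timeout) as sock:
        return sock.getpeername()[:2]
```

Run from a container on the same Docker network that has Python available, something like print(resolve_ipv4("kafka"), connected_peer("kafka", 9092)) should show whether the resolved address and the connected address agree with the IP in the error messages.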
I have now tried recreating everything after removing all the containers and volumes and then doing docker system prune -a to get rid of absolutely everything that could be interfering.
This could be an issue specific to my system, but I’ve taken every step I can to make sure this is a clean install.
I just checked out the release/20.11.1 branch instead of master and also updated Docker to the newest version. Same issues. Also, someone else has the same issue: New install fails on Ubuntu 20.04 - kafta errors
thread title: Nothing can connect to Kafka
It should be quite obvious. Kafka is unreachable, meaning no events whatsoever can be processed. How should there be anything on my issues page?
I have done that. A lot of times. The only difference is that I removed the volumes manually by name (docker volume ls | grep sentry, then piped those back into docker volume rm).
I’d try running that line manually after the install, restart the ZK instance and see if it helps. If it doesn’t, I’d try deleting all zookeeper-related volumes. Note that these volumes would have a prefix as they are not “global” docker volumes but scoped to the Sentry on-premise compose project.
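Since the prefix makes it easy to miss a volume, here is a small Python sketch for listing the ZooKeeper-related ones before deleting anything. It relies on docker volume ls -q, which prints one volume name per line. The helper names and the "zookeeper" keyword are assumptions for illustration; check the real names in your own docker volume ls output.

```python
import subprocess


def list_volume_names():
    """Return the names of all Docker volumes, as printed by `docker volume ls -q`."""
    result = subprocess.run(
        ["docker", "volume", "ls", "-q"],
        check=True,
        capture_output=True,
        text=True,
    )
    return [line.strip() for line in result.stdout.splitlines() if line.strip()]


def zookeeper_volumes(names, keyword="zookeeper"):
    """Keep only names containing the keyword, so project-prefixed volumes match too."""
    return [name for name in names if keyword in name.lower()]
```

Calling zookeeper_volumes(list_volume_names()) only prints candidates; the actual removal (docker volume rm with those names) is left as a manual step on purpose, so nothing gets deleted by accident.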
I see that you’ve done this but I cannot really understand why ZK wouldn’t stay up.
At this point I have given up on getting it to work. It seems something is just deeply broken, and I don't have the time or motivation to spend days on this. I'm going to try again if there are any updates/changes, but other than that I'm done. Disappointing.
I understand the frustration and disappointment and am sorry about the experience. All I can guess is that something is broken at the Docker or network layer, as there’s nothing special we are doing to connect to Kafka, and all of this works in 2 different CI systems.
The weird thing is that so many other services run just fine with no network issues whatsoever. I run multiple databases, 2 instances of RabbitMQ, and at least 20 different other webservices on the same server just fine.
@laundmo I’m not denying there might be things we can do to improve the situation. I'm just saying we don’t know yet, and as far as we can tell, both of our CI flows are able to install Sentry and get a test event through. So I’m guessing there’s at least some part related to your specific setup (maybe it is about Ubuntu 20.04, or the Docker version, or something else).
We will be trying these out soon but any indicators or direct help you can provide would also help others who might be experiencing similar issues.