-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Program auto exits after a few seconds #33
Comments
Hello @Du-Li, it seems there was a dcp checkpoint timeout, could you please try to increase the checkpoint timeout? I will share an example config. Also, which version of go-kafka-connect-couchbase have you used?
|
Hi @Du-Li did you see any log like |
@erayarslan Thanks for your reply. Yes my CB is always up and those messages were rightly printed. My go-kafka-connect-cb was 0.0.20 and go-dcp-client was 0.0.22. I added the suggested |
It looks that 15 seconds are the default checkpoint interval 10s plus timeout 5s, so the configured |
Thanks @erayarslan. |
health check periodically sending ping request to couchbase to ensure its alive. |
My CB cluster is a local setup from community version 6.5. It worked fine with Kafka Connect and the CB connector. I am not sure why the dcp client's ping request failed. |
The program exited by the healthCheck interval+timeout despite a workload continuously updating CB and the program itself printing the right dcp events. |
i think i understand the issue. your cluster running on aws right? when we send ping request, we are controlling all kind of couchbase service like memd, mgmt, n1ql, fts, capi ... if your network policy not allowing one of these, ping gonna fail probably. |
I see. I am running the connector and kafka in EKS while the CB cluster out EKS. What kind of network policy is needed to make ping work? |
i dont think we should check them all because i think u have right policy to listen dcp. |
yep I can't agree more. as long as it is able to receive dcp events, it should be considered healthy. no need for comprehensive checking. |
v0.0.22 ping only needs memd (default port: 11210) and mnmt (default port: 8091) services. |
@erayarslan health check still failed after interval+timeout time. From the same machine the program was run, I tried
|
are you sure your version upgraded to go-kafka-connect-couchbase v0.0.22 and go-dcp-client v0.0.24? |
yes I am very sure. following is part of my go.mod file. ` require ( |
i cannot reproduce your behaviour so i expose healthcheck disable/enable config. with v0.0.23 u can disable healthcheck like
|
@erayarslan Thanks. That worked. |
thank you so much for contribution 🥳 |
Describe the bug
About 5 seconds after starting go-kafka-connect-couchbase connector, the health check failed and the connector auto exited.
To Reproduce
Steps to reproduce the behavior:
go run main.go
{"level":"debug","time":"2023-03-21T04:39:33Z","message":"vbucket discovery opened with membership type: static"}
{"level":"info","time":"2023-03-21T04:39:33Z","message":"member: 1/1, vbucket range: 0-1023"}
{"level":"debug","time":"2023-03-21T04:39:33Z","message":"loaded checkpoint"}
{"level":"debug","time":"2023-03-21T04:39:33Z","message":"stream started"}
{"level":"debug","time":"2023-03-21T04:39:33Z","message":"started checkpoint schedule"}
{"level":"info","time":"2023-03-21T04:39:33Z","message":"dcp stream started"}
{"level":"info","time":"2023-03-21T04:39:33Z","message":"metric middleware registered on path /metrics"}
{"level":"info","time":"2023-03-21T04:39:33Z","message":"api starting on port 8080"}
{"level":"debug","time":"2023-03-21T04:39:43Z","message":"no need to save checkpoint"}
{"level":"error","error":"context deadline exceeded","time":"2023-03-21T04:39:48Z","message":"health check failed"}
{"level":"debug","time":"2023-03-21T04:39:48Z","message":"vbucket discovery closed"}
{"level":"debug","time":"2023-03-21T04:39:48Z","message":"no need to save checkpoint"}
{"level":"debug","time":"2023-03-21T04:39:48Z","message":"stopped checkpoint schedule"}
{"level":"debug","time":"2023-03-21T04:39:49Z","message":"stream stopped"}
{"level":"debug","time":"2023-03-21T04:39:49Z","message":"api stopped"}
{"level":"debug","time":"2023-03-21T04:39:49Z","message":"dcp connection closed
{"level":"debug","time":"2023-03-21T04:39:49Z","message":"connections closed
{"level":"info","time":"2023-03-21T04:39:49Z","message":"dcp stream closed"}
Expected behavior
It's expected to continuously print the cb dcp events as long as a workload is fed into cb.
Screenshots
N/A
Version (please complete the following information):
Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: