Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pod/spiderpool-agent "CrashLoopBackOff" error at the edge nodes when trying to install edgemesh using helm with enableCNI enabled #557

Open
qpanpony opened this issue Apr 2, 2024 · 7 comments

Comments

@qpanpony
Copy link

qpanpony commented Apr 2, 2024

Trying to install edgemesh using helm with custom values.yaml (with enableCNI enabled).
1712029356(1)
pod/edgemesh sucessfully runs in both cloud and edge nodes, however pod/spiderpool-agent at the edgeside nodes show "CrashLoopBackOff" error.
1712029201(1)
I've checked edgecore logs at one of the edge nodes ().
1712029770(1)
Seems some CNI-releated issues at the edge nodes.
Any Suggestions? Thank you.

@Poorunga
Copy link
Member

edgemesh-CNI is still in testing, you can disable it in helm.

@Poorunga
Copy link
Member

/cc @NKDYX

@shyer
Copy link

shyer commented Nov 13, 2024

我也遇到了这个问题,这是边缘POD无法访问集群API导致,我根据以下文档进行了相关配置。
https://edgemesh.netlify.app/zh/guide/edge-kube-api.html#%E8%83%8C%E6%99%AF
https://kubeedge.io/zh/docs/advanced/inclusterconfig/

可以访问集群API了,但是安装还是失败。调试发现,需要请求https://169.254.30.10:10550/readyz?timeout=2s
image

请求如何配置权限。这里的rabc也添加了。
https://github.com/kubeedge/kubeedge/blob/master/manifests/charts/cloudcore/templates/rbac_cloudcore_feature.yaml

@Icarus9913
Copy link
Contributor

https://github.com/spidernet-io/spiderpool/blob/335952e3e920d9df0ec943489386b676d29d6bb7/cmd/spiderpool-agent/cmd/daemon.go#L287-L326

spiderpool-agent在启动的时候会去探测以下api-server的readyz路由,看起来似乎是访问到了clusterIP ?
@shyer

@shyer
Copy link

shyer commented Nov 21, 2024

我调试,它访问的是:https://169.254.30.10:10550/readyz?timeout=2s,报401。
然后我手动请求clusterIP,curl -k https://10.96.0.1:443/readyz,反倒是正常的。

在kubeedge也提了一个issue:
kubeedge/kubeedge#5961

@Icarus9913
Copy link
Contributor

这个169网段是pod IP还是?根据上面那个代码,我怀疑是 config := ctrl.GetConfigOrDie()这个 ctrl "sigs.k8s.io/controller-runtime" SDK 读的环境变量还是哪里拿的的这个IP地址, 应该是个配置问题

@shyer
Copy link

shyer commented Nov 26, 2024

169.254.30.10 是kubeedge metaserver的地址,问题应该是通过metaserver请求readyz异常导致的(有提交了issue)

https://github.com/kubeedge/kubeedge/blob/master/docs/proposals/inclusterconfig.md#non-hostnetwork-edge-pods-access-metaserver

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants