2017-08-11 297 views
0

我是Rancher和一般容器的新手。在使用Rancher設置Kubernetes集羣時,我在訪問Kubernetes儀表板時遇到了問題。Rancher Kubernetes儀表板 - 服務不可用

rancher/server: 1.6.6 

Single node Rancher server + External MySQL + 3 agent nodes 

Infrastructure Stack versions: 
healthcheck: v0.3.1 
ipsec: net:v0.11.5 
network-services: metadata:v0.9.2/network-manager:v0.7.7 
scheduler: k8s:v1.7.2-rancher5 
kubernetes (if applicable): kubernetes-agent:v0.6.3 


# docker info 
Containers: 1 
Running: 1 
Paused: 0 
Stopped: 0 
Images: 1 
Server Version: 17.03.1-ce 
Storage Driver: overlay 
Backing Filesystem: extfs 
Supports d_type: true 
Logging Driver: json-file 
Cgroup Driver: cgroupfs 
Plugins: 
Volume: local 
Network: bridge host macvlan null overlay 
Swarm: inactive 
Runtimes: runc 
Default Runtime: runc 
Init Binary: docker-init 
containerd version: 4ab9917febca54791c5f071a9d1f404867857fcc 
runc version: 54296cf40ad8143b62dbcaa1d90e520a2136ddfe 
init version: 949e6fa 
Security Options: 
seccomp 
Profile: default 
Kernel Version: 4.9.34-rancher 
Operating System: RancherOS v1.0.3 
OSType: linux 
Architecture: x86_64 
CPUs: 2 
Total Memory: 7.798 GiB 
Name: ch7radod1 
ID: IUNS:4WT2:Y3TV:2RI4:FZQO:4HYD:YSNN:6DPT:HMQ6:S2SI:OPGH:TX4Y 
Docker Root Dir: /var/lib/docker 
Debug Mode (client): false 
Debug Mode (server): false 
Http Proxy: http://proxy.ch.abc.net:8080 
Https Proxy: http://proxy.ch.abc.net:8080 
No Proxy: localhost,.xyz.net,abc.net 
Registry: https://index.docker.io/v1/ 
Insecure Registries: 
127.0.0.0/8 
Live Restore Enabled: false 

訪問UI URL http://10.216.30.10/r/projects/1a6633/kubernetes-dashboard:9090/#顯示「服務不可用」

如果我使用CLI部分從UI,我得到如下:

> kubectl get nodes 
NAME    STATUS AGE  VERSION 
ch7radod3  Ready  1d  v1.7.2 
ch7radod4  Ready  5d  v1.7.2 
ch7radod1  Ready  1d  v1.7.2 

> kubectl get pods --all-namespaces 
NAMESPACE  NAME         READY  STATUS    RESTARTS AGE 
kube-system heapster-4285517626-4njc2    0/1  ContainerCreating 0   5d 
kube-system kube-dns-3942128195-ft56n    0/3  ContainerCreating 0   19d 
kube-system kube-dns-646531078-z5lzs    0/3  ContainerCreating 0   5d 
kube-system kubernetes-dashboard-716739405-lpj38 0/1  ContainerCreating 0   5d 
kube-system monitoring-grafana-3552275057-qn0zf 0/1  ContainerCreating 0   5d 
kube-system monitoring-influxdb-4110454889-79pvk 0/1  ContainerCreating 0   5d 
kube-system tiller-deploy-737598192-f9gcl   0/1  ContainerCreating 0   5d 

的設置使用的是專用的註冊表(Artifactory的) 。我檢查了Artifactory,我可以看到幾個與Docker相關的圖像。我正在通過private registry部分,我也看到了this文件。如果這個文件是必需的,我究竟在哪裏保存它,以便Rancher可以獲取它並配置Kubernetes儀表板?

UPDATE:

$ sudo ros engine switch docker-1.12.6 
> ERRO[0031] Failed to load https://raw.githubusercontent.com/rancher/os-services/v1.0.3/index.yml: Get https://raw.githubusercontent.com/rancher/os-services/v1.0.3/index.yml: Proxy Authentication Required 
> FATA[0031] docker-1.12.6 is not a valid engine 

我想可能是它由於NGINX,所以我停止了NGINX容器,但我仍然得到上述錯誤。早些時候,我已經在這個Rancher服務器上嘗試了相同的命令,它用於正常工作。它在代理節點上工作正常,但它們已經配置了1.12.6。

更新2:

> kubectl -n kube-system get po 
NAME         READY STATUS   RESTARTS AGE 
heapster-4285517626-4njc2   1/1 Running   0  12d 
kube-dns-2588877561-26993   0/3 ImagePullBackOff 0  5h 
kube-dns-646531078-z5lzs    0/3 ContainerCreating 0  12d 
kubernetes-dashboard-716739405-zq3s9 0/1 CrashLoopBackOff 67  5h 
monitoring-grafana-3552275057-qn0zf 1/1 Running   0  12d 
monitoring-influxdb-4110454889-79pvk 1/1 Running   0  12d 
tiller-deploy-737598192-f9gcl  0/1 CrashLoopBackOff 72  12d 
+0

rancherOS的哪個版本? –

+0

RancherOS v1.0.3 – Technext

+0

你是否支持代理?你可以通過使用'curl'或其他東西來達到'https:// raw.githubusercontent.com/rancher/os-services/v1.0.3/index.yml'嗎? –

回答

0

基於@ ivan.sim的suggestion,我發佈了'更新2'。這開始讓我終於看到了正確的方向。然後我開始在網上尋找CrashLoopBackOff錯誤,並且遇到了this鏈接,並嘗試使用以下命令(使用Rancher控制檯中的CLI選項),這實際上與@ ivan.sim上面建議的非常相似,但是這幫助我在節點其中儀表板進程正在運行:

> kubectl get pods -a -o wide --all-namespaces 
NAMESPACE  NAME         READY STATUS    RESTARTS AGE IP     NODE 
kube-system heapster-4285517626-4njc2    1/1 Running    0   12d 10.42.224.157  radod4 
kube-system kube-dns-2588877561-26993    0/3 ImagePullBackOff 0   5h <none>    radod1 
kube-system kube-dns-646531078-z5lzs    0/3 ContainerCreating 0   12d <none>    radod4 
kube-system kubernetes-dashboard-716739405-zq3s9 0/1 Error    70   5h 10.42.218.11  radod1 
kube-system monitoring-grafana-3552275057-qn0zf 1/1 Running    0   12d 10.42.202.44  radod4 
kube-system monitoring-influxdb-4110454889-79pvk 1/1 Running    0   12d 10.42.111.171  radod4 
kube-system tiller-deploy-737598192-f9gcl   0/1 CrashLoopBackOff 76   12d 10.42.213.24  radod4 

然後我去了那裏的進程運行的主機並嘗試下面的命令:

[[email protected] ~]$ 
[[email protected] ~]$ docker ps -a | grep dash 
282334b0ed38 gcr.io/google_containers/[email protected]:b537ce8988510607e95b8d40ac9824523b1f9029e6f9f90e9fccc663c355cf5d "/dashboard --insecur" About a minute ago Exited (1) 55 seconds ago k8s_kubernetes-dashboard_kubernetes-dashboard-716739405-zq3s9_kube-system_7b0afda7-8271-11e7-ae86-021bfe69c163_72 
99836d7824fd gcr.io/google_containers/pause-amd64:3.0                      "/pause"     5 hours ago   Up 5 hours     k8s_POD_kubernetes-dashboard-716739405-zq3s9_kube-system_7b0afda7-8271-11e7-ae86-021bfe69c163_1 
[[email protected] ~]$ 
[[email protected] ~]$ 
[[email protected] ~]$ docker logs 282334b0ed38 
Using HTTP port: 8443 
Creating API server client for https://10.43.0.1:443 
Error while initializing connection to Kubernetes apiserver. This most likely means that the cluster is misconfigured (e.g., it has invalid apiserver certificates or service accounts configuration) or the --apiserver-host param points to a server that does not exist. Reason: the server has asked for the client to provide credentials 
Refer to the troubleshooting guide for more information: https://github.com/kubernetes/dashboard/blob/master/docs/user-guide/troubleshooting.md 

後我得到了上面的錯誤,我又在網上搜索,並試圖一些事情。最後,this鏈接幫助。在所有代理節點上執行以下命令後,Kubernetes儀表板終於開始工作了!

docker volume rm etcd 
rm -rf /var/etcd/backups/* 
0

您莢無運行,則需要先解決這個問題。嘗試重新啓動整個羣集並查看處於運行狀態的以上所有羣集。

+0

問題似乎與Docker版本(17.03.1-ce)與Kubernetes的兼容性。我意識到這一點,但我在修理其他東西的時候忘了。雖然我正在做所有這些,但我最近把NGINX換成了HTTPS。現在,如果我試圖將docker版本更改爲1.12.6,則會收到錯誤消息。請在我的帖子中查看UPDATE。 – Technext