ceph版本 16.2.1
最近一个osd daemon 突然挂掉,日志如下
-- Logs begin at Mon 2021-04-26 15:03:23 CST, end at Mon 2021-04-26 15:33:47 CST. --
Apr 26 15:03:43 storsrv01 systemd[1]: Started Ceph osd.2 for 31a91072-a330-11eb-b831-d8d3855cd168.
Apr 26 15:03:52 storsrv01 bash[3357]: Running command: /usr/bin/chown -R ceph:ceph /var/lib/ceph/osd/ceph-2
Apr 26 15:03:52 storsrv01 bash[3357]: Running command: /usr/bin/ceph-bluestore-tool --cluster=ceph prime-osd-dir --dev /dev/ceph-004457f8-ea07-4964-97ba-903553da1906/osd-block-751d3951-e3eb-414c-bcb9-6861e0e18b98 --path /var/lib/ceph/osd/ceph-2 --no-mon-config
Apr 26 15:03:52 storsrv01 bash[3357]: stderr: failed to read label for /dev/ceph-004457f8-ea07-4964-97ba-903553da1906/osd-block-751d3951-e3eb-414c-bcb9-6861e0e18b98: (2) No such file or directory
Apr 26 15:03:52 storsrv01 bash[3357]: --> RuntimeError: command returned non-zero exit status: 1
Apr 26 15:03:52 storsrv01 systemd[1]: ceph-31a91072-a330-11eb-b831-d8d3855cd168@osd.2.service: Main process exited, code=exited, status=1/FAILURE
Apr 26 15:03:53 storsrv01 systemd[1]: ceph-31a91072-a330-11eb-b831-d8d3855cd168@osd.2.service: Failed with result 'exit-code'.
Apr 26 15:04:03 storsrv01 systemd[1]: ceph-31a91072-a330-11eb-b831-d8d3855cd168@osd.2.service: Service RestartSec=10s expired, scheduling restart.
Apr 26 15:04:03 storsrv01 systemd[1]: ceph-31a91072-a330-11eb-b831-d8d3855cd168@osd.2.service: Scheduled restart job, restart counter is at 1.
Apr 26 15:04:03 storsrv01 systemd[1]: Stopped Ceph osd.2 for 31a91072-a330-11eb-b831-d8d3855cd168.
请大神们帮忙看看 为何出现这种状况
相似问题