cloudera manager安装无法从代理接收心跳-无法将新主机添加到群集

ttcibm8c  于 2021-06-03  发布在  Hadoop
关注(0)|答案(5)|浏览(524)

我尝试在ubuntu 12.04.1 lts上使用标准版本安装cloudera manager,当我想添加新主机时,出现下一个错误:

Installation failed.Failed to receive heartbeat from agent.
Ensure that the host's hostname is configured properly.
Ensure that port 7182 is accesible on the Cloudera Manager server (check firewall rules).
Ensure that ports 9000 an 9001 are free on the host being added.
Check agent logs in /var/log/cloudera-scm-agent/ on the host being added (some of the logs can be found in the installation details).

/etc/hosts 我将其配置为:

127.0.0.1 localhost
127.0.0.1 hadoop-ubuntu
192.168.5.xyz hadoop-ubuntu.dana.local hadoop-ubuntu
192.168.3.xyz ro-m81.dana.local ro-m81
192.168.3.abc ro-m41.dana.local ro-m41

支持ipv6的主机需要以下线路

::1 ip6-localhost ip6-loopback
fe00::0 ip6-localnet
ff00::0 ip6-mcastprefix
ff02::1 ip6-allnodes
ff02::2 ip6-allrouters     
The**/var/log/cloudera-scm-agent/cloudera-scm-agent.log**shows the next error::   
[09/Oct/2013 16:04:23 +0000] 4532 MainThread agent ERROR Heartbeating to 192.168.5.xyz:7182 failed.
Traceback (most recent call last):
File "/usr/lib64/cmf/agent/src/cmf/agent.py", line 747, in send_heartbeat
response = self.requestor.request('heartbeat', dict(request=heartbeat))
File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 145, in request
return self.issue_request(call_request, message_name, request_datum)
File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 256, in issue_request
call_response = self.transceiver.transceive(call_request)
File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 485, in transceive
result = self.read_framed_message()
File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 489, in read_framed_message
response = self.conn.getresponse()
File "/usr/lib64/python2.6/httplib.py", line 990, in getresponse
response.begin()
File "/usr/lib64/python2.6/httplib.py", line 391, in begin
version, status, reason = self._read_status()
File "/usr/lib64/python2.6/httplib.py", line 349, in _read_status
line = self.fp.readline()
File "/usr/lib64/python2.6/socket.py", line 433, in readline
data = recv(1)
error: [Errno 104] Connection reset by peer

请帮我找出为什么我会出现这个错误或者我遗漏了什么。

ezykj2lf

ezykj2lf1#

首先使用“sudo service cloudera scm agent status”检查cloudera scm agent的状态是否正在运行
2.检查/var/log/cloudera scm agent中此目录中的代理日志文件/
分辨率资源:http://commandstech.com/what-is-heartbeat-in-hadoop-how-to-resolve-heartbeat-lost-in-cloudera-and-hortonworks/

okxuctiv

okxuctiv2#

我也有同样的问题。这就是我的窍门。
键入ifconfig并找到您的ip地址。不是127.0.0.1。
键入$hostname并找到您的主机名
编辑/etc/hosts文件
在那里为你的IP地址添加一个条目。像这样的

192.168.8.xxx   hostname.test.com   hostname

重新启动cloudera服务。转到sonic.test。com:7180 and 再试一次。应该有用。即使不起作用,去http://hostname.test.com:7180/cmf/home检查主机的状态。
结果发现,尽管我的心跳出现了错误,但主机实际上已经启动并运行了。

j2qf4p5b

j2qf4p5b3#

我和你有同样的问题,最后我解决了。
我的问题是代理的版本 cloudera-scm-agent 与服务器的 cloudera-scm-server ,你可以用 dpkg 或者 yum 检查你自己。

x759pob2

x759pob24#

检查群集中所有节点上的主机文件后,请确保在安装程序上打开端口7180和7182,在群集节点(安装程序除外)上打开端口9000。
我得到了“督察失败”。io exception抛出“cloudera安装错误,直到我查看安装程序(服务器)日志,发现客户端无法在端口9000上通信。

3pmvbmvn

3pmvbmvn5#

我面对同样的问题,然后我找到了解决办法。
我用了两台机器一台 master 再来一个给你 slave 主机具有 cloudera-scm-server .
我配置了 /etc/hosts 在这两台机器中,错误终于消失了。
主机ip为:192.168.1.10

In Master Machine /etc/hosts

127.0.0.1       localhost

192.168.1.10     <hostname>

从机ip为:192.168.1.8

In Slave Machine /etc/hosts

127.0.0.1       localhost

192.168.1.8     <hostname>

相关问题