我尝试在ubuntu 12.04.1 lts上使用标准版本安装cloudera manager,当我想添加新主机时,出现下一个错误:
Installation failed.Failed to receive heartbeat from agent.
Ensure that the host's hostname is configured properly.
Ensure that port 7182 is accesible on the Cloudera Manager server (check firewall rules).
Ensure that ports 9000 an 9001 are free on the host being added.
Check agent logs in /var/log/cloudera-scm-agent/ on the host being added (some of the logs can be found in the installation details).
在 /etc/hosts
我将其配置为:
127.0.0.1 localhost
127.0.0.1 hadoop-ubuntu
192.168.5.xyz hadoop-ubuntu.dana.local hadoop-ubuntu
192.168.3.xyz ro-m81.dana.local ro-m81
192.168.3.abc ro-m41.dana.local ro-m41
支持ipv6的主机需要以下线路
::1 ip6-localhost ip6-loopback
fe00::0 ip6-localnet
ff00::0 ip6-mcastprefix
ff02::1 ip6-allnodes
ff02::2 ip6-allrouters
The**/var/log/cloudera-scm-agent/cloudera-scm-agent.log**shows the next error::
[09/Oct/2013 16:04:23 +0000] 4532 MainThread agent ERROR Heartbeating to 192.168.5.xyz:7182 failed.
Traceback (most recent call last):
File "/usr/lib64/cmf/agent/src/cmf/agent.py", line 747, in send_heartbeat
response = self.requestor.request('heartbeat', dict(request=heartbeat))
File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 145, in request
return self.issue_request(call_request, message_name, request_datum)
File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 256, in issue_request
call_response = self.transceiver.transceive(call_request)
File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 485, in transceive
result = self.read_framed_message()
File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 489, in read_framed_message
response = self.conn.getresponse()
File "/usr/lib64/python2.6/httplib.py", line 990, in getresponse
response.begin()
File "/usr/lib64/python2.6/httplib.py", line 391, in begin
version, status, reason = self._read_status()
File "/usr/lib64/python2.6/httplib.py", line 349, in _read_status
line = self.fp.readline()
File "/usr/lib64/python2.6/socket.py", line 433, in readline
data = recv(1)
error: [Errno 104] Connection reset by peer
请帮我找出为什么我会出现这个错误或者我遗漏了什么。
5条答案
按热度按时间ezykj2lf1#
首先使用“sudo service cloudera scm agent status”检查cloudera scm agent的状态是否正在运行
2.检查/var/log/cloudera scm agent中此目录中的代理日志文件/
分辨率资源:http://commandstech.com/what-is-heartbeat-in-hadoop-how-to-resolve-heartbeat-lost-in-cloudera-and-hortonworks/
okxuctiv2#
我也有同样的问题。这就是我的窍门。
键入ifconfig并找到您的ip地址。不是127.0.0.1。
键入$hostname并找到您的主机名
编辑/etc/hosts文件
在那里为你的IP地址添加一个条目。像这样的
重新启动cloudera服务。转到sonic.test。com:7180 and 再试一次。应该有用。即使不起作用,去http://hostname.test.com:7180/cmf/home检查主机的状态。
结果发现,尽管我的心跳出现了错误,但主机实际上已经启动并运行了。
j2qf4p5b3#
我和你有同样的问题,最后我解决了。
我的问题是代理的版本
cloudera-scm-agent
与服务器的cloudera-scm-server
,你可以用dpkg
或者yum
检查你自己。x759pob24#
检查群集中所有节点上的主机文件后,请确保在安装程序上打开端口7180和7182,在群集节点(安装程序除外)上打开端口9000。
我得到了“督察失败”。io exception抛出“cloudera安装错误,直到我查看安装程序(服务器)日志,发现客户端无法在端口9000上通信。
3pmvbmvn5#
我面对同样的问题,然后我找到了解决办法。
我用了两台机器一台
master
再来一个给你slave
主机具有cloudera-scm-server
.我配置了
/etc/hosts
在这两台机器中,错误终于消失了。主机ip为:192.168.1.10
从机ip为:192.168.1.8