windows 2016上的mesos slave不与主机连接

s4chpxco  于 2021-06-26  发布在  Mesos
关注(0)|答案(1)|浏览(288)

我目前的设置如下:
mesos大师- 10.20.200.300:14081 -rhel 7号
Zookeeper- 10.20.200.300:14080 -rhel 7号
mesos代理- 10.21.210.310:5051 -windows 2016版
主机启动,可以连接到zookeeper。但是,在启动代理时,即使代理连接到zookeeper,也不会连接到主机。
master作为systemd进程启动,在/etc/mesos master下有以下参数-

hostname - mymaster.mesos.com    
quorum - 1    
work_dir - /var/lib/mesos   
advertise_ip - 10.20.200.300
advertise_port - 14081

下面是从主人,奴隶和Zookeeper日志。
主日志(在10.20.200.300:14081上运行):

E1208 12:22:21.269227  4302 process.cpp:2455] Failed to shutdown socket with fd 26, address 10.20.200.300:14081: Transport endpoint is not connected

zookeeper日志(运行于10.20.200.300:14080):

2017-12-08 12:22:21,185 [myid:] - INFO  [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:14080:ZooKeeperServer@942] - Client attempting to establish new session at /10.21.210.310:63039     
2017-12-08 12:22:21,196 [myid:] - INFO  [SyncThread:0:ZooKeeperServer@687] - Established session 0x160372c2b770010 with negotiated timeout 10000 for client /10.21.210.310:63039

从属日志(在10.21.210.310:5051上运行):

I1208 12:22:21.179652  4224 slave.cpp:1007] New master detected at master@10.20.200.300:14081    
I1208 12:22:21.195278  4224 slave.cpp:1031] No credentials provided. Attempting to register without authentication     
I1208 12:22:21.195278  4224 slave.cpp:1042] Detecting new master     
I1208 12:22:21.210924  6156 slave.cpp:5135] Got exited event for master@10.20.200.300:14081     
W1208 12:22:21.210924  6156 slave.cpp:5140] Master disconnected! Waiting for a new master to be elected     
I1208 12:22:21.226510  2700 slave.cpp:5135] Got exited event for master@10.20.200.300:14081     
W1208 12:22:21.226510  2700 slave.cpp:5140] Master disconnected! Waiting for a new master to be elected

有人知道这些的原因吗?我已经测试了slave->master和master->slave之间的连接,它是成功的。

Test-NetConnection -ComputerName 10.20.200.300 -Port 14081
ComputerName     : 10.20.200.300     
RemoteAddress    : 10.20.200.300     
RemotePort       : 14081     
InterfaceAlias   : Ethernet     
SourceAddress    : 10.21.210.310     
TcpTestSucceeded : True    

[root@mesos-master]# telnet 10.21.210.310 5051
Trying 10.21.210.310...
Connected to 10.21.210.310.
Escape character is '^]'.

我用以下参数找到了探员-

C:\Mesos\mesos\build\src>C:\Mesos\mesos\build\src\mesos-agent.exe \
         --master=zk://10.20.200.300:14080/mesos \
         --work_dir=C:\Mesos\Logs \
         --launcher_dir=C:\Mesos\mesos\build\src \
         --ip=10.21.210.310 \
         --advertise_ip=10.21.210.310 \
         --advertise_port=5051

主/状态日志-

{
    "version": "1.3.1",
    "git_sha": "1beaede8c13f0832d4921121da34f924deec8950",
    "git_tag": "1.3.1",
    "build_date": "2017-09-05 18:02:12",
    "build_time": 1504634532,
    "build_user": "centos",
    "start_time": 1513010072.51033,
    "elected_time": 1513010072.67995,
    "id": "90f5702f-f867-41ac-8087-5d20c87ea96f",
    "pid": "master@10.20.200.300:14081",
    "hostname": "MYhost.COM",
    "activated_slaves": 0,
    "deactivated_slaves": 0,
    "unreachable_slaves": 0,
    "leader": "master@10.20.200.300:14081",
    "leader_info": {
        "id": "90f5702f-f867-41ac-8087-5d20c87ea96f",
        "pid": "master@10.20.200.300:14081",
        "port": 14081,
        "hostname": "MYhost.COM"
    },
    "log_dir": "/var/log/mesos",
    "flags": {
        "advertise_ip": "10.20.200.300",
        "advertise_port": "14081",
        "agent_ping_timeout": "15secs",
        "agent_reregister_timeout": "10mins",
        "allocation_interval": "1secs",
        "allocator": "HierarchicalDRF",
        "authenticate_agents": "false",
        "authenticate_frameworks": "false",
        "authenticate_http_frameworks": "false",
        "authenticate_http_readonly": "false",
        "authenticate_http_readwrite": "false",
        "authenticators": "crammd5",
        "authorizers": "local",
        "framework_sorter": "drf",
        "help": "false",
        "hostname": "MYhost.COM",
        "hostname_lookup": "true",
        "http_authenticators": "basic",
        "initialize_driver_logging": "true",
        "log_auto_initialize": "true",
        "log_dir": "/var/log/mesos",
        "logbufsecs": "0",
        "logging_level": "INFO",
        "max_agent_ping_timeouts": "5",
        "max_completed_frameworks": "50",
        "max_completed_tasks_per_framework": "1000",
        "max_unreachable_tasks_per_framework": "1000",
        "port": "14081",
        "quiet": "false",
        "quorum": "1",
        "recovery_agent_removal_limit": "100%",
        "registry": "replicated_log",
        "registry_fetch_timeout": "1mins",
        "registry_gc_interval": "15mins",
        "registry_max_agent_age": "2weeks",
        "registry_max_agent_count": "102400",
        "registry_store_timeout": "20secs",
        "registry_strict": "false",
        "root_submissions": "true",
        "user_sorter": "drf",
        "version": "false",
        "webui_dir": "/usr/share/mesos/webui",
        "work_dir": "/var/lib/mesos",
        "zk": "zk://localhost:14080/mesos",
        "zk_session_timeout": "10secs"
    },
    "slaves": [],
    "recovered_slaves": [],
    "frameworks": [],
    "completed_frameworks": [],
    "orphan_tasks": [],
    "unregistered_frameworks": []
}

我们是否需要测试任何其他连接或此错误是出于其他原因?

fwzugrvs

fwzugrvs1#

我想试试这个
在从机上设置主机名(可以说主机名=10.21.210.310)
检查windows计算机上的防火墙。允许传入连接到5051端口

相关问题