Paddle xpu多卡训练报错

afdcj2ne 于 2021-11-30 发布在 Java

关注(0)|答案(7)|浏览(421)

paddlepaddle2.1.1
python3.7.4
按照官网模板写的mnist手写图片识别算法，跑单卡xpu可以训练，跑多卡时分别尝试了fleet的collective模式和paddle.distributed.nit_parallel_env()，前者报错no cuda device，后者报错Operator is not registered。请问如何解决？是xpu不支持多卡么？

Paddle

来源：https://github.com/PaddlePaddle/Paddle/issues/35752

7条答案

按热度按时间

oewdyzsn1#

您好，我们已经收到了您的问题，会安排技术人员尽快解答您的问题，请耐心等待。请您再次检查是否提供了清晰的问题描述、复现代码、环境&版本、报错信息等。同时，您也可以通过查看官网API文档、常见问题、历史Issue、AI社区来寻求解答。祝您生活愉快～

Hi! We've received your issue and please be patient to get responded. We will arrange technicians to answer your questions as soon as possible. Please make sure that you have posted enough message to demo your request. You may also check out the API，FAQ，Github Issue and AI community to get the answer.Have a nice day!

赞(0）回复(0）举报 2021-11-30

8aqjt8rx2#

跑的都是动态图吧，先设置一下device，paddle.set_device('xpu')

赞(0）回复(0）举报 2021-11-30

inkz8wg93#

这个代码吧https://www.paddlepaddle.org.cn/tutorials/projectdetail/2203224