Pytorch遍历DataLoader时报错【BrokenPipeError:[Errno 32]Broken pipe】|电子爱好者

admin管理员组
文章数量:1642214

问题描述

GPU环境训练好模型，CPU环境部署过程成功后，尝试遍历DataLoader的时候出现了以下报错信息。具体如下：

Traceback (most recent call last):
  File "/usr/local/lib/python3.6/multiprocessing/resource_sharer.py", line 142, in _serve
    with self._listener.accept() as conn:
  File "/usr/local/lib/python3.6/multiprocessing/connection.py", line 455, in accept
    deliver_challenge(c, self._authkey)
  File "/usr/local/lib/python3.6/multiprocessing/connection.py", line 720, in deliver_challenge
    connection.send_bytes(CHALLENGE + message)
  File "/usr/local/lib/python3.6/multiprocessing/connection.py", line 200, in send_bytes
    self._send_bytes(m[offset:offset + size])
  File "/usr/local/lib/python3.6/multiprocessing/connection.py", line 404, in _send_bytes
    self._send(header + buf)
  File "/usr/local/lib/python3.6/multiprocessing/connection.py", line 368, in _send
    n = write(self._handle, buf)
BrokenPipeError: [Errno 32] Broken pipe

问题分析

逐步打断点调试定位错误代码为Pytorch遍历DataLoader引发。

loader =DataLoader(dataset,
	batch_size = batch_size,
    shuffle=shuffle,
    num_workers = 4,
    pin_memory=True)

DataLoader的 num_workers 涉及多进程读取数据，而Python由于设计时有GIL全局锁，导致多进程无法利用多核，问题大致就出于此。try-catch错误代码输出：

[Errno 11] Resource temporarily unavailable

大batch训练时，num_workers=4加速GPU训练效率，CPU推理过程不需要。

问题解决

将num_workers设置为 0 即可。

附录

https://discuss.pytorch/t/very-strange-dataloader-error-simplified-code-inside/31162
https://github/pytorch/pytorch/issues/14768

本文标签：遍历时报 DataLoader Pytorch BrokenPipeError

版权声明：本文标题：Pytorch遍历DataLoader时报错【BrokenPipeError:[Errno 32]Broken pipe】内容由热心网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：https://m.elefans.com/xitong/1729333176a1196655.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

更多相关文章

xp系统

使用分页导入的方式把大量数据从mysql导入单点的es时报错：Connection refused: no further information

9天前

我出现的问题： 意思是，拒绝连接:没有进一步的信息我的解决方案是：在yml文件中配置以下信息，问题就可以解决 spring:data:elastics

配置缓存时报错: Connection refused: no further information

9天前

项目配置缓存时出错, 错误信息如下 Servlet.service() for servlet [dispatcherServlet] in context with path [] threw exception [Request pr

pytorch——迁移学习实战宝可梦精灵分类

8天前

文章目录数据集数据集处理迁移学习网络原理代码实现数据集使用宝可梦精灵的图片数据集。数据集地址： 链接：https:pan.baidus1zDERMsV1AvwfZudhuae6E

npm install时报错：“Unexpected token ＜ in JSON at position 0 while parsing near ‘＜!DOCTYPE html＞”

8天前

git clone之后，npm i时报下面错误：解决办法：npm set registry https:registry.npmjs. 原因：

Pytorch调用预训练模型输出结果时报错argument ‘input‘ (position 1) must be Tensor, not collections.OrderedDict

8天前

在使用pytorch中的torchvision.models.segmentation.fcn_resnet50进行获得已经训练好的预训练模型时，所得结果的输出给我提示说argument 'input' (positio

惠普HP ProBook 笔记本U盘启动安装 Linux Ubuntu 时报错内存不足（error: out of memory）解决记录（内核DMA保护）

8天前

惠普HP ProBook 惠普笔记本，U盘启动安装Linux Ubuntu 时报错内存不足（error: out of memory）解决记录提要概述笔记本预装win

hbase查询时报错offset (0) + length (4) exceed the capacity of the array: 2

8天前

hbase查询时报错offset (0)length (4) exceed the capacity of the array: 2 错误原因：同一个列族中，有的列中没有指定值

git提交时报错:Updates were rejected because the tip of your current branch is behind

8天前

问题描述今天用git提交项目到gitee上时，执行下面语句时报错 git push -u origin mastererror: failed to push some refs to ‘xxxxx’ hint:

虚拟机打开时报错Operation inconsistent with current state

8天前

文章目录问题描述导致原因解决方案问题描述 Operation inconsistent with current state vmware 翻译：操作与当前状态不一致导致原因 1.可能是因为上一次的虚拟机操作

QT中在线程中使用opengl时报错Cannot make QOpenGLContext current in a different thread解决办法

8天前

激动的心，颤抖的手，终于解决了这个卡了小半年的bug，一直没有找到解决办法。几个月之后，偶然搜到一个大佬写的东西点这，看了之后其实

使用idea的mybatis-generator插件逆向生成时报错:No plugin found for prefix 'mybatis-generator' in the current

8天前

用mybatis-generator进行逆向生成，第一次时报错： No plugin found for prefix mybatis-generator in the current proj

tomcat加载时报The web application [dmscs] created a ThreadLocal with key of type

6天前

严重: The web application [dmscs] created a ThreadLocal with key of type [com.opensymphony.xwork2.inject.ContainerImpl$10

Maven项目中导入坐标依赖时报（Failure to transfer....）的错的问题

6天前

Maven项目中导入坐标依赖时报（Failure to transfer…）的错的问题今天在做Spring MVC的一个项目时导入坐标依赖的时候突然网断了一下(村里网络日常不稳定)&#x

树莓派4B安装系统、opencv4.1、pytorch踩坑记录

5天前

树莓派系统镜像下载因为国内网络原因，从树莓派官网下载太慢，而且可能下载到一半就会卡死，这里选择从清华大学开源软件镜像站下载，进入这个网站之后，找到下图中的三个项。这里三项分别对应着树莓派官网的三种系统镜像。带图形桌面和推荐软件：(r

用PyTorch实现图像聚类

3天前

作者|Anders Ohrn 编译|VK 来源|Towards Data Science 利用深度卷积神经网络(DCNN)进行监督图像分类是一个成熟的过程。通过预训练模板模型加上微调优化，可以在许多有意义的应用中获得非常高的准确率——比如

python手写输入法开发_AI手写输入法 - pytorch从入门到入道(二)

1天前

1 #coding: utf-8 2 from PyQt5.QtWidgets import * 3 from PyQt5.QtGui import * 4 from PyQt5.QtCore import * 5 importsys6 s

【Python】使用PdfFileMerger合并pdf时报错PdfReadError: Unexpected destination ‘__WKANCHOR_2‘

1天前

在python中使用PyPDF2扩展包的PdfFileMerger函数合并pdf时，代码如下： mergerPdfFileMerger()input1open(r"2.pdf

震惊！System Volume Information竟是遍历硬盘下的所有文件和目录时出现异常的真正元凶！！！

17小时前

我们在对硬盘下的所有目录遍历访问时,经常会出现异常,罪魁祸首就是(硬盘):System Volume Information这个文件夹,因为这个文件夹它拒绝访问,你就是再遍历,只要碰到它都完蛋,而且还是每个盘都有,当然这个文件夹也可以删掉

【常见 error】自定义 IP 添加到工程时报错解决办法

2小时前

在进行自定义 IP 后，将自定义 IP 添加到当前的工程时，出现如下报错： [IP_Flow 19-167] Failed to deliver one or more f