python Pytorchvideo模型Resnet输入形状

ztmd8pv5 于 2023-04-28 发布在 Python

关注(0)|答案(1)|浏览(168)

bounty还有5天到期。回答此问题可获得+500声望奖励。Paul Reiners希望引起更多关注这个问题。

我使用下面的代码来加载resnet50，但由于这是一个视频。我不确定预期的输入是什么。是([batch_size, channels, frames,img1,img2])吗？
任何帮助将是伟大的。

import pytorchvideo.models.resnet

def resnet():
  return pytorchvideo.models.resnet.create_resnet(
      input_channel=3,     # RGB input from Kinetics
      model_depth=50,      # For the tutorial let's just use a 50 layer network
      model_num_class=400, # Kinetics has 400 classes so we need out final head to align
      norm=nn.BatchNorm3d,
      activation=nn.ReLU,
  )

python

来源：https://stackoverflow.com/questions/75096662/pytorchvideo-models-resnet-input-shape

1条答案

按热度按时间

zf9nrax11#

输入Tensor的形状应为（B，C，T，H，W）
资料来源： www.example.com
文档中的一个用法示例。

import pytorchvideo.models as models

resnet = models.create_resnet()
B, C, T, H, W = 2, 3, 8, 224, 224
input_tensor = torch.zeros(B, C, T, H, W)
output = resnet(input_tensor)

赞(0）回复(0）举报 2023-04-28

我来回答

python Pytorchvideo模型Resnet输入形状

1条答案

相关问题

热门标签

最新问答