PyTorch LSTM/RNN error:
Input and hidden tensors are not at the same device, found input tensor at cuda:0 and hidden tensor at cpu
Prerequisites

x and y have been moved to CUDA, and so has the model:

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
x = x.to(device)   # note: Tensor.to() is not in-place, so reassign
y = y.to(device)
model.to(device)   # Module.to() moves the parameters in place
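A subtle pitfall worth calling out: `Tensor.to(device)` returns a new tensor and leaves the original untouched, while `Module.to(device)` moves the module's parameters in place. A minimal sketch (the small `Linear` module is just a stand-in):

```python
import torch
import torch.nn as nn

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

x = torch.randn(5, 20, 10)
x.to(device)        # returns a moved copy; x itself is unchanged
x = x.to(device)    # correct: rebind the name to the moved tensor

model = nn.Linear(10, 2)
model.to(device)    # modules are moved in place, so no reassignment needed

# the input and the model parameters now live on the same device
print(x.device == next(model.parameters()).device)
```

Forgetting the reassignment on `x` is a common way to end up with a model on `cuda:0` and data still on `cpu` (or vice versa).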
The key hint in the message is that the hidden state is on the CPU. For a plain RNN it is enough to fix h0; for an LSTM, fix both h0 and c0. In the model definition, change the forward method so that the states are created on the same device as the input:

h0 = torch.zeros(self.num_layers * 2, x.size(0), self.hidden_size).to(x.device)
c0 = torch.zeros(self.num_layers * 2, x.size(0), self.hidden_size).to(x.device)
def forward(self, x):
    # Initialize the hidden and cell states on the input's device;
    # using x.device avoids having to store a device attribute on the model
    h0 = torch.zeros(self.num_layers * 2, x.size(0), self.hidden_size).to(x.device)
    c0 = torch.zeros(self.num_layers * 2, x.size(0), self.hidden_size).to(x.device)
    # Forward pass through the LSTM
    out, _ = self.lstm(x, (h0, c0))
    # Example shapes:
    # x = torch.randn(5, 20, input_size)  # batch size 5, sequence length 20
    # y = torch.randn(5, output_size)     # batch size 5
    # Taking only the last time step would drop the sequence-length dimension:
    # out = self.fc(out[:, -1, :])
    # Here the fully connected layer is applied to every time step instead:
    out = self.fc(out)
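Putting it together, a minimal runnable sketch. The class name `BiLSTM` and the layer sizes are assumptions for illustration; `bidirectional=True` is implied by the `num_layers * 2` factor in the original h0/c0 shapes:

```python
import torch
import torch.nn as nn

class BiLSTM(nn.Module):
    # hypothetical sizes; only the h0/c0 device handling is the point here
    def __init__(self, input_size, hidden_size, num_layers, output_size):
        super().__init__()
        self.hidden_size = hidden_size
        self.num_layers = num_layers
        self.lstm = nn.LSTM(input_size, hidden_size, num_layers,
                            batch_first=True, bidirectional=True)
        # bidirectional doubles the feature dimension of the output
        self.fc = nn.Linear(hidden_size * 2, output_size)

    def forward(self, x):
        # create h0/c0 on the same device as the input
        h0 = torch.zeros(self.num_layers * 2, x.size(0), self.hidden_size,
                         device=x.device)
        c0 = torch.zeros(self.num_layers * 2, x.size(0), self.hidden_size,
                         device=x.device)
        out, _ = self.lstm(x, (h0, c0))  # out: (batch, seq_len, 2 * hidden)
        return self.fc(out)              # fc applied to every time step

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
model = BiLSTM(input_size=10, hidden_size=32, num_layers=2, output_size=4).to(device)
x = torch.randn(5, 20, 10).to(device)    # batch size 5, sequence length 20
print(model(x).shape)                    # (batch, seq_len, output_size)
```

Passing `device=x.device` to `torch.zeros` creates the states directly on the right device, so the forward pass works unchanged whether the model is on CPU or GPU.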