【求助】强化学习backword()一直报错

IWhisper#697

2024/8/8镜像同步10 回复

你看看你的代码里面是不是有 w+=1的这种操作，这种会被认定成inplace操作，然后报错。。。。一个很坑的问题。。。

订阅后，新回复会通过你的通知中心匿名送达。

10 条回复

IWhisper#697机器人#0 · 2024/8/9

RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.FloatTensor [64, 64]], which is output 0 of AsStridedBackward0, is at version 2; expected version 1 instead. Hint: the backtrace further above shows the operation that failed to compute its gradient. The variable in question was changed in there or anywhere later. Good luck! 已经没有inplace操作了，为啥还会一直报这个错呢

IWhisper#856机器人#1 · 2024/8/9

你看看你的代码里面是不是有 w+=1的这种操作，这种会被认定成inplace操作，然后报错。。。。一个很坑的问题。。。

IWhisper#697机器人#2 · 2024/8/9

【在 IWhisper#856 的大作中提到: 】 : 你看看你的代码里面是不是有 w+=1的这种操作，这种会被认定成inplace操作，然后报错。。。。一个很坑的问题。。。 检查好几遍了，没有这个问题啊<img src="/img/ubb/em/9.gif" alt="em9" style="display:inline;border-style:none">

IWhisper#856机器人#3 · 2024/8/9

你把报错的附近的几行发上来看一眼

IWhisper#697机器人#4 · 2024/8/9

【在 IWhisper#856 的大作中提到: 】 : 你把报错的附近的几行发上来看一眼 q_actor_loss = agent.critic_eval(states, mu).flatten().clone() actor_loss = -T.mean(q_actor_loss).clone() agent.actor_optim.zero_grad() actor_loss.backward(retain_graph=True) agent.actor_optim.step()

IWhisper#666机器人#5 · 2024/8/9

backward 【在 IWhisper#697 的大作中提到: 】 :   : q_actor_loss = agent.critic_eval(states, mu).flatten().clone() : actor_loss = -T.mean(q_actor_loss).clone() : ............

IWhisper#697机器人#6 · 2024/8/9

【在 IWhisper#666 的大作中提到: 】 : backward 只有backward()报新的错：RuntimeError: Trying to backward through the graph a second time (or directly access saved tensors after they have already been freed). Saved intermediate values of the graph are freed when you call .backward() or autograd.grad(). Specify retain_graph=True if you need to backward through the graph a second time or if you need to access saved tensors after calling backward. <img src="/img/ubb/em/15.gif" alt="em15" style="display:inline;border-style:none">

IWhisper#112机器人#7 · 2024/8/9

只看这里没用，backward报的错，但是其他地方可能漏了inplace 【在 IWhisper#697 的大作中提到: 】 :   : q_actor_loss = agent.critic_eval(states, mu).flatten().clone() : actor_loss = -T.mean(q_actor_loss).clone() : ............

IWhisper#856机器人#8 · 2024/8/9

这部分看起来莫得，你可能需要pdb进到forward里面去看

IWhisper#697机器人#9 · 2024/8/9

已经解决啦，换了一个torch版本<img src="/img/ubb/em/3.gif" alt="em3" style="display:inline;border-style:none">