
Using _vmap in PyTorch to compute the Hessian-vector product (hvp) encounters a runtime error #33

Open
@bfialkoff

Description


Trying to use the minimize function fails with the methods

  • trust-ncg
  • dogleg
  • newton-exact
  • trust-exact
  • trust-krylov

but it succeeds with the other methods. Presumably, the failing methods are the only ones that compute Hessians.

I tracked the error to here.

I can't quite figure out what is really responsible for the error, but I suspect it's _vmap failing to batch properly, because my debugger indicates there is something wrong with the tensors that are yielded. If I look at the batched_inputs[0] variable in _vmap_internals._vmap and try to print it, view it, or add 1 to it, I get the error: RuntimeError: Batching rule not implemented for aten::is_nonzero. We could not generate a fallback.

Computing the Hessian in a loop works, but it is hideous and slow:

hvp_map = lambda V: torch.stack(
    [autograd.grad(grad, x, v, retain_graph=True)[0] for v in V], dim=0)
hess = hvp_map(self._I)
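For reference, a loop-free sketch of the same computation using the newer torch.func API (assuming PyTorch >= 2.0), which replaces the private _vmap_internals machinery and has batching rules for composing grad under vmap. The objective f below is a hypothetical stand-in for the actual objective being minimized:

```python
import torch
from torch.func import grad, vmap

# Toy scalar objective; stands in for the real objective (assumption).
def f(x):
    return (x ** 3).sum()

x = torch.randn(4)

# Hessian-vector product: hvp(v) = d/dx <grad f(x), v>
def hvp(v):
    return grad(lambda x: torch.dot(grad(f)(x), v))(x)

# Batch the hvp over the rows of the identity matrix to build the
# full Hessian, replacing the Python loop over basis vectors.
I = torch.eye(4)
hess = vmap(hvp)(I)

# For f(x) = sum(x^3), the Hessian is diag(6x).
assert torch.allclose(hess, torch.diag(6 * x))
```

This composes grad inside vmap, so the batching happens through the public functional API rather than through _vmap over autograd.grad outputs.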

Is this a real issue, or am I missing something?
