RFC: Always use special `.solve` for Kronecker linear operators #50

saitcakmak · 2023-02-16T03:09:25Z

As titled. These linear operators are generally much larger than their components. If fast_computations (in particular _fast_solves) is turned off, then we try to compute Cholesky over huge matrices, which leads to OOMs.

Balandat · 2023-02-16T06:22:17Z

linear_operator/functions/_solve.py

+    if isinstance(
+        linear_op,
+        (
+            CholLinearOperator,
+            TriangularLinearOperator,
+            KroneckerProductAddedDiagLinearOperator,
+            KroneckerProductLinearOperator,
+            KroneckerProductDiagLinearOperator,
+            KroneckerProductTriangularLinearOperator,
+            SumKroneckerLinearOperator,
+        ),
+    ):


Hmm will this always apply the special solve method? There may be situations in which we want to use Linear CG solves even for some operators with a special solve method.

Aside: The name "fast_computations" is a bit weird; whether it's fast or not will depend on the operator structure and the data size...

@gpleiss, @jacobrgardner, curious about your thoughts here

Aside: The name "fast_computations" is a bit weird; whether it's fast or not will depend on the operator structure and the data size...

Agreed. I regret it.

There may be situations in which we want to use Linear CG solves even for some operators with a special solve method.

@JonathanWenger and I brainstormed this a bit. One thought that we had was that a user could specify (via context manager, inline argument, etc.) when they want to go into iterative solving mode. All other solves would be performed using direct methods otherwise.

I think that could be useful. Another option would be to attach default rules for the decision which solves to use to the respective operators but then allow to override them (either way so a default exact solve may use an iterative instead and vice versa).

I agree with @Balandat on the default rules. That would nicely integrate with using a Kronecker, or banded specific solver. The interface Geoff and I were discussing was either via a context manager or with an optional argument that could specify the default per linear operator:

def solve(self, right_tensor: torch.Tensor, left_tensor: Optional[torch.Tensor] = None, linear_solver: LinearSolver = CG()) -> torch.Tensor:

However, there were some potential issues with this interface and the interplay with implementing torch.linalg.solve, if I remember correctly. In a perfect world torch.linalg.solve would dispatch on the specific kind of LinearOperator I suppose.

add kronecker to exception list

0adf688

saitcakmak requested a review from Balandat February 16, 2023 03:09

Balandat reviewed Feb 16, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFC: Always use special `.solve` for Kronecker linear operators #50

RFC: Always use special `.solve` for Kronecker linear operators #50

saitcakmak commented Feb 16, 2023

Balandat Feb 16, 2023

Balandat Feb 16, 2023

gpleiss Feb 27, 2023

Balandat Feb 28, 2023

JonathanWenger Feb 28, 2023

RFC: Always use special .solve for Kronecker linear operators #50

Are you sure you want to change the base?

RFC: Always use special .solve for Kronecker linear operators #50

Conversation

saitcakmak commented Feb 16, 2023

Balandat Feb 16, 2023

Choose a reason for hiding this comment

Balandat Feb 16, 2023

Choose a reason for hiding this comment

gpleiss Feb 27, 2023

Choose a reason for hiding this comment

Balandat Feb 28, 2023

Choose a reason for hiding this comment

JonathanWenger Feb 28, 2023

Choose a reason for hiding this comment

RFC: Always use special `.solve` for Kronecker linear operators #50

RFC: Always use special `.solve` for Kronecker linear operators #50