
Ability to deal with matrix input #47

Merged
tansongchen merged 9 commits into JuliaDiff:main on Sep 5, 2023
Conversation

mBarreau
Contributor

This PR aims to add the ability to deal with matrix input, since this can be helpful in the context of PINNs. The main modifications are:

  • Use mapcols in derivative with a matrix input (sketched below)
  • Enable different types between x and l in derivative
  • Add tests.
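
A minimal sketch of the idea, assuming a vector method derivative(f, x, l, order) as discussed later in this thread; derivative_cols is a hypothetical name, not the PR's actual code:

```julia
# Illustrative sketch, not the PR's actual code: apply the existing vector
# method column-by-column and reassemble the results into a 1-by-N row.
# `derivative(f, x, l, order)` is assumed to be the vector method discussed
# in this thread; `derivative_cols` is a hypothetical name for this sketch.
function derivative_cols(f, X::AbstractMatrix{<:Number},
                         l::AbstractVector{<:Number}, order::Integer)
    reduce(hcat, [derivative(f, x, l, order) for x in eachcol(X)])
end
```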

@tansongchen
Member

Could you explain mathematically what you are trying to do here? We might have a more general solution to this...

@mBarreau
Contributor Author

mBarreau commented Aug 28, 2023

@tansongchen
Sure, what I want to do is simple.
Let

$$ X = \begin{pmatrix} x_1 & x_2 & \cdots & x_N \end{pmatrix}, \qquad x_i \in \mathbb{R}^M. $$

Then

$$ \frac{\partial^k f}{\partial l^k}\bigg|_{X} = \begin{pmatrix} \frac{\partial^k f}{\partial l^k}\big|_{x_1} & \cdots & \frac{\partial^k f}{\partial l^k}\big|_{x_N} \end{pmatrix}. $$

Is there a reason for the types of x and l in derivative to have to be the same? If both were allowed to be independent subtypes of AbstractVector, that would allow more freedom in the syntax :)
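
For illustration, a minimal dispatch example (g_same and g_free are hypothetical names) of what the shared type parameter rules out:

```julia
# Hypothetical functions illustrating the dispatch question: with a single T,
# x and l must share an element type; with independent bounds they need not.
g_same(x::AbstractVector{T}, l::AbstractVector{T}) where {T <: Number} = x .+ l
g_free(x::AbstractVector{<:Number}, l::AbstractVector{<:Number}) = x .+ l

g_free([1.0, 2.0], [1, 0])    # works: Float64 point with an Int direction
# g_same([1.0, 2.0], [1, 0])  # MethodError: no T matches both Float64 and Int
```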

@codecov

codecov bot commented Aug 28, 2023

Codecov Report

Patch coverage: 100.00% and project coverage change: +0.29% 🎉

Comparison is base (7979d76) 85.18% compared to head (6ad719f) 85.48%.

Additional details and impacted files
@@            Coverage Diff             @@
##             main      #47      +/-   ##
==========================================
+ Coverage   85.18%   85.48%   +0.29%     
==========================================
  Files           6        6              
  Lines         243      248       +5     
==========================================
+ Hits          207      212       +5     
  Misses         36       36              
Files Changed        Coverage Δ
src/derivative.jl    100.00% <100.00%> (ø)


@mBarreau
Contributor Author

@tansongchen, tests pass now.
Is that fine with you?

@tansongchen
Member

Thanks for contributing that, but I'm still preparing for an exam these days 😂 I will get back to you and take a closer look on Friday or this weekend!

@tansongchen
Member

tansongchen commented Sep 1, 2023

Let me try to understand the point of these additional differentiation APIs.

Currently, there are two methods: derivative(f, x0, order) calculates the higher-order derivative

$$ \frac{\mathrm d^kf}{\mathrm dx^k}\big |_{x_0} $$

and derivative(f, x0, l, order) calculates the higher-order directional derivative in direction l

$$ \frac{\partial^kf}{\partial l^k}\big |_{x_0} $$
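
For reference, a hedged usage sketch of these two methods, with the signatures as quoted here (the released TaylorDiff API may differ in detail):

```julia
using TaylorDiff

# d²(sin)/dx² at x₀ = 0, which is -sin(0) = 0
derivative(sin, 0.0, 2)

# second directional derivative of f(x) = x₁² + x₂² at x₀ = (1, 2)
# along l = e₁, which is 2
derivative(x -> sum(abs2, x), [1.0, 2.0], [1.0, 0.0], 2)
```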

And now, you add two additional methods, which say that:

  1. For a 1-by-N matrix input x, the function should calculate the derivative at each of its components, and then assemble the output back to a 1-by-N matrix;
  2. For a M-by-N matrix input x, and a M-sized vector l, the function should calculate the directional derivative at each of its columns, and then assemble the output back to a 1-by-N matrix;

Is this correct? If so, I'm happy with this kind of shorthand notation, as long as it proves handy in PINN applications. But I would prefer not to use Union types and to move the new APIs to a new block, as well as add some comments stating that they are just shorthands for multiple calculations; otherwise they might be confused with matrix derivatives (see https://en.wikipedia.org/wiki/Matrix_calculus). If you agree on that, I will take care of moving the code and adding comments, and then merge.
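
A small self-contained check of the column-wise semantics in point 2, for the first-order case, with a hand-coded directional derivative (all names are illustrative; no TaylorDiff calls, so the arithmetic is explicit):

```julia
# f(x) = x₁² + 3x₂ has gradient [2x₁, 3], so ∂f/∂l = 2x₁·l₁ + 3l₂.
f(x) = x[1]^2 + 3x[2]
dirderiv(x, l) = 2x[1] * l[1] + 3 * l[2]   # hand-coded directional derivative

X = [1.0 2.0 3.0;   # three sample points stored as columns
     0.0 1.0 2.0]
l = [1.0, 0.0]      # direction: first coordinate

# N independent directional derivatives reassembled into a 1-by-3 row:
reduce(hcat, [dirderiv(x, l) for x in eachcol(X)])   # == [2.0 4.0 6.0]
```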

@mBarreau
Contributor Author

mBarreau commented Sep 1, 2023

Hi,

First of all, you are totally correct about what I aim to do.

Let me justify it. You define the Flux/Lux model, apply it to the input, and then associate the output with the targets to build your loss function. The idea is to do the same with the physics residual. Since the rand function outputs a 1×N matrix, it is very convenient to define a residual model which behaves like the original model (n×N input and M×N output) so that you can build the loss in the exact same way. This also makes it simpler to resample or to build more complex losses.
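
To make this concrete, a minimal sketch where model and residual are plain stand-ins rather than a real Flux/Lux network or an actual physics residual:

```julia
# Because rand(1, N) already returns a 1×N matrix, a residual that maps a
# 1×N matrix to a 1×N matrix composes with the usual data-loss pipeline.
model(X)    = sin.(X)                 # stand-in for the trained network
residual(X) = model(X) .- cos.(X)     # stand-in physics residual, 1×N in and out

X    = rand(1, 100)                   # collocation points sampled as 1×100
loss = sum(abs2, residual(X)) / size(X, 2)   # built exactly like a data loss
```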

If you agree, then I would definitely support such an idea :) (and I would even write a small tutorial showing how easy it becomes to write complex PINNs using TaylorDiff).

@tansongchen
Member

Just did some cleanup work and added some comments! Once CI passes I will merge. Thanks again for contributing!

@mBarreau
Contributor Author

mBarreau commented Sep 4, 2023

@tansongchen, can I ask why you write
AbstractMatrix{T} where T <: Number
and not
AbstractMatrix{<:Number}?
The second option is shorter and simpler to read.

@tansongchen
Member

They are equivalent when there is only one type parameter and the parameter is not used in the function body. However, when there are two or more, an explicit variable name helps to tell whether two types can be different or not. Also, in the make_taylor function the type parameter is used for explicit conversion.

So, for consistency with the more complicated cases, I would personally prefer to write all type parameters as variables :)
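
For illustration, a small sketch of the difference; make_taylor_like is a hypothetical stand-in for the real make_taylor:

```julia
# A named parameter gives the body a handle on the element type, e.g. for
# explicit conversion or construction:
make_taylor_like(x::AbstractMatrix{T}) where {T <: Number} = zero(T)

# Same dispatch behaviour, but no name for the element type in the body;
# it must be recovered indirectly:
elem_type(x::AbstractMatrix{<:Number}) = eltype(x)

make_taylor_like(ones(2, 2))   # 0.0
elem_type(ones(2, 2))          # Float64
```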

tansongchen merged commit f776b85 into JuliaDiff:main on Sep 5, 2023
10 checks passed