Add tril_ layer for lower triangular matrix operations #3018
Conversation
In C++17, floating-point values are not allowed as non-type template parameters. Therefore, I used an intermediate structure to pass the float to the tril_ layer. However, the GCC compilation still fails. Is there a 'trick' in dlib to do this?
I didn't check in detail what you're doing, but the one time I needed a float as a template parameter, I just used two integers (a numerator and a denominator). In your case, something like:

```cpp
template <long num, long den>
class tril_
{
public:
    tril_() : diag(static_cast<float>(num) / static_cast<float>(den)) {}

private:
    float diag;
};
```
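With that encoding, the diagonal value becomes a ratio of two integers. A usage sketch based on the snippet above (not on the final API merged into dlib):

```cpp
// num = 1, den = 2 encodes a diagonal value of 1.0f / 2.0f = 0.5f.
tril_<1, 2> masked_layer;
```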
Thank you for your feedback, @arrufat. I appreciate your suggestion, and I explored several approaches during this refactoring. C++17 would indeed make this easier to express, but given the current constraints I needed a C++14-compatible solution. In the end, I implemented a mechanism very similar to what you describe: the refactored tril_ class uses a combination of tags (for special values such as negative infinity) and a numerator/denominator pair for other numeric values.
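A minimal sketch of that kind of mechanism, assuming illustrative names (neg_infinity_tag, get_diag_value) rather than the identifiers actually merged into dlib:

```cpp
#include <limits>
#include <type_traits>

// Tag type selecting a special diagonal value that cannot be written
// as a ratio of two integers.
struct neg_infinity_tag {};

// C++14-compatible encoding of the diagonal value: either a tag for
// special values, or a num/den pair for ordinary numeric values.
template <typename tag = void, long num = 0, long den = 1>
class tril_
{
public:
    tril_() : diag_value(compute_diag()) {}

    float get_diag_value() const { return diag_value; }

private:
    static float compute_diag()
    {
        if (std::is_same<tag, neg_infinity_tag>::value)
            return -std::numeric_limits<float>::infinity();
        return static_cast<float>(num) / static_cast<float>(den);
    }

    float diag_value;
};

// tril_<neg_infinity_tag>  -> diagonal value of -infinity
// tril_<void, 1, 2>        -> diagonal value of 0.5f
```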
Looks good, thanks for the PR :)
This PR introduces a new tril_ layer to dlib, implementing lower triangular matrix operations similar to PyTorch's torch.tril() function. The layer provides flexible lower triangular masking with a customizable diagonal offset and a customizable value for the masked elements.
Key features:
- Lower triangular masking in the style of torch.tril()
- Configurable diagonal offset
- Configurable value for the elements above the diagonal
This addition enhances dlib's neural network capabilities, enabling more complex architectures that require lower triangular matrix operations. The layer is particularly useful for attention mechanisms and other scenarios where lower triangular masking is needed.
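For intuition, here is a minimal sketch of the masking semantics themselves, written directly against dlib::matrix rather than through the new layer: elements above the k-th diagonal are replaced by a chosen value, and picking -infinity yields the causal masks applied before a softmax in attention.

```cpp
#include <dlib/matrix.h>
#include <iostream>
#include <limits>

int main()
{
    // A 4x4 matrix of made-up attention scores.
    dlib::matrix<float> scores(4, 4);
    scores = 1.0f;

    const long k = 0;  // diagonal offset, as in torch.tril(input, diagonal=k)
    const float diag_value = -std::numeric_limits<float>::infinity();

    // Keep elements on and below the k-th diagonal; replace the rest.
    // A subsequent softmax turns the -infinity entries into zero weights.
    for (long r = 0; r < scores.nr(); ++r)
        for (long c = 0; c < scores.nc(); ++c)
            if (c > r + k)
                scores(r, c) = diag_value;

    std::cout << scores << std::endl;
    return 0;
}
```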