Add support for grouped convolutions #2485
base: master
Conversation
Yeah that paper is nice. I like these new simple building blocks. I hope they do actually work well beyond simple resnet modules.
I got carried away with other stuff; I will finish this at some point, though 😅
Can this be implemented with the Toeplitz matrix?
I was planning to get inspiration from here.
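For context, a grouped convolution decomposes into `groups` independent convolutions over channel slices, so an existing single-group routine (including the Toeplitz/im2col path) can in principle be reused per group. A rough sketch over raw float buffers, with a hypothetical `conv2d_single_group` helper rather than dlib's actual API:

```cpp
// Illustrative sketch only: a grouped convolution is `groups` independent
// convolutions over channel slices. Toy layout (single sample, C x H x W
// contiguous, stride 1, no padding), not dlib's tensor API.
#include <cstddef>

// Hypothetical single-group conv, defined elsewhere (e.g. the existing
// Toeplitz/im2col path): in [Cg x H x W], filt [Kg x Cg x FH x FW],
// out [Kg x OH x OW].
void conv2d_single_group(const float* in, int Cg, int H, int W,
                         const float* filt, int Kg, int FH, int FW,
                         float* out);

void grouped_conv(const float* in, int C, int H, int W,
                  const float* filt, int K, int FH, int FW,
                  float* out, int groups)
{
    const int Cg = C / groups;            // input channels per group
    const int Kg = K / groups;            // filters (output channels) per group
    const int OH = H - FH + 1, OW = W - FW + 1;
    for (int g = 0; g < groups; ++g)
    {
        // Each group sees only its own slice of channels and filters.
        const float* in_g   = in   + static_cast<size_t>(g) * Cg * H * W;
        const float* filt_g = filt + static_cast<size_t>(g) * Kg * Cg * FH * FW;
        float*       out_g  = out  + static_cast<size_t>(g) * Kg * OH * OW;
        conv2d_single_group(in_g, Cg, H, W, filt_g, Kg, FH, FW, out_g);
    }
}
```

The loop is only meant to show why the CPU side can lean on whatever single-group conv code already exists; a GPU backend like cuDNN does this slicing internally.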
I have a CPU implementation, but I have to go through some company bureaucracy to be allowed to upload it.
Just noticed the Toeplitz matrix isn't cached in the …
There are, but there are other allocations and deallocations too. I doubt that one makes a meaningful difference to runtime speed, considering everything that goes on if someone is using this kind of model.
@davisking OK, I trust you. I had a quick look at the ncnn repo, and they have something like 100 specialisations of the convolution layer for different parameters: kernel sizes, groups, whether or not the layer is 8-bit quantized, different architectures, and so on. It looks like way too much work to do something similar in dlib to get CPU conv performance up to standard. Sorry, this is unrelated to this PR; just a passing observation.
Yeah, that's how you do conv on the CPU fast. The Toeplitz matrix thing is a weird hack. I did it (as did others) because it's just a super easy way to support all the various conv types with their different strides and all that. But fast conv code looks like this kind of thing: https://github.com/davisking/dlib/blob/master/dlib/image_transforms/spatial_filtering.h#L126, or other similar setups. It depends on which kind of conv we are talking about.
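To make the contrast concrete, here is a toy direct convolution in that nested-loop style (single channel, "valid" region only). The dlib code linked above is far more elaborate, but the loop structure is the recognizable part, as opposed to rewriting the convolution as a matrix multiply:

```cpp
// Toy direct 2D convolution over a single channel, "valid" region only,
// just to illustrate the nested-loop shape. Real fast implementations add
// SIMD, cache blocking, and multi-channel handling on top of this.
#include <vector>

void direct_conv2d(const std::vector<float>& img, int H, int W,
                   const std::vector<float>& filt, int FH, int FW,
                   std::vector<float>& out) // (H-FH+1) x (W-FW+1)
{
    const int OH = H - FH + 1, OW = W - FW + 1;
    out.assign(static_cast<size_t>(OH) * OW, 0.f);
    for (int r = 0; r < OH; ++r)
        for (int c = 0; c < OW; ++c)
        {
            float sum = 0.f;
            // Slide the filter over the image and accumulate.
            for (int fr = 0; fr < FH; ++fr)
                for (int fc = 0; fc < FW; ++fc)
                    sum += img[(r + fr) * W + (c + fc)] * filt[fr * FW + fc];
            out[r * OW + c] = sum;
        }
}
```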
Compare 9628aa0 to 1382ab5
In this PR I will try to add support for grouped convolutions in dlib.
I never had any interest in or use for this kind of convolution until yesterday, when I read this paper: A ConvNet for the 2020s.
The paper takes the main ideas behind Transformer networks and applies them to a convolutional network.
It makes use of several recent additions to dlib.
Unfortunately, it also makes use of grouped convolutions, which are not currently supported in dlib.
That was the motivation I needed. So far I've written:
- The GPU part, which is relatively easy, since it's just a matter of using the cuDNN API (see the sketch after this list).
- The CPU part, which might take longer to complete (I don't think I'll ever use it, but I will try to add it for completeness).
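For the GPU side, here is a minimal sketch of the cuDNN (v7+) descriptor setup; the function and parameter names are illustrative, not dlib's internals, and error checking is omitted. The only change relative to a regular convolution is one extra call, plus a filter descriptor whose channel dimension is `in_channels / groups`:

```cpp
// Minimal sketch (not dlib's actual code) of grouped convolution setup in
// cuDNN >= 7. Status return values should be checked in real code.
#include <cudnn.h>

void setup_grouped_conv(cudnnConvolutionDescriptor_t conv_desc,
                        cudnnFilterDescriptor_t filter_desc,
                        int out_channels, int in_channels, int groups,
                        int kernel_h, int kernel_w)
{
    // Ordinary 2D convolution descriptor: pad 1, stride 1, dilation 1.
    cudnnSetConvolution2dDescriptor(conv_desc,
                                    1, 1,   // pad_h, pad_w
                                    1, 1,   // stride_h, stride_w
                                    1, 1,   // dilation_h, dilation_w
                                    CUDNN_CROSS_CORRELATION,
                                    CUDNN_DATA_FLOAT);

    // The one extra call that turns it into a grouped convolution.
    cudnnSetConvolutionGroupCount(conv_desc, groups);

    // Each filter only sees in_channels / groups input channels.
    cudnnSetFilter4dDescriptor(filter_desc,
                               CUDNN_DATA_FLOAT, CUDNN_TENSOR_NCHW,
                               out_channels, in_channels / groups,
                               kernel_h, kernel_w);
}
```

With `groups == in_channels == out_channels` this degenerates to a depthwise convolution, which is what the ConvNeXt blocks in the paper use.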
I've already implemented the ConvNeXt models described in the paper, and the forward pass seems to work.
Let me know if the approach is sensible.