What is: Dimension-wise Convolution?

A Dimension-wise Convolution, or DimConv, is a type of convolution that can encode depth-wise, width-wise, and height-wise information independently. To achieve this, DimConv extends depthwise convolutions to all dimensions of the input tensor $X \in \mathbb{R}^{D\times{H}\times{W}}$ , where $W$ , $H$ , and $D$ corresponds to width, height, and depth of $X$ . DimConv has three branches, one branch per dimension. These branches apply $D$ depth-wise convolutional kernels $k\_{D} \in \mathbb{R}^{1\times{n}\times{n}}$ along depth, $W$ width-wise convolutional kernels $k\_{W} \in \mathbb{R}^{n\times{1}\times{1}}$ along width, and $H$ height-wise convolutional kernels $k\_{H} \in \mathbb{R}^{n\times{1}\times{n}}$ kernels along height to produce outputs $Y\_{D}$ , $Y\_{W}$ , and $Y\_{H} \in \mathbb{R}^{D\times{H}\times{W}}$ that encode information from all dimensions of the input tensor. The outputs of these independent branches are concatenated along the depth dimension, such that the first spatial plane of $Y\_{D}$ , $Y\_{W}$ , and $Y\_{H}$ are put together and so on, to produce the output $Y\_{Dim} =$ { $Y\_{D}$ , $Y\_{W}$ , $Y\_{H}$ } $\in \mathbb{R}^{3D\times{H}\times{W}}$ .

Source	DiCENet: Dimension-wise Convolutions for Efficient Networks
Year	2000
Data Source	CC BY-SA - https://paperswithcode.com