And each pixel is what, a RGB-vector?

PartiallyTyped · on Sept 19, 2023

We usually do HxWxC, for height, width, and channels, so each pixel is addressed via the two first dims of the input, and then it has 3 channels. Of course, you can transpose the tensor to CxHxW or CxWxH. Different ordering behaves differently with respect to memory locality.

dartos · on Sept 19, 2023

In the context of neural networks, kind of. It’s 3 numbers on the input layer, but that could influence N parameters in later layers.

polynomial · on Sept 20, 2023

Yes, each pixel is a vector of the rasterized vector space.