Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

And each pixel is what, a RGB-vector?


We usually do HxWxC, for height, width, and channels, so each pixel is addressed via the two first dims of the input, and then it has 3 channels. Of course, you can transpose the tensor to CxHxW or CxWxH. Different ordering behaves differently with respect to memory locality.


In the context of neural networks, kind of. It’s 3 numbers on the input layer, but that could influence N parameters in later layers.


Yes, each pixel is a vector of the rasterized vector space.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: