There’s a simple explanation for everything, at least if you don’t care how complete the explanation is. The simple explanation for Computer Vision is that it is 3D Computer Graphics in reverse. The medium explanation puts the simple one to shame, while the advanced one could pass for advanced physics warping the universe.
Let’s start with the simple explanation - the ‘opposite of 3D Computer Graphics’ - which tells us that since 3D CG maps locations in 3D space to a 2D screen, Computer Vision should do the opposite: we have one or more images of a location and want to recover the 3D scene from these images. This translation from two dimensions back to three can be done in a number of ways, most of them needing more complex explanations than the simple version can offer.
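To make the ‘3D to 2D’ direction concrete, here is a minimal sketch of a pinhole-style perspective projection. The function name `project` and the default `focal_length` are my own choices for illustration, and the point is assumed to already be given in the camera's coordinate system. It also hints at why going backwards is hard: two different 3D points on the same line of sight land on exactly the same 2D position.

```python
def project(point, focal_length=1.0):
    """Perspective-project a 3D point (x, y, z), given in camera
    coordinates, onto a 2D image plane (illustrative helper)."""
    x, y, z = point
    if z <= 0:
        raise ValueError("point must be in front of the camera (z > 0)")
    # Dividing by depth is what makes distant objects appear smaller.
    return (focal_length * x / z, focal_length * y / z)


near = (1.0, 2.0, 4.0)
far = (2.0, 4.0, 8.0)  # twice as far away, on the same ray from the camera

print(project(near))  # (0.25, 0.5)
print(project(far))   # (0.25, 0.5)
```

Both points give the same image position, so a single image cannot tell them apart. That lost depth is what the ‘opposite’ direction has to win back, usually by using more than one image.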
Moving on to the medium-difficulty explanation, we see that Computer Vision has close relatives in fields such as Photogrammetry, Signal Processing and Image Analysis, and that it can be used in, for example, robotics or for creating 3D scenes to walk around in. It is also one of the most advanced fields in Computer Science at the moment, with very few systems that are both simple and fast to use and completely robust. Much of the focus is on cameras, and on the external and internal parameters of those cameras at the moment the photos were taken.
The internal camera parameters include the zoom/focal length, the aspect ratio and the pixel size, while the external parameters are the position and rotation of the camera in three dimensions. Since photos are always two-dimensional, the depth of the scene is lost when they are taken, and recovering it leads to major mathematical and computational difficulties. For some applications these difficulties have led to combining Computer Vision with preparatory methods. One example is to create an estimated ‘pre-model’ that constrains the final result when recreating a scene; another is to constrain the search by predicting the next position of a tracked object.
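As a rough sketch of how these parameters fit together, the code below builds an internal (intrinsic) matrix from focal length, pixel size and aspect ratio, and uses the camera's position and rotation as the external part. All function names are my own, the principal point `cx, cy` (the image centre in pixels) is an extra assumption not mentioned above, and the world-to-camera convention `R * (X - C)` is just one common choice.

```python
import math


def intrinsic_matrix(focal_length_mm, pixel_size_mm, aspect_ratio, cx, cy):
    """Internal parameters as a 3x3 matrix (cx, cy are assumed values)."""
    fx = focal_length_mm / pixel_size_mm  # focal length measured in pixels
    fy = fx * aspect_ratio
    return [[fx, 0.0, cx],
            [0.0, fy, cy],
            [0.0, 0.0, 1.0]]


def rotation_about_y(angle_rad):
    """Rotation matrix for a camera turned around its vertical axis."""
    c, s = math.cos(angle_rad), math.sin(angle_rad)
    return [[c, 0.0, s],
            [0.0, 1.0, 0.0],
            [-s, 0.0, c]]


def mat_vec(m, v):
    """Multiply a 3x3 matrix by a 3-vector."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def project_point(K, R, camera_position, world_point):
    """Map a 3D world point to pixel coordinates."""
    # External parameters: express the point relative to the camera.
    relative = [p - c for p, c in zip(world_point, camera_position)]
    cam = mat_vec(R, relative)
    if cam[2] <= 0:
        raise ValueError("point is behind the camera")
    # Internal parameters: convert to pixels, then divide by depth.
    u, v, w = mat_vec(K, cam)
    return (u / w, v / w)


K = intrinsic_matrix(4.0, 0.005, 1.0, cx=320.0, cy=240.0)
R = rotation_about_y(0.0)           # camera looking straight ahead
camera_position = (0.0, 0.0, -5.0)  # five units behind the world origin
pixel = project_point(K, R, camera_position, (1.0, 0.5, 0.0))
print(pixel)  # approximately (480, 320)
```

Computer Vision usually has to run this backwards: given only pixel positions, estimate `K`, `R`, the camera position and the 3D points all at once, which is where the difficulties come from.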
Since this post is getting a bit long I’ll continue with the advanced explanation and some tips for books and online resources in another post…