Informatics Institute
Focus on research: informatician Jan-Mark Geusebroek

From flying an airplane to forecasting the weather: people use computers for very different things. But is a computer able to see and perhaps even judge whether something is beautiful or ugly? Informatician Jan-Mark Geusebroek of the Intelligent Systems Lab Amsterdam (ISLA), Informatics Institute, investigates how to he can teach a computer to look at things like people do.
Geusebroek works on various lines of research within informatics. At present he is working on very compact representation of images, whereby only the most important characteristics remain. ‘I'm looking for those features which describe an object', explains Geusebroek. ‘The computer needs to learn these features in order to achieve a level of understanding of things.' To give insight into his research, Geusebroek is teaching a robot dog to recognize objects, whereby a worldwide network of computers forms the ‘brain' of the dog.
After a couple of years' research, the robot dog is now able to recognize about a thousand objects. ‘It takes about 20 milliseconds for it to process a new photograph and about 6 to 10 photographs are required per object.' In theory, the dog should be able to learn to recognize a thousand new objects within a few minutes, assuming that the photographs are already available. Geusebroek admits that a lot of time is spent taking photographs and showing them to the robot-dog. Moreover, he has no idea how the ‘animal' will react when it has to identify large numbers of objects, e.g. in a department store with about twenty to fifty thousand objects. Despite the fact that there are still a lot of catches, the business community has shown a lot of interest in the smart robot doggie.
A large part of Geusebroek's research relates to the question ‘What must a computer learn in order to recognize objects?' To answer this question, he converts images into statistics. Geusebroek discovered that so-called Weibull statistics provides information that is important for object recognition. ‘You turn on your camera, take a picture and your image is a complete Weibull-distribution! I find that truly remarkable', says Geusebroek enthusiastically. ‘We were already aware that this distribution was recurrent, but I discovered that it is an elementary feature for image recognition.'
‘Nobody else had used Weibull statistics to this end; on the contrary, when I wrote an article about it about four or five years ago, I received remarks that it was ridiculous. I published it eventually, albeit as part of the methodology.' By now a large part of the ISLA group uses this method successfully. During comparative tests in which teams from various research institutes participated, the ISLA scored very well throughout.
Weibull statistics
Suppose you examine somebody's list of marks, a few sixes, a couple of eights, a nine and a five. If you chart these marks and the number of times a particular mark was obtained on a graph, you end up with a nice Gaussian distribution, a bell shape. In Weibull statistics you sort the marks, taking the difference between the successive numbers. You subsequently chart the distance between two successive numbers and the number of times that the number occurs. This gives a Weibull distribution. Geusebroek discovered that this distribution recurs in many pictures, whereby the exact shape of the final graph turns out to be characteristic for the spatial structure of the object.
View things as humans do
Geusebroek is of the opinion that a lot can be learned from how people look at things. ‘A human is born with a brain, an extremely complicated structure which has evolved over millions of years. I look at the world outside and assume that humans are adapted to it. Suppose you view your head as a large calculator, what do you need to calculate in order to think something about the image you are looking at? This is how I try to make my computer view things, and so I steal some parts of human perception, such as three colour channels. More channels would probably work better, but that would cost more pixel space; more colour is, thus, less resolution. Fewer colour channels wouldn't be a good choice, since you would be colour blind. Humans are also very sensitive to movement; we notice movements in both the intensity difference as well as the colour contrast, so we can see moving edges. I then use these facts in the computer algorithmics.'
It remains to be seen whether the computer will ever be able to view things as humans do. ‘In twenty years computers will probably have the same computing power as the human brain. However, the algorithmics are still missing. But we are making progress. Thanks to new fMRI techniques, neurobiologists and psychologists know approximately which area in the brain plays a role at a particular moment. However, we have no idea what is being calculated. I sometimes compare neurologists' work to examining a pentium-4 processor under the microscope, and then trying to figure out how it works. It is super-complex.'
Geusebroek is looking for the direct effect of physics on statistics. ‘When the suns sets, how does the contrast in grass change? Can an algorithm describe exactly how the structures change as a result of the setting sun?' Thanks to his research, Geusebroek has developed another manner of observation. ‘I have learned to look at things more carefully, I look for a structure in a photograph. I probably study a piece of tree-bark longer than other people.' Geusebroek's main aim, teaching the computer to observe things as a human being, stems in the first instance from curiosity. He admits laughingly that ‘it would be useful if my computer really was able to see things. I'd sit in the back of the car, and let the computer drive.'
Image processing
Geusebroek ended up at the UvA after an LTS Electro-technical study and an HTS in Electronics. 'I did my alternative national service at the UvA, following a combined training at the molecular cell biology group and at the Informatics Institute. I was supposed to make a really good picture from a grainy image of beautiful, three-dimensional DNA structures. That was my introduction to image processing.' Geusebroek got a position as PhD student at the ISLA. ‘A Belgian pharmaceutics company wanted to analyse samples, large plates of cell specimens. In order to follow a sample with living cells for any length of time, the microscope must remain focussed, but that often went wrong. I developed an algorithm that keeps a microscope continually focussed.' Geusebroek's research led to a worldwide patent.
Geusebroek found his next challenge in analysing colours within the specimens. ‘Colouration was evaluated by people. It was very intensive and subjective work. I immersed myself in its standardisation.' After writing his PhD thesis on this subject, Geusebroek was awarded a VENI grant from NWO for research into computers that ‘look'.
Erotic pictures
Geusebroek is now supervisor of three PhD students. In collaboration with researchers from Groningen, one of them is investigating how the eye moves when looking at a picture. ‘When you show somebody a picture of a person, many people look at the face first. But it takes about 100 milliseconds before you even see a face at all. What do you see in those first milliseconds? And does this depend on the content of the picture?'
Geusebroek also collaborates with the University of Bonn. ‘We show people pictures with an emotional content, for example pitiful, violent or erotic pictures. The fMRI-scanner shows which areas of the brain are activated. We then examine to what extent the content of the image determines the emotions. Do we still see emotions if, for example, we standardise the colour setting?' In collaboration with the department of Psychophysics in Utrecht, Geusebroek also investigates the sensitivity of the eye for distinguishing colours and structures. ‘How does a human see a tree? You don't absorb each leaf separately, yet you get the impression ‘tree'. How does this impression come about?
Ultimately, the informatician hopes to be able to forecast what people look at in a picture. When this is known, this information can be used to teach a computer to look at things better. ‘We hope that we will also be able to teach the system to make subjective judgements, so that the robot dog doesn't just learn to recognise things, but also to distinguish between what is beautiful and what isn't'






