Ph.D. thesis
Semi-automatic high-resolution rural land cover classification
This thesis presents a complete image analysis system which, from high-resolution 3 or 4-channel digital images (50cm, colour and optionally near infrared), and using the cadastre database, segments the images into agriculturally-homogeneous regions (fields, forests, vines, and so on), and classifies these regions, tagging each classified region with a confidence measure which indicates the system’s confidence in each classification. It includes a study of the value of texture features and transformed colour spaces for segmentation and classification, two methods for registering a graph onto an image, a novel probability model and associated per-region classification algorithms, and a high-precision period and orientation estimator.
Download
You can download the following files:
- Executive summary [pdf, 106 kbyte], an 11-page summary of the thesis, written for people who are not scientific experts in land cover classification, describing the main goals and techniques of this thesis, as well as its economical and business implications.
- Thesis dissertation [pdf, 10 Mbyte]. This is the main document.
- Book cover for thesis dissertation [png, 1.6 Mbyte] [tiff, 15 Mbyte]. Unofficial colourful cover page.
- Thesis defence slides [pdf, 21 Mbyte].
You can print a copy of the (unofficial) book cover together with the thesis dissertation file to get your own copy of this thesis. Choose an image format (PNG or TIFF) and print the corresponding book cover image at 150 DPI on an "A3wide" (453 x 328mm) or larger paper. Once printed, cut away the top and bottom margins using the red marks as a guide. This sheet contains the front cover, book spine, and back cover; fold appropriately. Then, cut away the left and right margins as required, depending on the thickness of the paper used for the thesis contents, and glue everything together.
Extended summary
Much research has been done on the subject of automatically determining land use and land cover in rural areas. However, error rates have always been too high for industrial application.
The French National Mapping Agency (Institut Géographique National, IGN), is interested in automatic land cover classification for the purpose of speeding up production of high-resolution topographic maps. The context at IGN is, however, slightly different than in most other research into automatic land cover classification: First, very-high spatial resolution digital imagery is available (50cm per pixel), but these images have a low spectral resolution (red, green, blue, and in some cases near-infrared channels), which precludes the use of standard hyperspectral classification techniques. Second, cadastre data is available; this data gives a very rough indication of the position of fields. Finally, IGN's interest stems from practical reasons: to reduce the amount of time spent by human photo-interpreters doing manual classifications. Therefore, IGN is not especially interested in obtaining a medium-quality land cover classification of the whole region of interest but would rather have a very high quality classification of only a portion of the terrain; the first one would have to be double-checked in full by a photo-interpreter, whereas the second one would not need further human verification, so that photo-interpreters would be able to concentrate on the remaining area.
In this Ph.D.'s thesis, I present a complete image analysis system which, from high-resolution 3 or 4-channel digital images (50cm, colour and optionally near infrared), and using the cadastre database, segments the images into agriculturally-homogeneous regions (fields, forests, vines, and so on), and classifies these regions, tagging each classified region with a confidence measure which indicates the system's confidence in each classification, and which can be used to filter out regions that are more likely to have been incorrectly classified.
The process starts with a hierarchical segmentation, using a colour space, texture parameters, and shape criteria adapted to the problem of segmenting agricultural regions. This segmentation is used to register the cadastre onto the image, giving large, usually homogeneous regions. Through this registration the system can also be used to update older classifications. Then, each of these registered cadastre regions ---or, if cadastre data is not available, small regions obtained by watershed segmentation--- is classified using novel probabilistic per-region classification algorithms which, unlike traditional per-pixel algorithms, do not produce salt-and-pepper noise, and which also output a classification confidence measure for each classified region. These classification algorithms are supervised, and need to be trained beforehand with a ground truth defined by a photo-interpreter.
As end product we get an image segmentation into classified agriculturally-homogeneous regions, and confidence measures for each part of the segmentation, which a human photo-interpreter can use to correct these results or to concentrate limited available time into the most likely errors.