PE&RS June 2015 - page 453

When the centroid c falls outside S (as in U-shaped objects), the proposal of Möller et al. (2013) uses an alternative normalization factor such as the square root of the area of A. To avoid inconsistent formulas that depend on the objects' shape, this paper implements a single normalization factor regardless of the object's shape: the distance between $c_R$ or $c_F$ and the respective furthermost vertex of R ($v_{R,\max}$) or F ($v_{F,\max}$) (Figures 2b and 2c). Thus, $P_R$ and $P_F$ were calculated as:

$$P_X = 1 - \frac{\mathrm{dist}(c_S, c_X)}{\mathrm{dist}(c_X, v_{X,\max})}, \quad \text{with } X \in [R, F] \qquad (2)$$
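As a rough illustration of Equation 2, the following Python sketch computes the positional metric for one object. It assumes that the centroids are already known and that X is given as a plain list of vertex coordinates; the function name `position_metric` and the example square are hypothetical and not part of the original method description.

```python
import math


def position_metric(c_s, c_x, vertices_x):
    """P_X = 1 - dist(c_S, c_X) / dist(c_X, v_X,max), with X either R or F."""
    # Normalization factor: distance from c_X to the furthermost vertex of X.
    norm = max(math.dist(c_x, v) for v in vertices_x)
    return 1.0 - math.dist(c_s, c_x) / norm


# Hypothetical example: X is a 4 x 4 square centred on (2, 2).
square = [(0, 0), (4, 0), (4, 4), (0, 4)]
p_same = position_metric((2, 2), (2, 2), square)   # centroids coincide
p_shift = position_metric((3, 2), (2, 2), square)  # c_S shifted by one unit
```

When the two centroids coincide the metric equals 1, and it decreases as $c_S$ moves away from $c_X$. A degenerate X whose vertices all coincide with its centroid would cause a division by zero, which this sketch does not handle.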

Next, metrics $O_X$ and $P_X$, with $X \in [R, F]$, are combined through geometric averaging, defining the metrics $G_R$ and $G_F$ as $G_X = \sqrt{O_X \cdot P_X}$, with $X \in [R, F]$. Metrics $G_R$ and $G_F$ assess the areal and positional geometric accuracy of object S in relation to R and F. Over-segmented and under-segmented objects correspond to low values of $G_R$ and $G_F$, respectively. The values of both metrics lie on a scale between 0 and 1.
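The geometric averaging step can be sketched in a few lines of Python. The sketch assumes that the areal metric $O_X$ and the positional metric $P_X$ have already been computed and lie between 0 and 1; the function name `geometric_accuracy` and the input values are hypothetical.

```python
import math


def geometric_accuracy(o_x, p_x):
    """G_X = sqrt(O_X * P_X): geometric mean of the areal and positional metrics."""
    if not (0.0 <= o_x <= 1.0 and 0.0 <= p_x <= 1.0):
        raise ValueError("O_X and P_X are expected to lie in [0, 1]")
    return math.sqrt(o_x * p_x)


# Hypothetical over-segmented case: S covers little of R but coincides with F.
g_r = geometric_accuracy(0.25, 0.64)  # low areal overlap with R
g_f = geometric_accuracy(1.0, 1.0)    # S matches F exactly
```

In this made-up case $G_R$ is about 0.4 while $G_F$ is 1, which matches the reading that over-segmentation shows up as a low $G_R$.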

Finally, all of the $G_R$ and $G_F$ metrics are used to calculate the global metric M for the whole segmentation, which measures the strength and type of mismatch (i.e., under- or over-segmentation) between the reference dataset and the segmentation. For this, normalized distances D between the cumulative distribution functions of $G_R$ and $G_F$ are calculated by applying the non-parametric Kolmogorov-Smirnov (KS) goodness-of-fit test, which may be used to assess the difference between two distributions. Thus, two distances, $D^{+}$ and $D^{-}$, are obtained as the maximum differences between the cumulative distribution functions $f(G_R)$ and $f(G_F)$, one for each direction of the difference. M results from the difference between $D^{-}$ and $D^{+}$. M < 0 indicates under-segmentation, while M > 0 represents the opposite case of over-segmentation. Therefore, M ~ 0 is considered indicative of optimal segmentation quality (Möller et al., 2013).
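A minimal Python sketch of such a signed Kolmogorov-Smirnov comparison is given below. It is an illustration under stated assumptions rather than the implementation of Möller et al. (2013): the empirical cumulative distribution functions are evaluated at the pooled sample values, and the sign convention is chosen so that negative values indicate under-segmentation, consistent with the interpretation given in the text. The names `ecdf`, `global_metric`, `d_over`, and `d_under` are hypothetical, and how the two one-sided distances map onto the published symbols is an assumption.

```python
from bisect import bisect_right


def ecdf(sample):
    """Return the empirical cumulative distribution function of a sample."""
    ordered = sorted(sample)
    n = len(ordered)
    return lambda t: bisect_right(ordered, t) / n


def global_metric(g_r, g_f):
    """Signed KS-style mismatch between the distributions of G_R and G_F."""
    f_r, f_f = ecdf(g_r), ecdf(g_f)
    points = sorted(set(g_r) | set(g_f))
    # Excess of low G_R values: signal of over-segmentation.
    d_over = max(f_r(t) - f_f(t) for t in points)
    # Excess of low G_F values: signal of under-segmentation.
    d_under = max(f_f(t) - f_r(t) for t in points)
    # Assumed sign convention: M < 0 under-segmentation, M > 0 over-segmentation.
    return d_over - d_under


# Hypothetical per-object values for a clearly over-segmented result.
m_over = global_metric([0.2, 0.3, 0.4], [0.8, 0.9, 0.95])
m_under = global_metric([0.8, 0.9, 0.95], [0.2, 0.3, 0.4])
m_balanced = global_metric([0.5, 0.6, 0.7], [0.5, 0.6, 0.7])
```

Both one-sided distances lie between 0 and 1, so the resulting value lies between -1 and 1, with values near 0 when the two distributions are similar.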

Furthermore, the method of Möller et al. (2013) applies a filtering operation to define the set of objects S considered in the analysis. Spatial intersection operations commonly create long, narrow shapes known as sliver polygons (Mas, 2005). Sliver polygons are often not relevant to the analysis, as they may arise not from a relevant difference between the segmentation and the reference but from minor geolocation errors in one or both layers, and are therefore undesired. In the original approach proposed by Möller et al. (2013), sliver objects S are described as emerging from many-to-many relations between R and F and are discarded from all calculations, as is also done in this paper. All other (non-sliver) objects S are included in the analysis. More details are found in Möller et al. (2013).

Thematic Similarity Index (TSI)

The TSI was designed specifically in the framework of the present research to assess the thematic quality of the objects generated from a segmentation analysis according to the perspective of the specific user. The thematic quality of an object depends on three features: (a) the thematic classes the object encompasses when its borders are projected on the Earth's surface, as represented by the reference dataset, (b) the proportion of the area occupied by each of the classes within the object, and (c) the thematic similarity between those classes, which is user dependent. The TSI is calculated for each object of a segmentation as follows:

$$\mathrm{TSI} = \sum_{c=1}^{n} \sum_{d=1}^{n} P_c \, P_d \, w_{cd} \qquad (3)$$

where $n$ is the number of thematic classes within the object, $P_c$ and $P_d$ are the relative areas (proportions) occupied by classes $c$ and $d$, and $w_{cd}$ is the user-specific thematic similarity weight between the two classes.

The $n$ classes encompassed in the object and the proportions of area $P_c$ they occupy are defined by a basic spatial overlay operation between the segmentation under evaluation and the reference dataset. Thematic similarity is a more complex feature, since the weights $w_{cd}$ are provided by the user and should express their views on the relative similarity of classes. For example, if an object is under-segmented and, instead of being pure, contains two or more classes, the value of $w_{cd}$ reflects the relative severity of this error for the specific user.
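The TSI of a single object can be sketched in Python as a double sum over the classes it contains. This is an illustrative sketch only: the class names, the proportions, and the weight values are invented, and the choices of a weight of 1 between a class and itself and of a symmetric weight matrix are assumptions made here for the example.

```python
def thematic_similarity_index(proportions, weights):
    """TSI = sum over classes c and d of P_c * P_d * w_cd, for one object."""
    classes = list(proportions)
    if abs(sum(proportions.values()) - 1.0) > 1e-9:
        raise ValueError("class proportions should sum to 1")
    return sum(
        proportions[c] * proportions[d] * weights[c][d]
        for c in classes
        for d in classes
    )


# Hypothetical user weights: 1 on the diagonal, symmetric off-diagonal values.
weights = {
    "forest": {"forest": 1.0, "shrub": 0.5},
    "shrub": {"forest": 0.5, "shrub": 1.0},
}
tsi_pure = thematic_similarity_index({"forest": 1.0}, weights)
tsi_mixed = thematic_similarity_index({"forest": 0.7, "shrub": 0.3}, weights)
```

Under these assumed weights a pure object scores 1, while the mixed object scores 0.49 + 0.09 + 2 × 0.7 × 0.3 × 0.5 = 0.79. A lower weight between forest and shrub, expressing that the user considers the two classes less similar, would lower the score of the mixed object further.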

The description and quantification of thematic similarity between classes is an issue that has received considerable attention in the literature. For example, Ahlqvist and Gahegan (2005) describe methods to estimate the semantic similarity between any two classes by means of quantitative metrics. Specifically, they describe how the definition of classes and their (dis)similarity may be represented by a rough-fuzzy set approach applied to the defining characteristics that practitioners use to describe or define the classes, such as percentage of tree cover for a forest. The quantitative metrics used by the authors are "overlap" and "nearness", which are based on two common approaches to estimate similarity between concepts: the proportion of shared features (Tversky, 1977) and the psychological distance between related properties (e.g., Nosofsky, 1986). Many other metrics are available in the literature, such as those described in Bouchon-Meunier et al. (1996). More recently, discussion has been introduced in the GEOBIA community on ontologies (Arvor et al., 2013), which can be useful to assist the measurement of thematic similarity between classes. The specific approach adopted can depend on the application in hand. Critically, the TSI simply requires a pair-wise comparison between all land cover classes of interest that yields a quantitative expression of their thematic similarity. The derived values can then be summarized in a matrix (Figure 3) and used as weights $w_{cd}$. The weights that form the matrix

Figure 2. Spatial operation and features considered in Möller et al. (2013) for the calculation of geometric metrics: (a) spatial intersection S between R and F, (b) comparison between S and R, and (c) comparison between S and F. R is a reference polygon; F is an object of a segmentation under evaluation; $S = R \cap F$; $R^{*} = R \cap \overline{F}$; $F^{*} = \overline{R} \cap F$; $c_R$, $c_F$, and $c_S$ are the centroids of R, F, and S respectively; $c_{R^{*},\max}$ is the furthermost centroid of $R^{*}$ from $c_R$; $c_{F^{*},\max}$ is the furthermost centroid of $F^{*}$ from $c_F$; $v_{R,\max}$ (alternative to $c_{R^{*},\max}$) is the furthermost vertex of R from $c_R$, and $v_{F,\max}$ (alternative to $c_{F^{*},\max}$) is the furthermost vertex of F from $c_F$.
