Image Anal Stereol 2007;26:13-22 Original Research Paper
CHARACTERIZATION OF MAMMARY GLAND TISSUE USING JOINT ESTIMATORS OF MINKOWSKI FUNCTIONALS
Torsten Mattfeldt1, Daniel Meschenmoser2, Ursa Pantle2 and Volker Schmidt2
1Institute of Pathology, University of Ulm, 89081 Ulm, Germany, 2Institute of Stochastics, University of Ulm,
89081 Ulm, Germany
e-mail: torsten.mattfeldt@medizin.uni-ulm.de, daniel.meschenmoser@uni-ulm.de, ursa.pantle@uni-ulm.de,
volker.schmidt@uni-ulm.de
(Accepted March 13, 2007)
ABSTRACT
A theoretical approach to estimate the Minkowski functionals, i.e., area fraction, specific boundary length and specific Euler number in 2D, and their asymptotic covariance matrix proposed by Spodarev and Schmidt (2005) and Pantle et al. (2006a;b) is applied to real image data. These two-dimensional images show mammary gland tissue and should be classified automatically as tumor-free or mammary cancer, respectively. The estimation procedure is illustrated step-by-step and the calculations are described in detail. To reduce dependencies from chosen parameters, a least-squares approach is considered as recommended by Klenk et al. (2006). Emphasis is placed on the detailed description of the estimation procedure and the application of the theory to real image data.
Keywords: asymptotic covariance matrix, breast cancer, mammary carcinoma, mammary gland tissue, Minkowski functionals, random closed set, specific intrinsic volumes.
INTRODUCTION
Breast cancer is the most frequent malignant tumor in women. In routine diagnostics, it is usual to perform a histopathological grading, which is based on a three-tiered scheme with grades I, II, and III (Ellston and Ellis, 1991; Mattfeldt et al., 2004). As the reproducibility of tumor grading is unknown for individual cases, many attempts have been made to arrive at an objective and quantitative grading of tumor structure. Let us consider here the tumor texture, which reflects the degree of differentiation of the tumor. The tissue may be conceived as a random set with different phases, which all possess a positive volume fraction. This means consideration of the tumor tissue as a volume process (Mattfeldt and Fleischer, 2005; Mattfeldt et al., 2006). It consists of three phases: tumor cells, stroma and lumina, which altogether account for 100% of the tissue.
In diagnostic pathology we deal with histological sections, i.e., very thin slices, onto which windows, usually of rectangular or quadratic shape, are placed for evaluation under microscopical view. Hence we are faced in practice with random closed sets in 2D, which may be quantified in terms of the three Minkowski functionals: AA, the mean area of the interesting phase per unit reference area (area fraction); BA, the mean boundary length of the interesting phase per unit reference area; and ?A, the mean Euler number of the interesting phase per unit reference area. Notably
all these quantities have a stereological interpretation, hence they can be used for the estimation of stereological model parameters:
VV   =   AA                            (1)
4 SV   =      BA                         (2)
p
MV   =   2p?A                        (3)
whereVV is the volume fraction, SV is the mean surface area per unit reference volume, and MV is the mean curvature density (Stoyan et al., 1995). Eqs. 1–3 are all fundamental stereological formulae and hold for random closed sets under the conditions of isotropy and stationarity for arbitrary sections. Recently a new approach has been developed which allows a joint estimation of all three Minkowski functionals for a given image (Schmidt and Spodarev, 2005; Spodarev and Schmidt, 2005). It provides not only point estimates of VV, SV, and MV , but also estimates of their asymptotic variances and covariances. Up to now the aforementioned estimator has only been applied to simulated images, but not yet to real image material. As a first application to real images in a simple situation, we decided to compare mammary cancer tissue to normal (tumor-free) mammary tissue, see also earlier publications of our group (Mattfeldt et al., 1993; 1996; 2000; Mattfeldt and Stoyan, 2000).
The paper is organized as follows. In Section ‘Mathematical Methods’ the notation used throughout
13
Mattfeldt T et al: Characterization of mammary gland tissue
the paper is introduced. The specific intrinsic volumes are defined by means of Steiner’s formula. The method given in Spodarev and Schmidt (2005) and Pantle et al. (2006a;b) to estimate these quantities and their asymptotic covariance matrix is described where some technical details are omitted. Section ‘Application to image data’ deals with the estimation of specific intrinsic volumes from real image data showing mammary tissue. The procedure for the two-dimensional case is described in detail. With the estimated quantities a statistical test is considered to classify an image as tumor-free or as mammary cancer, respectively. The paper ends with a discussion and an outlook to further projects.
MATHEMATICAL METHODS
First we introduce some notation. For some fixed d > 2, denote the family of convex bodies, i.e., compact convex sets, in Rd by K and let R = {K a Rd :K = U?=1K,-, Ki G K, n G N} be the convex ring, i.e., the family of all polyconvex sets in Rd. By S = {M c Rd : M n K G R VK G K} we denote the extended convex ring. Then it holds KcRcS. Let Br(x) be the closed ball in Rd with radius r > 0 centered at x and let o G Rd be the origin. Further, kj denotes the volume of the j-dimensional unit ball for j = 0,...,d. For two sets A,B cRd the Minkowski sum A © B and the Minkowski difference A Q B are definedby A(BB = {a + b: a eA,b e B}and AGB = {xeRd:B+xcA}, respectively, where B = {x G Rd : -x G B} denotes the set B reflected at the origin.
For convex bodies K G K it can be proven that there exist d+ 1 functionals Vj : K -? [0,°°) for j = 0,... ,d, such that the volume of the so called parallel body K® Br(o) for r > 0 is given by Steiner’s formula
\K®Br(o)\

(4)
A proof of this formula can be found, e.g., in chapter 2 of Schneider and Weil, 1992. The functionals Vj are called intrinsic volumes. They are related to the Minkowski functionals Wj : K ->¦ [0,°°) in the
following way: Wj(K) = kj/(d)Vd-j(K) for all j = 0,...,d. The intrinsic volumes are not restricted to convex bodies. There is a unique additive extension to the convex ring R given by the inclusion-exclusion-formula. For any polyconvex set K G R, any n G N and any convex bodies Kh...,KneK with K = K1U ...UKn it holds
VJ (K) = Ê (-1 f-l       L      VJ (Kh n • • •n Kk)
k=\                     l<ï'i<...<k<n
(5)
for j = 0,...,d. Notice that the value of Vj(K) does not depend on the particular representation of K as the union of convex sets K{. The proof of existence and uniqueness of this extension can be found in Schneider, 1993. The formula itself can be shown by induction using the fact that the intrinsic volumes are additive, i.e., Vy-(0) = 0 and for all KhK2 G K with K1UK2eKitholdsVj(K1UK2)=Vj(K1) + Vj(K2)-VjiKx f]K2). Some of the intrinsic volumes have a nice geometric interpretation: Vd(K) is the usual volume of K, dVd-i (K) is the surface area of K and V0(K) is the Euler-Poincare´ characteristic of K.
In the following let S be a stationary random closed set in Rd with values in the extended convex ring S almost surely. Let {Wn} be a monotonically increasing sequence of compact convex observation windows
Wn = nW                            (6)
with
W£K,\W\>0ando£int(W).
(7)
Under   appropriate   assumptions,   the   expectation EVy-(S n Wn) is well defined and the limit
Vj (S)
lim
EV,-(SnW„)
(8)
exists for all j = 0,...,d, see, e.g., Schneider and Weil, 2000. The functionals Vj(Z) are called specific intrinsic volumes of S. In the two-dimensional case they are well-known under the notation AA = V2, BA = 2V i and %A = V0 and are connected to the stereological model parameters Vv, Sv and Mv by Eqs. 1-3.
To estimate the specific intrinsic volumes from a binary image we use a method developed in Spodarev and Schmidt (2005). It makes use of the local Euler-Poincare´ characteristic, which is defined as the expected Euler number of S in a neighborhood of a point x, i.e., as EV0(SnBr(x)). It can be shown that for any r > 0 and for any x G Rd it holds
d                       _
EVo (S nBr(x)) = £ rd-jkd-jVj (S).        (9) j=o
A proof of Eq. 9 can be found in chapter 5 of Schneider and Weil (2000), see also Spodarev and Schmidt (2005). Since this formula holds for any r > 0 we can plug in d + 1 pairwise different radii 0 < r0 < ... <rd, where we have to take care that the edge-corrected observation window W G Brj (o) has positive volume for j = 0,..., d. Since the radii are numbered in ascending order this holds if | W G Bd (o) | > 0. From
_
_
14
Image Anal Stereol 2007;26:13-22
Eq. 9 we get the following system of d + 1 linear equations
Av=y,                            (10)
where
A
( r$kd   rÌ~xkd-\    ¦¦¦    rok     1\
r,kd    ri     kd—\    ' ' '    r k      1
. .
rdkd
4-1
kd-i
(11)
rdk    1
v = (Vo(S),...,Vd(S))T and y = (EV0(S n Bro(0)),...,EVo(ZnBrd(0)))T. The matrix A is regular because the radii r0,...,rd are pairwise different and it can be computed without problems. With an appropriate estimator y of y we now get an estimator v of v by
v = A ly.
(12)
Since the local Euler characteristic, i.e., the vector y, can be estimated from one single image we also can estimate the vector of specific intrinsic volumes, i.e., the vector v, from one single image. To estimate the vector y of local Euler characteristics for different radii we consider the stationary random field Y j =
{Yj(x),xeRd}with
Yj(x) = Vo(ZnBrj(x))
(13)
for j = 0,..., d. The stationary of Y j follows directly from the stationary of S. An unbiased estimator y j of yy- = EY,-(o)isgivenby
9j=             Yj(x)ß{dx),
WQBrd(o)
(14)
where ß is an arbitrary probability measure concentrated on the reduced observation window W e Brd (o) to avoid edge effects. If ß is the normalized Lebesgue measure, i.e.,
m(-)
\-nW e Brd (o)\
\WQBrd{o) the estimator y j is given by
(15)
yj
1
\WQBrd(o)[
f     Yj{x)dx.
(16)
WOBrd(o)
To study the variance of the estimator v of the specific intrinsic volumes we consider a sequence of observation windows {Wn}, which satisfies condiditions Eq. 6 and Eq. 7. For each j =
0,..., d and for each n G N we can estimate y j on W„ by
ynj
1
\WnGBrd(o)\
f     Yj{x)dx.
(17)
WnOBrd(o)
Under appropriate assumptions, the covariances
Covl7(x)=Cov(Y(o),Y/-(x))
(18)
exist for all i,j = 0,...,d, see, e.g., Schmidt and Spodarev (2005). If the covariances are absolutely integrable and some further assumptions are fulfilled (cf. Pantle et al, 2006a), then the random vector  ^/\WÇ\(y0tn - yQ,...,%,n ~ yd)T  is
asymptotically   normally   distributed   with vector o and covariance matrix S =
mean with
°ij = kdCovij(x)dx. Therefore the random vector VW(vo,n - vo, • • •, vd,n - vd)J is also asymptotically normally distributed with zero mean vector and covariance matrix £y = A-1r(A-1)T.
The values of these estimators v) of the specific intrinsic volumes depend heavily on the choice of the radii r0,...,rd, see also the discussion in Klenk et al. (2006). To reduce this dependence a least-squares approach is considered. Let 0 < r0 < ... < rk^ be k > d +1 pairwise different radii. Similar to Eq. 10 we get a system of k linear equations with the difference that the vector y is k-dimensional and that A is not a squared matrix any more because it has k rows and d+ 1 columns. Anyhow, the minimization problem
\y-Av*
 min \y-Ax\
has a unique solution given by
ATy.
(19)
(20)
The estimator v* of the vector of specific intrinsic volumes does not depend on the choice of radii as much as the estimator v in Eq. 12. Furthermore, the random vector ^/WÄv n-vo,---,v*dn- vd)T is asymptotically normally distributed with'zero mean vector and covariance matrix £v* = A£AT where A = (ATA)-1AT.
The asymptotic covariance matrix £ can be estimated from the observation of the stationary random fields Y j defined in Eq. 13. For each n G N let W„o,...,W„£_i C W„. Let {U„ij} be a monotonously increasing sequence of bounded sets with Unij c Wni © W ?nj and \Unij\ > 0 for all n G N and for all i, j = 0,... ,k - 1. Additionally let lim,,^«, U„;; =
_
_
_
15
Mattfeldt T et al: Characterization of mammary gland tissue
supp(Covi j) and let the sets Uni j grow smaller in comparison to Wni j, i.e.,
lim [ nij,
nÌ   |W
0.
(21)
ni j
Furthermore, assume that
minxeUni j \Wnij n (Wni j + x) |
lim     Uni j ni j (W      = 1.   (22)
n"
W
ni j
For each n G N and i, j = 0,... ,k- 1 we consider the estimator
(yni j =
t
Unij
Covni j (x)
\Wnijn(Wnij-x)\
dx      (23)
of sij with
Covni j(x
 \Wnijn(Wnij-x)\dy~yi'nyj'n.
Wnijn(Wnij-x)1                         '
(24) This sequence Žn = (c7ni j) of estimators of £ is asymptotically unbiased, i.e., lim^o, ||E£n -£|| = 0, see, e.g., Schmidt and Spodarev (2005); Pantle et al.
(2006b), where ||£|| = Jj^^oi 2j denotes the matrix
norm. Under additional integrability conditions, Zn is L2 consistent for £, i.e., it holds lim^«^!^ - £||2 = 0. If S is the Boolean model with primary grain M0 for example, the integrability conditions are fulfilled if E|M0 ®Brd |2 < oo. From the estimators %n described in
Eq. 23 and Eq. 24 we get a sequence of estimators Žn of the asymptotic covariance matrix Zv* by
=
n
AZ„ATwithA= (ata)_1 At.       (25)
APPLICATION TO IMAGE DATA
Now we are ready to apply the statistical approach explained in the previous section to estimate the specific intrinsic volumes of real image data showing mammary tissue. There were ten cases of ductal mammary cancer tissue and ten cases showing normal, i.e., cancer-free, mammary tissue. From each case, a sample of 3 x 3 = 9 contiguous quadratic images was evaluated, where the first image was selected at random. Each image had a size of 510 x 510 pixels. The concatenation led to a large quadratic image with 1530 x 1530 pixels, which is needed for the estimation of the asymptotic variances. This means that the final observation   window   is   given   by   the   rectangle
(a) Original image of tumour-free mammary tissue.
(b) Segmentation of Fig. 1(a) leads to this image, which contains three phases: white—epithelial cells, gray—lumen, black—stroma.
Fig. 1. Tumour-free mammary tissue. Haematoxylin-Eosin stain (a) and segmented image (b), respectively. The edgelength of the quadrat corresponds to 0.4 mm at the scale of the specimen at this magnification.
[0,1529] x [0,1529]. Fig. 1a shows tumour-free mammary tissue and Fig. 2a shows invasive ductal mammary carcinoma. The edgelength of 510 pixels corresponds to 0.4 mm at the scale of the specimen at this magnification, i.e., Figs. 1a and 2a show only one of the nine contiguous images. The same images are shown in Figs. 1b and 2b, respectively, after interactive segmentation of stroma, epithelium and lumina. The
16
Image Anal Stereol 2007;26:13-22
stroma is represented by the black phase, the grey phase forms the lumina and the white phase stands for the epithelium without the lumina, i.e., the tumor cells.
(a) Original image of invasive ductal mammary carcinoma.
(b) Segmentation of Fig. 2a.
Fig. 2. Invasive ductal mammary carcinoma. In (a), stain and magnification identical with Fig. 1a. In (b), segmentation identical with Fig. 1b.
These three phase images were converted into three binary images by combining two phases. The foreground of the resulting images may consist of one of the three grey phases (e.g., the white phase) or of the union of two of the three phases (e.g., white and grey). We understand these binary images as realizations of stationary and isotropic random closed sets (cf.
Mattfeldt and Stoyan, 2000b). Since these images are two-dimensional the specific intrinsic volumesV0,2V1 and V2 represent the mean Euler number per unit area, the mean boundary length per unit area, and the area fraction, respectively.
In the following, the procedure to calculate the least-squares estimator given in Eq. 20 is described step by step.
1.
2.
Choose the number k > 3 and the values of the radii r0 < ... < rk_x. We put k = 15 and ri = 4.2+ 1.3i, i = 0,...,14, following the recommendation in Klenk et al. (2006). In this particular case, the matrix A defined in Eq. 11 is given by
A
/    17.647T     8.4    1 \ 30.25 n   11.0    1
445.21 %   42.2    1  501.76 TT   44.8    1
(26)
Estimate the local Euler characteristic in the reduced observation window W G Br14(o) = [23,1506] x [23,1506] for all radii r0,...,r14 by computing V0(ZnBri(x)) for all pixels x G We Bru(o) and averaging over all pixels. That means the estimator y from equation Eq. 16 is given in discretizedformby
yj
1
1484*
£         Vo(ZnBrj(x)). (27)
x=(x1,x2) x 1,x2e{23,...,1506}
An algorithm to estimate the local Euler characteristic for all radii simultaneously is given in Klenk et al. (2006).
3.   Now, the estimation of the specific intrinsic volumes is straightforward by computing the least-squares estimator v* = (ATA)-1ATy.
4.    To estimate the asymptotic covariance matrix the theory says we need an unboundedly increasing sequence of observation windows, cf. Eq. 6. In practice we have only one concatenated quadratic image with fixed size, and we assume it is large enough. So we choose the averaging set U = B3oo(o) and estimate the variances and covariances using a discretized version of the estimator given in Eq. 23, Eq. 24 and Eq. 25 where the integrals are replaced by sums.
In the present study, the application of the joint estimator described above to the white phase of the images led to the following mean values for the stereological model parameters Eqs. 1-3.
_
17
Mattfeldt T et al: Characterization of mammary gland tissue
Table 1. Means of the estimated values and the means of their asymptotic standard deviations. The latter means are not equal to the usual standard deviations between the cases within the groups.
normal group    carcinoma group
Vv                0.15                   0.43
Std(Vy)       0.037                 0.047
SV               24.01 mm-1      40.85 mm-1
Std(Sv)         2.68 mm-1        1.97 mm-1
My            106.91mm-2    297.91mm-2
Std(My)     122.07 mm-2     180.93 mm-2
In Table 1, the terms Std(Vy), Std(Sv) and Std(My) denote the means of the asymptotic standard deviations of the white phase per concatenated large image. Thus, they are not identical with the standard deviations of these model parameters ‘between images within cases’ and also not identical with the ordinary standard deviations of Vv, Sv and Mv ‘between cases within groups’ . The latter may be computed using standard statistical formulae even with a table calculator; however, this does not apply to the asymptotic standard deviations. In addition, the co variances between Vv, Sv and Mv were computed, but these are not reiterated here. In order to see which parameter discriminated best between the groups, the results were visualized graphically. For example, Figs. 3 and 4 show the estimated area fraction of the white phase and the estimated mean curvature density of the white phase per unit area, respectively.
0.8-i---------------------------------------------------1
0.7-----------*------------------------------
0.6------------------------------------------
0.5-----------±------------------------------
0.4-----------*------------------------------
0.3-----------*------------------------------
0.2------------------------------------------
0.1--------------------------------*---------
0.0-l---------------------------------------------
carcinoma       carcinoma-free Fig. 3. Estimated area fraction of the white phase.
From Fig. 4 one can see that it is not possible to base the decision whether an image shows mammary carcinoma or not only on the mean curvature density. Also a statistical test of the mean surface area per unit volume alone does not lead to a sufficient discrimination of the groups. However, the area fraction of the white phase (Figure 3) seems to be a better parameter to categorize the images into two groups. Since this paper is focused on explaining the theory and the algorithm of the new estimation approach we will only test the area fraction which yields acceptable results. Anyhow, it is clear that the two groups in Fig. 4 belong to two different settings. Therefore one could combine, e.g., area fraction and Euler number and consider vectorial tests to strengthen the results of a one-dimensional test. With the approach described in the last section this is possible and will be done in a further paper.
1200 -i-----------------------------------------------1
1000------------?----------------------------------
800------------------------------------------------
600------------------------------------*----------
400------------j----------------------------------
200------------?----------------------*----------
0------------------------------------*----------
-200------------------------------------------------
-400------------*----------------------------------
-600 -I------------------------------------------------
carcinoma       carcinoma-free
Fig. 4. Estimated mean curvature density Mv (in mm-2) of the white phase per unit area.
In the following, we write just ‘area fraction’ and omit the words ‘white phase’ for convenience. From the last section we know that the least-squares estimator is asymptotically normally distributed so we can construct a statistical test for the area fraction. Since the images have fixed size we don’t have an unboundedly increasing sequence of observation windows W„. Instead we claim that the estimators are approximately Gaussian because our images are large enough. The null hypothesis states that ‘the expected
18
Image Anal Stereol 2007;26:13-22
area fraction in image j corresponds to the mean area fraction in images showing normal mammary tissue,’ where the images 1-10 show normal mammary tissue and the images 11-20 show mammary carcinoma tissue. So if we write pj for the expected area fraction in image j and p0 denotes the mean area fraction in images showing normal mammary tissue, the null hypothesis reads as H0 : p j = p0. As we can see from Fig. 3, the estimated values of the area fraction in images showing mammary carcinoma tissue are greater than in images showing tumor-free mammary tissue. That’s why we consider the one-sided alternative hypothesis H1 : p j > p0. The significance level is a = 5%. The unknown expected area fraction p j in image j is estimated by the least-squares estimator given in Eq. 20 or, to be precise, by its third entry, and denoted by pj. When calculating the mean area fraction p0 one has to distinguish, if the image whose area fraction we want to test shows mammary cancer or normal tissue. In the first case we define p0 as the arithmetic mean of the estimated area fractions of the ten images showing tumor-free tissue, i.e.,
p 0 = 10 y\ pi ¦                      (28)
But if the considered image shows tumor-free tissue, we have to exclude it from the calculation of the mean, so we define p0 by
p0
10
91
(29)
Now we can calculate the test statistic
Tj = VW\
p j — p0
(30)
in image j, where a2 j denotes the estimated variance
of /\W\(pj - p0), see equations Eq. 23 and Eq. 24 and subsequent lines. In fact, it holds that y^W] = 1530 in the considered case, because all images are squares with sidelength 1530 pixel. Slutsky’s theorem yields that Tj is approximately standard Gaussian. With the 95 %-quantile z0.95 = 1 -64 the critical range is (1.64, oo), so the null hypothesis for image j is rejected if the value Tj is greater than 1.64. The results are shown in Table 2.
Table 2. Results of the test H0 : pj = p0 for images showing tumor-free mammary tissue (j = 1,...,10) and for images showing mammary cancer tissue (j = 11,...,20).
j	p j	Rejection of H0
1	0.047224	no
2	0.094392	no
3	0.138055	no
4	0.145576	no
5	0.155753	no
6	0.159763	no
7	0.163755	no
8	0.163812	no
9	0.217735	no
10	0.232209	no
11	0.277096	yes
12	0.297491	yes
13	0.342711	yes
14	0.342963	yes
15	0.418210	yes
16	0.423122	yes
17	0.442027	yes
18	0.520348	yes
19	0.570691	yes
20	0.693262	yes
The null hypothesis is not rejected for the images showing normal mammary tissue, but it is rejected for all ten images showing mammary carcinoma tissue, which means they are classified correctly.
Of course, we can, in a certain sense, exchange null and alternative hypothesis and test the null hypothesis that ‘the expected area fraction in image j corresponds to the mean area fraction in images showing mammary carcinoma tissue’ or, shortly, H0 : pj = p0. Here, p0 denotes the mean area fraction of images showing mammary cancer tissue and as above there are two definitions for p0 depending on what type of tissue the considered image shows, cf. Eq. 28 and Eq. 29. Again, the alternative is one-sided H1 : p j < p0 and the test statistic is defined by
Tj = rfW\n^
p j - p 0
(31)
The results of this test are shown in Table 3.
_
19
Mattfeldt T et al: Characterization of mammary gland tissue
Table 3. Results of the test H0 : p j = p0 for images showing tumor-free tissue (j = 1,..., 10) and for images showing mammary cancer tissue (j = 11,...,20).
j	pj	Rejection of H0
1	0.047224	yes
2	0.094392	yes
3	0.138055	yes
4	0.145576	yes
5	0.155753	yes
6	0.159763	yes
7	0.163755	yes
8	0.163812	yes
9	0.217735	yes
10	0.232209	yes
11	0.277096	yes
12	0.297491	yes
13	0.342711	yes
14	0.342963	no
15	0.418210	no
16	0.423122	no
17	0.442027	no
18	0.520348	no
19	0.570691	no
20	0.693262	no
Table 3 shows that the null hypothesis is rejected for all images showing normal tissue. But unfortunately there are three images out of the ten showing mammary carcinoma tissue for which the null hypothesis is rejected although it should not be.
DISCUSSION
Eqs. 1–3 are well-known fundamental stereological formulae, valid under the conditions of isotropy and stationarity in a model-based approach, and under the condition of IUR sampling from arbitrary structures in a design-based approach. However, there is a practical difference: the estimators of the model parameters in Eqs. 1 and 2 are very easy to implement with an image analyzer, but the estimation of the Euler number is non-elementary even in 2D. While estimation of VV and SV is already taught in basic courses on stereology, this does not apply for the Euler number. Nevertheless the Euler number is of interest for a quantitative characterization of carcinoma tissue of glandular origin, because the Euler
number is directly linked to fundamental pathological tumor properties such as solid architecture where ideally ? > 0, tubular architecture where ideally ? = 0, and cribriform architecture where ideally ? < 0. These textures may arise in all types of adenocarcinomas. Sometimes a cribriform texture can be found in mammary carcinomas; in practice it is most important to recognize this texture component in prostatic carcinomas, where it is known to be associated with a poorer prognosis as compared to tubular differentiation. Furthermore, the usual stereological approach, even if it encompasses the Euler number, leads merely to point estimates of the model parameters, but does not provide an insight into the covariance matrix, i.e., the asymptotic variances and covariances of VV, SV, and MV of the three phases remain unknown. Point estimates for VV and SV of the epithelial phase of tumour-free mammary tissue and mammary carcinomas obtained from conventional stereological methods were previously published (Mattfeldt et al., 2000, see Table 1 therein). The results were very similar to the present study. This shows the good reproducibility (robustness) of the method, in which the images were segmented interactively. Fully automatic segmentation of mammary tissue into epithelium, lumen, and tumour cells by image analysis would be desirable, but this aim is difficult to achieve at the moment. The plausibility of the results was checked by using a theorem of Tomkeieff, which states that the mean length of intercepts through particle profiles, l1, is related to the Minkowski functionals of the particles by the equation l1 = 4VV/SV (see Baddeley and Jensen, 2005, p. 33). According to this relation, one would expect values for l1 ˜ 0.025 mm in the tumour-free group and l1 ˜ 0.042 mm in the carcinoma group, which roughly corresponds to the visual impression, see Figs. 1b and 2b. In contrast to many other methods to estimate the specific intrinsic volumes the approach given in Spodarev and Schmidt (2005) yields not only the estimates, but also the (asymptotic) variances and covariances. All values can be computed from one large concatenated image. The classical approach to estimate the sample variance in each group is inappropriate here because we want to test each image separately. This might be interesting if there is only one image available in a practical application, which may be divided into subwindows.
In the special case of mammary tissue it turned out that the area fraction of the white phase, i.e., the part of the image that shows epithelial cells, is a good criterion to detect if an image shows mammary cancer tissue or not. The area fraction has the useful property of being independent of the magnification of the image. Although this was not important in our study, it may be relevant for
20
Image Anal Stereol 2007;26:13-22
the analysis of images where the scaling factor is unknown. A practical example for this situation has emerged more and more in the last decade. There are published datasets available in the internet consisting of images of various tumor types, e.g., also mammary and prostatic carcinomas, where expert groups have performed the histopathological grading for reference purposes. Usually, these images are given without any information on the final magnification, and often it will not be possible to retrieve the magnification factor any more. It would be attractive to perform a quantitative meta-analysis of these images by means of spatial statistics. Due to the aforementioned reasons, one will then be restricted to methods which are independent of the magnification. This holds for the VV component of the joint estimator described here. In contrast to the usual routine method, e.g., point counting, our method will also provide an estimate of the asymptotic variance, and thus yield valuable additional information. As one can conclude from Tables 2–3, the test on tumor-free tissue yields better results than the test on mammary carcinoma tissue.
The most common tumor types of the female breast are invasive ductal and lobular carcinoma. These designations indicate a tumor differentiation more similar to the ducts or to the lobules of the mammary parenchyma, respectively. For both types of breast cancers, an attempt is made towards grading of malignancy in routine diagnostics. This is important for prognosis prediction and therapy planning. It is intended to apply our method for the characterization of mammary carcinomas of different degrees of malignancy, and eventually to use it for the prediction of the grade of malignancy from spatial data, i.e., for the purpose of pattern recognition. Also it will be interesting to differentiate by this technique between ductal and lobular mammary carcinomas, which may be difficult in some cases (Mattfeldt and Fleischer, 2005). However, before these two more ambitious projects are put into practice, we thought it advisable to implement the methodology first in a simpler setting comparing tumor-free tissue to carcinoma tissue, where the differences between the classes of specimens are more pronounced. For the advanced applications, it may become useful to consider vectorial tests. With the described method it is possible to characterize the tissue high-dimensionally. If only one phase is considered, one obtains 9 instead of 3 numerical values per image (the three point estimates, the three asymptotic variances and the three asymptotic covariances of the Minkowski functionals). This rises to a whole bunch of characteristics if also the other two phases are taken into account. This will be subject of a further paper.
ACKNOWLEDGMENTS
The authors thank an anonymous reviewer of IAS for constructive comments, which led to substantial improvements in the revised edition.
REFERENCES
Baddeley A, Vedel-Jensen EB (2005). Stereology for Statisticians. Boca Raton: Chapman & Hall.
Elston CW, Ellis IO (1991). Pathological prognostic factors in breast cancer. I. The value of histological grade in breast cancer: Experience from a large study with long-term follow up. Histopathol 19:403-10.
Klenk S, Schmidt V, Spodarev E (2006). A new algorithmic approach to the computation of Minkowski functionals of polyconvex sets. Computat Geom Theory Appl 34:127-48.
Mattfeldt T, Frey H, Rose C (1993). Second-order stereology of benign and malignant alterations of the human mammary gland. J Microsc 171:143-51.
Mattfeldt T, Schmidt V, Reepschla¨ger D, Rose C, Frey H (1996). Centred contact density functions for the statistical analysis of random sets. J Microsc 183:158-69.
Mattfeldt T, Gottfried H-W, Schmidt V, Kestler HA (2000). Classification of spatial textures in benign and cancerous glandular tissues by stereology and stochastic geometry using artificial neural networks. J Microsc 198:143-58.
Mattfeldt T, Stoyan D (2000). Improved estimation of the pair correlation function of random sets. J Microsc 200:158-73.
Mattfeldt T, Kestler HA, Sinn HP (2004). Prediction of the axillary lymph node status in mammary cancer on the basis of clinicopathological data and flow cytometry. Med Biol Engin Comput 42:733-9.
Mattfeldt T, Fleischer F (2005). Bootstrap methods for statistical inference from stereological estimates of volume fraction. J Microsc 218:160-70.
Mattfeldt T, Eckel S, Fleischer F, Schmidt V (2006). Statistical analysis of reduced pair correlation functions of capillaries in the prostate gland. J Microsc 223:107-19.
Molchanov I (1997). Statistics of the Boolean Model for Practitioners and Mathematicians. Chichester: John Wiley & Sons.
Pantle U, Schmidt V, Spodarev E (2006a). Central limit theorems for functionals of stationary germ-grain models. Adv Appl Probab 38:76-94.
21
Mattfeldt T et al: Characterization of mammary gland tissue
Pantle U, Schmidt V, Spodarev E (2006b). On the estimation of the integrated covariance function of stationary random fields. Preprint (submitted).
Schmidt V, Spodarev E (2005). Joint estimators for the specific intrinsic volumes of stationary random sets. Stochast Processes Appl 115:959-81.
Schneider R (1993). Convex Bodies, The Brunn-Minkowski Theory. Encyclopedia of Mathematics and its Applications, 44. Cambridge: Cambridge University Press.
Schneider R, Weil W (1992). Integralgeometrie. Stuttgart: Teubner.
Schneider R, Weil W (2000). Stochastische Geometrie. Stuttgart: Teubner.
Spodarev E, Schmidt V (2005). On the local connectivity number of stationary random closed sets. In: C. Ronse, L. Najman, E. Decenciere, Mathematical Morphology: 40 Years On. Proceedings of the 7th International Symposium on Mathematical Morphology, April 18-20, 2005. Springer Series: Computational Imaging and Vision, 343-54.
Stoyan D, Kendall WS, Mecke J (1995). Stochastic Geometry and Its Applications. 2nd Ed. Chichester: John Wiley & Sons.
22