1. Introduction
From the geometrical viewpoint, a parametric statistical model can be considered a differentiable manifold, and the parameter space can be regarded as a coordinate system of the manifold [
1,
2]. Let
be a parametric statistical model with respect to some
-finite reference measure
, where
is a real
k-dimensional parameter vector belonging to some open subset
of the
k-dimensional real space
. For simplicity, a random variable
X and its observed value
x are uniformly denoted by
x in this paper.
When the density function
is sufficiently smooth in
and it is differentiable as a function of
, it is natural to introduce the structure of an
k-dimensional manifold in the statistical model
, where
plays the role of a coordinate system. The geometrical quantities, such as connection, divergence, flatness, curvature and tangent space, play a fundamental role in the statistical inference and asymptotic theory (see, for example, Komaki [
3,
4] and Harsha and Moosath [
5]).
In reliability engineering, a life testing experiment is one of the effective ways to obtain reliability information of a product. To save time and reduce the cost of a life testing experiment, censoring methodologies are often applied so that the experiment is terminated before all the items on the life-testing experiment fail. Some commonly used censoring schemes include the Type-I and Type-II censoring schemes, where the life-testing experiment will be terminated at a prefixed time point and the life-testing experiment will be terminated as soon as the
m-th (
m is prefixed) failure is observed, respectively. In other words, the experimental time is prefixed for Type-I censoring scheme and the number of observed failures is prefixed for the Type-II censoring scheme (see, for example, Ng [
6]). The Type-I and Type-II censoring schemes have been generalized to a more complicated and flexible censoring scheme such as progressive censoring schemes [
7,
8,
9] and hybrid censoring schemes [
10,
11]. For progressive Type-II censoring schemes, the conventional Type-II censoring scheme is extended to situations wherein censoring occurs in multiple stages. A progressive Type-II censored life-testing experiment will be carried out in the following manner. Suppose
n items are placed on a life-testing experiment and we assume that these
n items have lifetimes following distribution with density function
. It is planned that
m failures will be observed and
items are randomly removed (i.e., censored) from the experiment at the time of the
r-th failure. More specifically, at the time of the first failure (denoted by
),
randomly selected items from the
surviving items are removed from the life testing experiment; then, the experiment continues and at the time of the second failure (denoted by
),
randomly selected items from the
surviving items are removed from the experiment, and so on; finally, at the time of the
m-th item failure (denoted by
), the experiment terminates and all the remaining
surviving items are censored. Here,
is the progressive Type-II censoring scheme for the life testing experiment with
. Note that, when
,
, the progressive Type-II censoring scheme reduces to the conventional Type-II censoring scheme.
Since the comprehensive studies of information geometry by Amari [
1], information geometry has been productively used in different research fields including statistical learning, machine learning, neural networks, signal processing, information theory and so on (see, for example, Amari et al. [
2] and Amari [
12].) The information geometry methods are also widely used in statistics and reliability engineering. For example, Zhang et al. [
13] discussed the Amari-Chentsov structure on the accelerated life test model with applications to optimal designs with different optimal criteria. The methods of information geometry are also employed to investigate the Bayesian prediction by taking
-divergences as loss functions [
14]. In degradation modeling, a robust parameter estimation method was proposed in [
15] by minimizing the
f-divergence between the true model and suggested models.
In this paper, we investigate the tangent space, affine connection, -connection, torsion and Riemann-Christoffel curvature of the manifold of the exponential family of distributions with progressive Type-II censoring scheme. These geometric quantities can be applied to different areas in statistics such as Bayesian analysis. Note that one of the challenges of Bayesian analysis is to calculate the integrals for obtaining the posterior distribution, especially when the number of parameters is large. Instead of using numerical methods to approximate those integrals, the geometric quantities developed in this paper can provide an efficient theoretical method to approximate those integrals involved in Bayesian prediction. The main contributions and the organization of this paper are described as follows:
Asymptotic theory plays an important role in statistical inference, which consider the properties of statistical procedures as the sample size increases. Geometrically, an approximation to a manifold is a local linearization by the tangent space. Thus, the tangent space of the manifold of the exponential family of distributions with progressively Type-II censored data is discussed in
Section 2.
The local linearization accounts only for local properties of a statistical model. It is necessary to investigate the Fisher metric tensors, affine connection, and
-connection of the manifold in order to study the global or large-scale properties of the model. Therefore, these important geometric quantities are studied in
Section 3.
As an application of the geometric quantities, the asymptotic expansions of the posterior density and the posterior Bayesian predictive density of the model are provided in
Section 4.
To illustrate the results presented in this paper, the Rayleigh distribution is considered as an example in
Section 5. Moreover, Monte Carlo simulation results and a real data analysis are presented in
Section 6 to illustrate the main results.
2. The Statistical Model and Tangent Space
In this paper, we adopt the Einstein summation convention, that is, if an index occurs both as a superscript and as a subscript in a single expression, then the summation over that index is implied. For a density function
, let
, we introduce the following definitions (see [
1,
2] for more details):
: the Fisher metric tensors, the inverse of is denoted by , where ;
: the skewness tensor;
: the affine connection;
: the -connection.
The -connection and 1-connection are said to be the m-connection and e-connection, denoted by and , respectively. We also abbreviate some geometric terms by multiplication the metric tensors, i.e., , , .
Suppose that
is an exponential family of distributions (see, for example, Barndorff-Nielsen [
16]) with density function
and reliability function
where
is the number of functions of the parameter vector
,
is the cumulative distribution function, and
is the cumulant generating function defined as
with
and
are smooth functions of the parameter vector
, and
and
are smooth functions of the random variable
x. Here are two examples, the exponential and the Rayleigh distributions, of the members in the exponential family of distributions:
Exponential distribution with density function
and reliability function
we have
and the functions
and
,
and
. The dimension of the parameter vector
is
.
Rayleigh distribution with density function
and reliability function
We have and the functions , , , , , , and . The dimension of the parameter vector is .
Consider the life-testing experiment with progressive Type-II censoring described in
Section 1 with
n items placed on the life testing experiment and
m failures are planned to be observed, let the set of all admissible Type-II PCSs as
where
is the set of the non-negative integers. Under a given censoring scheme
, the set of progressively Type-II censored order statistics is denoted as
. The PCS
is prefixed prior to starting the life testing experiment.
Suppose the lifetime distribution of the items in the life testing experiment follows a distribution in the exponential family of distributions with density function in Equation (
1), the joint density function of the observed data,
, can be expressed as [
8,
10]
where
,
and
is the normalizing constant. By defining a new random variables
the joint density function in Equation (
4) can be expressed as
The parameter of this form is called the natural parameter of the joint density function of the exponential family of distributions with progressive Type-II censoring.
The tangent vector
of the manifold of the function
is spanned by the vectors
, and the set
is called the natural basis associated with the coordinate system
. Let
and the set
be the linear space of random variables spanned by
. The space
is called the 1-representation of the tangent space with progressively Type-II censored data. Here, the basis
of the 1-representation is given by
and the second and the third order derivatives of
are given by
3. The -Connections of Manifold Model
In this section, we investigate the
-connection of the manifold of the statistical model for the exponential family of distributions with progressively Type-II censored data. From Equation (
4), the normalization factor
can be defined as
Since the function under the integral is assumed to be continuously differentiable, the order of integration and differentiation can be switched, and hence, the first three derivatives of the function
with respect to the natural parameter
are given by
where the expectations
are taken with respect to the joint density function in Equation (
4). The derivatives in Equations (
7)–(
9) can be considered as the expected value, the covariance and the third-order central moments of
, respectively. The derivative in Equation (
7) can also be obtained from the condition
that is
The derivatives in Equations (
8) and (
9) can be obtained by calculating, respectively,
Equations (
8) and (
9) show that the
element of the metric tensors is given by
the
element of the skewness tensor is given by
and the
element of the affine connection is given by
Therefore, based on the joint density function
, the
-connection of the manifold of an exponential family of distributions is given by
which means that the natural parameter
is 1-affine, i.e.,
. Based on the information carried by the joint density function in Equation (
4), we can obtain the following results.
Theorem 1. The metric tensors and the α-connection of the exponential family of distributions are given by respectively.
From the
-connection, we can obtain the torsion and the Riemann-Christoffel curvature of the manifold. The torsion is represented by the torsion tensor whose components are given by [
1,
2]
which is a tensor anti-symmetric with respect to indices
. Note that the coefficients
of the
-connections are symmetric with respect to the first two indices
i and
j, then the tensor
vanishes for any
-connection. This shows that the manifold of the statistical model of the exponential family of distributions with progressively Type-II censored data is torsion-free.
The Riemann-Christoffel curvature of the manifold can be obtained as [
1,
2]
where
. The Riemann-Christoffel curvature based on the
-connection is called the
-Riemann-Christoffel curvature and its tensor is given by
where
. The tangent space of the manifold is said to be
-flat if the
-Riemann-Christoffel curvature
. We can also obtain the
-covariant derivative and the Laplace operator based on the
-connection and the metric tensors.
In the above process for obtaining those geometric quantities, we only use the information from the joint density function
. There is, in fact, another kind of information in the progressively Type-II censored order statistics
. We can consider the marginal density function of the
r-th progressively Type-II censored order statistic,
(see, for example, Kamps and Cramer [
17], Balakrishnan [
18], Balakrishnan and Aggarwala [
8], and Balakrishnan and Cramer [
10])
where
Based on the marginal density in Equation (
10), the expectations of the random variables
in Equation (
5) can be obtained as
where
the expectation
is taken with respect to the density function in Equation (
10). Suppose that the random variables
are independent, and let
we can obtain the following results.
Theorem 2. The Fisher metric tensors and the α-connection of the exponential family of distribution with progressively Type-II censored data are given by respectively.
Proof. For the metric tensors, they can be obtained directly from the definition of
. For the affine connection, from
, Equations (
7) and (
8), we can obtain
Then, the third-order tensor
can be specified as
□
4. Applications in Bayesian Predictive Inference and Asymptotic Expansions
In Bayesian inference for the exponential family of distributions, the parameter vector
is considered as a random variable. Given a prior density function for
,
, the joint posterior density function of the exponential family of distributions with progressively Type-II censored data can be expressed as
and the posterior Bayesian predictive distribution is given by
where
x is an unobserved set of observations to be predicted and it is independently distributed according to the same density
. The predictive density
is called the plug-in density function or the estimative density function, where
is an estimate of
based on the observed progressively Type-II censored sample
(see, for example, Geisser [
19]). Consider the Kullback-Leibler divergence as the loss function, the predictive distribution in Equation (
12) is the best predictive distribution in the sense that it minimizes the Bayes risk defined as [
20]
The integral defined in the predictive density in Equation (
12) can be difficult to integrate or the form is too complicated to be used in practice. In these situations, asymptotic or large-sample theory (see, for example, Barndorff-Nielsen and Cox [
21]) can be considered. In this section, we adopt the metric tensors and the
-connection introduced in
Section 2 and
Section 3 to study the asymptotic expansions of the posterior joint density and the Bayesian predictive density of the exponential family of distributions with progressively Type-II censored data. A similar asymptotic expansion of Bayesian prediction based on a full sample can be found in Zhang et al. [
14]. For simplicity, we only consider the information carried by the joint density function in Equation (
4), a similar process can be applied for the situation where the information obtained from the joint density function in Equation (
4) and the marginal density function in Equation (
10) together.
Theorem 3. Given a prior distribution for θ, the posterior distribution in Equation (11) can be expressed asymptotically as where and is an estimator of parameter set θ.
Proof. Using the Laplace method suggested by Nielsen and Cox [
21], the posterior distribution can be expressed asymptotically as
We have
which implies that
□
Based on the asymptotic expansion presented in Theorem 3, we can obtain the following result.
Theorem 4. Given a prior distribution for θ, the predictive distribution in Equation (12) can be expressed asymptotically as Proof. The proof is similar to the proof of Theorem 2 in Komaki [
3]. The proof can be completed by substituting
and
with
and
, respectively. □
If the prior distribution
is the Jeffreys prior
, then from the relationship
we have
The following results can be immediately obtained.
Corollary 1. Given the Jeffreys prior , the posterior distribution in Equation (11) can be asymptotically expanded as Corollary 2. Given the Jeffreys prior , the prediction (12) can be asymptotically expanded as These results show that the predictive density function, when the sample size n approaches infinity, is the estimative density function in the asymptotic sense.
5. Illustration Example
The illustration of the geometric quantities for exponential distribution has been provided in the literature (see, for example, [
12]). In this section, we use the Rayleigh distribution, a member of the exponential family of distributions, presented in
Section 2 as an example to illustrate our results. Suppose that
is the progressively Type-II censored order statistics form items with lifetimes follow the Rayleigh distribution with density function in Equation (
3), then the joint density function of
can be expressed as
where
.
,
,
,
and
. Let
Then, the first three derivatives of the function
can be obtained as
The maximum likelihood estimator (MLE) of the parameter
can be derived as
We first consider the information carried by the joint density in Equation (
4). The metric tensors have one element, that is,
The skewness tensor can be written as
The affine connection and the
-connection can be obtained as
respectively. The coefficients of the
m-connection and
e-connection are
respectively. For Bayesian inference, we consider the Jeffreys prior for the parameter
, i.e.,
then the posterior distribution of
is
which can be written as
Here, the predictive distribution is
which can be expanded asymptotically as
In the following, we consider the information obtained from the marginal density function in Equation (
10) and the joint density function in Equation (
4) together. Notice that
The
n-order moment about the origin of the
r-th progressively Type-II censored order statistic is given by
which implies
Thus, the affine connection is specified as
The metric tensor
and the skewness tensor
are the same as the expressions in Equations (
13) and (
14). The
-connection is reduces to
The coefficients of the
m-connection and the
e-connection are
respectively. Therefore, based on the Jeffreys prior
, the Bayesian predictive density function of the Rayleigh distribution with progressively Type-II censored data can be asymptotically expanded as
This shows that the predictive density function, with the increase of the sample size
n and the observed sample size
m, is the estimative density function in the asymptotic sense. The term
can be considered the correction term due to the information carried by the density function in Equation (
10).
6. Monte Carlo Simulation Study and Real Data Analysis
In this section, we present a Monte Carlo simulation study of the Bayesian prediction based on progressively Type-II censored data described in
Section 4. We also present a real data analysis based on the progressive Type-II censored data discussed in the literature. In the Monte Carlo simulation study, we consider different sample sizes
, (10, 35), (15, 40) and (20, 40) and three different censoring schemes:
- :
;
- :
;
- :
The progressively Type-II censored data,
, are generated based on the Rayleigh distribution in Equation (
3) with parameter
for different sample sizes and censoring schemes. For the proposed Bayesian prediction (BP), we consider two different priors: (i) the Jeffreys prior
; and (ii) the uniform prior
on interval
. For comparative purposes, we also consider the plug-in prediction (PP) approach in which the estimative density function
is also considered. For the plug-in approach, the parameter is estimated by using the maximum likelihood method based on the simulated progressive Type-II censored sample
. The estimated biases and mean square errors (MSEs) of different prediction approaches for predicting the probability density at
based on 10,000 simulations are presented in
Table 1.
From
Table 1, we observe that the performances of all prediction methods improve in terms of MSEs as the sample sizes
m and
n increase. In other words, the number of items being removed during the progressively Type-II censored experiment affects the performance of prediction. Moreover, we observe that the Bayesian prediction method with the Jeffreys prior can provide smaller biases and smaller MSEs compared to the plug-in prediction method in some cases.
To illustrate the practical applications of the approximate methods based on geometric quantities proposed in this paper, we analyze a real data set which contains the times to breakdown of an insulating fluid at 34 kV originally presented in Nelson [
22] (Table 6.1). A progressively Type-II censored sample of size
generated from the
observations by Balakrishnan et al. [
9] is analyzed here. The progressively censored sample and the progressive censoring scheme are presented in
Table 2.
Suppose that we assume the lifetimes of the insulating fluid tested at 34 kV follow a Rayleigh distribution and we are interested in predicting the probability density, based on the progressive Type-II censored data presented in
Table 2, the predicted density curves obtained from the plug-in prediction approach and the proposed Bayesian prediction approach with two different priors are presented in
Figure 1. From
Figure 1, we observe that the three prediction methods provide similar predicted density curves in this case. For instance, if we are interested in predicting density at
, based on the progressive Type-II censored data presented in
Table 2, the predicted values of plug-in prediction density
is
, and the Bayesian prediction densities
with Jeffreys prior
and
uniform prior
are
and
, respectively.