METHODS article

Front. Appl. Math. Stat., 14 February 2023

Sec. Statistics and Probability

Volume 9 - 2023 | https://doi.org/10.3389/fams.2023.1113477

On testing exponentiality under Type-I censoring

  • Statistics Program, Department of Mathematics, Statistics and Physics, College of Arts and Sciences, Qatar University, Doha, Qatar

Article metrics

View details

1

Citations

3,5k

Views

460

Downloads

Abstract

Two new goodness-of-fit testing procedures are introduced to test exponentiality when data are subject to Type-I censoring. We proposed four test statistics for this purpose. Under extensive Monte Carlo simulations, we showed that the proposed tests maintain the nominal significance level and show good power for both monotonic and non-monotonic hazard function alternatives even for small samples as n = 10. A real dataset is studied for illustrative purposes.

1. Introduction

In reliability and life testing problems, Type-I censoring has gained a significant amount of popularity due to the duration of the experiment being fixed prior to it being started and the fact that it is under the control of the experimenter.

It is of interest to study the lifetime of n items by performing a life testing experiment. By controlling the total time, the experiment can be terminated at the time of T, which can be determined before the life testing experiment begins. This means that d observations take the form of X1 : nX2 : n ≤ … ≤ Xd : n, and nd data values are censored, as discussed by Balakrishnan and Cohen [1] and Cohen [2].

The exponential distribution considered in this article is well-known and frequently uses lifetime models. The exponential model is a special case among many important statistical models such as Weibull and gamma distributions. The simplicity and the existence of closed form solutions for many problems make the exponential model appealing, which informs the current study (see also Balakrishnan and Basu [3]). We assume the following form of pdf for the exponential distribution with scale parameter θ

Suppose n items are placed in a life testing experiment, which will be terminated at a pre-determined time T > 0. Let X1 : n, X2 : n, …, Xd : n be the corresponding Type-I censored sample from a distribution function F. Consider the following goodness-of-fit hypothesis

For some positive scale parameter θ. Based on this, the current study was interested in testing for exponentiality.

The maximum likelihood estimator (MLE) of θ, based on censored data X1 : n, X2 : n, …, Xd : n is given by

Provided that d ≥ 1. However, hereafter we assume that d ≥ 1 and that at least one example of censored data are observed.

Pearson [4] was the first to study the problem of goodness-of-fit, which is a statistical procedure for testing the suitability of a specific model to describe a given set of complete or censored data. For a detailed discussion of this problem see D'Agostino and Stephens [5], Huber-Carol et al. [6], and Nikulin and Chimitova [7] among others.

Stephens [8] proposed a version of the Cramer-von Mises and Anderson-Darling goodness-of-fit test statistics for Type-I censored data. Pakyari and Balakrishnan [9] studied a goodness-of-fit testing procedure for the exponential distribution when the available data are Type-I censored. They studied the goodness-of-fit testing problem for the exponential model by treating the Type-I censored data as a complete sample and then performing classical goodness-of-fit tests for complete data.

Their method considered the Type-I censored sample X1 : nX2 : n ≤ … ≤ Xd : n as order statistics from a complete sample of size d, from a right-truncated exponential distribution at time T.

Pakyari and Resalati Nia [10] extended their work to test other lifetime models such as Weibull and log-normal distributions. We use this idea to present test statistics based on order statistics in Section 2. For the various goodness-of-fit procedures available for censored data see also Balakrishnan et al. [11], Balakrishnan et al. [12], Doring and Cramer [13], Lim and Park [14], Lin et al. [15], Noughabi [16], Pakyari and Balakrishnan [17], Pakyari [18], Pakyari and Baklizi [19], Park and Pakyari [20], and Qi et al. [21].

This article presents new testing procedures for testing the goodness-of-fit of the exponential model when data are Type-I censored. We study several testing procedures in this regard such as tests based on order statistics, tests based on quantiles, and tests based on binomial distribution. However, our proposed method is based on order statistics followed by tests based on quantiles. We investigate the empirical power of the proposed tests through an extensive Monte Carlo simulation study.

This study aims to provide some easy yet powerful goodness-of-fit testing procedures for exponentiality, which is known to be a special case among many well-applied lifetime models.

The paper is structured as follows. Section 2 introduces some test statistics which are constructed based on order statistics. In Section 3 we propose a test statistic based on a linear combination of quantiles vector. Tests based on binomial distribution are discussed in Section 4. In Section 5, we investigate the validity of the proposed tests by calculating the empirical significance levels and comparing them with the nominated levels. We then perform a Monte Carlo simulation study to access the empirical power of the proposed tests so that we can compare them with the power of some known tests described in the literature on this subject. Finally, we explain the proposed tests using a real data example.

2. Tests based on order statistics

Note that conditional on D = d,

Where the order statistics V1 : d, …, Vd : d are a random sample of size d from exponential distribution but right truncated at T; see Arnold et al. [22] and David and Nagaraja [23].

On finding the MLE of θ, it will be useful to transform the Type-I censored sample X1 : n, X2 : n, …, Xd : n to the complete uniformly distributed sample U1 : d, U2 : d, …, Ud:d using the following transformation:

Therefore, testing that the Type-I censored data X1 : n, X2 : n, …, Xd : n follow exponential distribution is equivalent to testing that the complete data U1 : d, U2 : d, …, Ud:d follow a uniform distribution.

If we then let be the deviation of each order statistics Ui:d from its expected value, then several goodness-of-fit test statistics can be considered:

Large values of these statistics will tend to reject the null hypothesis of exponentiality. In Section 5, we use the Monte Carlo simulation to determine the upper tail of the simulated values of the statistics T1, T2, and T3 as critical points for testing exponentiality.

3. Test based on quantiles

Note that the order statistics Ui : d defined by Equation (3) follow beta distribution with parameters (i, di + 1). Define

Where is the cumulative distribution function (CDF) and is the incomplete beta function. pi is the CDF of the beta distribution with parameters (i, di+1) evaluated at ui : d.

The quantiles vector

can be used as a measure of goodness-of-fit. Extreme values of pi, i.e. values close to zero or one are signs of "badness-of-fit"! It is noteworthy that, although pi's are uniformly distributed over (0, 1), they are not statistically independent.

We propose a test statistic in terms of a linear combination of pi and 1 − pi as follows:

Where , for i = 1, 2, …, d and p(i)'s are the ordered values of pi arranged from smallest to largest. Note that the test statistic TP will be calculated for values of p(i) in the interval (0, 1), i.e. we exclude the cases with p(i) = 0 or p(i) = 1. Note also that whilst ui : d's are ordered in terms of their values, the pi's are not necessarily ordered. Moreover, the test statistic TP, will be large whenever one of pi's are close to zero or one. Hence, large values of TP provide evidence that the null hypothesis H0 of exponentiality should be rejected.

4. Test based on binomial distribution

Let the discrete random variable D denote the number of observed failures before the termination time T. Then, under the validity of the null hypothesis, under the exponentiality of the model, D follows a binomial distribution with parameters (n, p), where p = F(T) = 1 − exp(−T/θ).

Hence, testing the null hypothesis of exponentiality (1), is equivalent to performing a binomial test say

Note that if we assume that the null hypothesis is true, i.e. under the validity of the exponential model, we expect to observe nF(T) failures. The usual binomial test may then be used to find the associated p-value.

For large values of sample size n, the binomial distribution is well approximated by the Gaussian model in which a z-test is performed to the test statistic Z, using continuity correction given by

However, using the Monte Carlo simulation we found that the test statistic TB does not maintain the nominated significance level for small sample sizes even for sample sizes n ≤ 40, so we did not include the power of TB in our simulation study.

In the following section, we perform a Monte Carlo simulation to assess the power of the proposed tests for various alternative models, and for a combination of various sample sizes n and censoring proportion 1 − F(T) = exp(−T/θ).

5. Simulation study

In this section, the performance of our proposed tests will be evaluated by studying the empirical significance level and the empirical power through extensive Monte Carlo simulations. We used the R pseudo-random generator with 50,000 iterations.

First, we investigate the null distribution of the test statistics presented in the previous section using the Monte Carlo estimate of the coefficient of skewness () and the coefficient of kurtosis (β2) when the underlying distribution is standard exponential. The results are shown in Table 1. The coefficient of skewness () and the coefficient of kurtosis (β2) are defined as:

and

From Table 1, it is clear that the null distribution of all the test statistics are far from normality, as and β2 are not close to 0 and 3 respectively, which are the coefficients of skewness and kurtosis of normal distribution. This is also evident from Figure 1, which depicts the simulated pdf curves for the test statistics under the validity of the null hypothesis. Indeed, it has been observed that all the test statistics are skewed to the right. Hence, we use empirical critical values to perform goodness-of-fit tests.

Table 1

F(T) = 1 − exp(−T/θ)
CoefficientTest statistic0.40.50.60.70.80.90.99
T1−0.3129−0.11800.02710.19120.37960.58750.6320
T21.32321.31331.48081.68211.89632.10002.0743
T30.46410.48810.56410.71170.85801.00951.0359
TP1.47831.64181.67601.75001.62081.48101.4247
β2T12.92652.83642.78592.82092.99063.39853.4865
T26.47885.82477.12277.79838.946010.40609.8452
T33.42503.27783.38253.61523.92514.52934.5615
TP7.36608.69628.59729.13487.72536.48015.9467

Monte Carlo estimate of the coefficient of skewness () and coefficient of kurtosis (β2) for the test statistics T1, T2, T3, and TB under the null distribution of exponentiality.

Figure 1

Figure 1

Monte Carlo simulated pdf curves for the test statistics T1, T2, T3, and TP under the assumption of exponential model.

We compare the empirical power of the proposed tests to those of the EDF-based test statistics proposed by Pettitt and Stephens [24] and Stephens [8].

Stephens [8] studied the modification of the Kolmogorov-Smirnov statistic for the Type-I censored data from an exponential model in the form of:

Where and with being the MLE of the scale parameter θ given by Equation (2).

Pettitt and Stephens [24] also studied the Cramér-von Mises statistic and the Anderson-Darling statistic under Type-I censoring in the form of:

and

We considered seven alternative models in three groups , and based on their behavior of hazard functions as follows:

: Group I alternative (Monotonic decreasing hazard rates):

1. Gamma distribution with shape parameter α = 0.5 and scale parameter β = 1.0, denoted by Gamma (0.5, 1.0).

2. Weibull distribution with shape parameter a = 0.5 and scale parameter b = 1.0, denoted by Weibull (0.5, 1.0).

3. Lomax distribution with shape parameter d = 4.0 and scale parameter c = 1.0, denoted by Lomax (1.0, 4.0).

: Group II alternative (Monotonic increasing hazard rates):

1. Gamma distribution with shape parameter α = 2.0 and scale parameter β = 1.0, denoted by Gamma (2.0, 1.0).

2. Weibull distribution with shape parameter a = 2.0 and scale parameter b = 1.0, denoted by Weibull (2.0, 1.0).

: Group III alternative (Non-monotonic hazard rates):

1. Log-normal distribution with location parameter μ = 0 and scale parameter σ = 0.5, denoted by Log-normal (0, 0.5).

2. Log-normal distribution with location parameter μ = 0 and scale parameter σ = 1.0, denoted by Log-normal (0, 1.0).

The following forms of probability density functions were used here.

The gamma distribution with density function

Where α > 0 is the shape parameter and β > 0 is the scale parameter.

The Weibull distribution with density function

Where a > 0 and b > are the shape and scale parameters, respectively.

The log-normal distribution with density function

Where −∞ < μ < ∞ is the mean and σ > 0 is the standard deviation of the transformed normal distribution.

Finally, the Lomax distribution (also known as Pareto Type II), with probability density function

With the scale parameter c > 0 and the shape parameter d > 0.

The plot of CDFs of the alternative distributions in groups , and are depicted in Figure 2.

Figure 2

Figure 2

CDF of the alternative groups , , and (left to right) and the CDF of a standard exponential distribution (dashed).

For a comprehensive discussion of these distributions, one may refer to Johnson et al. [25, 26] and Kleiber and Kotz [27].

Verifying the empirical significance level is of great importance for the validity of any goodness-of-fit test statistic. To assess the validity of our tests we investigate the empirical significance level by generating 100, 000 Type-I censored random data from the exponential distribution with a rate equal to one (standard exponential). We considered a combination of various sample sizes n and proportions (probability) of failures F(T) = 1 − exp(−T). The empirical significance levels at nominated level α = 0.10 are tabulated in Table 2. The values in this table confirm the validity of our proposed tests in terms of preserving the nominated significance level.

Table 2

F(T) = 1 − exp(−T/θ)
Alt. modelTest statistic0.40.50.60.70.80.90.99
n = 10T10.09910.09930.09980.10040.10360.09750.0987
T20.09990.10030.09970.10130.09740.09950.1006
T30.09960.09990.09890.10160.09630.09910.1005
TP0.09890.09740.10180.09950.09710.099930.1026
n = 20T10.09930.10130.09980.10040.09940.10350.0983
T20.09890.10140.09690.09940.10080.10280.1001
T30.09940.10030.09750.10070.10270.10150.1007
TP0.09950.09810.09770.10120.09990.10250.1021
n = 30T10.10020.09810.10110.10420.10110.10100.0972
T20.10010.09930.10040.10260.10340.10030.0988
T30.09960.10020.09980.10100.10350.09980.0994
TP0.09850.09980.10100.10130.10230.09760.1001

Empirical significance level for the test statistics T1, T2, T3, and TP when n = 10, 20, 30 with α = 0.10 and 105 iterations.

The power of the proposed tests together with the powers associated with the classical EDF-based tests are recorded in Tables 35 for sample sizes n = 10, n = 20, and n = 30, respectively for the three alternative groups , , and . Figures 46 depict the corresponding heatmaps to provide better visualization of the results. The greyscale is given in Figure 7.

Table 3

F(T) = 1 − exp(−T/θ)
Alt. modelTest statistic0.40.50.60.70.80.90.99
Gamma (0.5, 1.0)T10.07780.07770.09130.12350.19690.31190.4422
T20.00120.00070.00090.00240.02990.19010.4574
T30.00110.00070.00080.00230.03270.19600.4764
TP0.40670.31240.24780.18070.13950.10720.0896
KS0.28900.30230.31610.32650.33650.34550.3506
CM0.28390.30820.32840.34670.36350.37690.3819
AD0.48320.49850.51220.52600.53970.55250.5584
Gamma (2.0, 1.0)T10.03970.04070.06440.10580.14360.14220.1123
T20.36740.54990.71120.81990.88020.89500.6350
T30.38400.56330.71930.82460.88390.89660.6469
TP0.16540.07550.05010.05940.11180.23110.5117
KS0.01720.04240.08640.14310.20900.27660.3428
CM0.00690.02230.05620.11290.19200.26580.3914
AD0.00470.01420.03730.07850.14170.20800.3416
Weibull (0.5, 1.0)T10.14850.17280.21360.27760.37500.48770.5944
T20.01110.01280.01780.03260.10940.41730.7360
T30.00890.01030.01240.02490.10870.42950.7533
TP0.28830.25410.23610.21890.21540.23830.3204
KS0.32610.36320.39730.43140.46730.51120.5845
CM0.30320.34890.39190.43250.47860.53280.6230
AD0.54930.58290.61260.64270.67400.71200.7748
Weibull (2.0, 1.0)T10.03760.03690.03070.02010.05090.18880.4095
T20.50260.55770.51260.40000.32290.42340.6259
T30.52870.59740.57370.47550.35150.35890.5843
TP0.13270.23370.43560.65170.81800.86590.8547
KS0.11310.24630.40200.54210.65150.66830.6739
CM0.07640.21320.40210.58310.72520.74890.7691
AD0.05000.15700.32620.50660.66990.70840.7374
Log-normal (0, 0.5)T10.00750.00230.00200.0070.05830.25160.5692
T20.72640.90870.94130.88440.81730.83990.8377
T30.75450.92700.96820.93380.85510.81930.7770
TP0.08330.17590.42200.67340.84440.93090.9738
KS0.05970.28010.56900.74980.83490.87150.8759
CM0.02790.20980.52210.75160.86000.90690.9119
AD0.01840.15280.43930.69240.82260.88220.8999
Log-normal (0, 1.0)T10.02060.03080.04350.05210.06330.06850.0870
T20.41810.41110.40240.38480.36980.33500.1544
T30.45290.44180.43220.41660.39110.33470.1486
TP0.10140.11950.13500.14020.14790.15600.1982
KS0.08780.09220.10910.11520.12110.12990.1318
CM0.05980.09510.12110.12750.12970.13150.1354
AD0.03780.06450.08610.09990.10080.10230.1059
Lomax (1, 4.0)T10.00700.00510.00670.01170.02990.07430.1590
T20.00010.00020.00010.00040.00220.03610.1706
T30.00010.00010.00010.00020.00210.03640.1781
TP0.53900.39980.27040.16470.10730.07840.0727
KS0.09990.10730.11630.12750.13950.14950.1582
CM0.09310.10350.11530.12870.14230.15540.1658
AD0.10380.11310.12450.13910.15450.17000.1881

Empirical power for various alternative models, n = 10, α = 0.10 and 105 iterations.

Bold values indicate the largest estimated power in each case.

Table 4

F(T) = 1 − exp(−T/θ)
Alt. modelTest statistic0.40.50.60.70.80.90.99
Gamma (0.5, 1.0)T10.01820.02680.04940.10550.23090.43810.6051
T20.00060.00060.00040.00030.02720.29490.6679
T30.00010.00020.00060.00040.02750.28740.6760
TP0.65790.56610.47200.38920.30550.23630.1861
KS0.47190.49690.52080.54010.55760.57090.5810
CM0.49530.52800.55660.57940.59950.61640.6266
AD0.69140.71110.72700.74210.75520.76800.7769
Gamma (2.0, 1.0)T10.03560.09650.19810.30720.36050.28730.1951
T20.65290.84020.93820.97950.99150.99350.8940
T30.66650.84330.93610.97860.99040.99360.9105
TP0.10040.03200.02820.06300.14870.34300.7356
KS0.04970.10520.17880.25870.34430.43770.5446
CM0.02410.06930.14650.24500.35730.48050.6217
AD0.01590.05070.11710.20930.32080.45110.6081
Weibull (0.5, 1.0)T10.12430.18890.30360.45490.63250.78290.8686
T20.00220.00370.00550.01530.10680.61300.9267
T30.00150.00200.00260.00770.10240.61480.9325
TP0.53910.51880.50170.49840.51270.55020.6573
KS0.55170.60320.64890.69190.73630.78680.8607
CM0.56720.62470.67720.72430.77140.82230.8926
AD0.77000.80680.83830.86570.89070.91690.9516
Weibull (2.0, 1.0)T10.05190.05550.03380.01500.07470.46670.8202
T20.78150.82000.75880.64400.60070.81180.9483
T30.80370.86030.84010.76030.66060.77700.9449
TP0.13330.33590.63360.87650.97100.98990.9900
KS0.24900.44720.64580.80290.89660.91060.9257
CM0.21030.44860.68510.85380.94230.96200.9711
AD0.17190.40190.64870.83450.93280.97020.9662
Log-normal (0, 0.5)T10.00440.00190.00190.00220.08340.57930.9512
T20.93130.99580.99920.99650.99420.99780.9979
T30.94480.99720.99980.99910.99670.99680.9947
TP0.05240.25080.66120.92240.98840.99920.9999
KS0.19640.62360.88980.96910.99020.99540.9963
CM0.11810.55720.88860.97620.99420.99780.9986
AD0.08440.50290.87710.97710.99560.99850.9991
Log-normal (0, 1.0)T10.02750.04680.06500.07550.08260.08580.1259
T20.63700.63780.60740.58510.56370.52910.2006
T30.68610.69000.66630.64400.60860.54430.2060
TP0.11230.15180.17480.19600.20920.22970.3296
KS0.10590.12160.14870.15240.16820.17190.2016
CM0.13510.14770.15860.17820.19430.20590.2206
AD0.12510.13370.14080.17570.18450.19580.2147

Empirical power for various alternative models, n = 20, α = 0.10 and 105 iterations.

Table 5

F(T) = 1 − exp(−T/θ)
Alt. modelTest statistic0.40.50.60.70.80.90.99
Gamma (0.5, 1.0)T10.00480.01090.03270.09620.25670.53260.7399
T20.00010.00010.00020.00020.02380.37570.8141
T30.00010.00020.00020.00060.02350.35970.8173
TP0.79430.71990.63930.55160.45840.36300.3011
KS0.61890.65100.67550.69840.71790.73350.7442
CM0.65520.68890.71690.74050.76020.77730.7875
AD0.82310.84210.85640.86880.87900.88890.8956
Gamma (2.0, 1.0)T10.06790.18290.35640.51110.55270.42920.2861
T20.81000.93970.98770.99780.99950.99970.9749
T30.81740.94030.98650.99750.99940.99970.9809
TP0.04650.01570.02720.07680.20160.46170.8659
KS0.07050.14490.25200.35910.46320.57540.7054
CM0.03920.10600.22890.36420.49700.63260.7788
AD0.03040.09160.20760.34320.48600.63370.7861
Weibull (0.5, 1.0)T10.10630.20640.37890.60820.80000.92260.9644
T20.00060.00110.00270.00930.11050.74250.9816
T30.00030.00050.00090.00300.10570.73790.9832
TP0.71340.69590.69360.70150.71820.75740.8521
KS0.71100.76160.80540.84340.87820.91410.9576
CM0.73970.79370.83780.87480.90740.93780.9724
AD0.88530.91330.93430.95180.96530.97780.9906
Weibull (2.0, 1.0)T10.07480.07340.03510.01550.09910.68810.9623
T20.90470.93090.89460.81360.79680.95010.9957
T30.91910.95670.94820.90440.85590.93680.9958
TP0.16080.45550.79790.96180.99620.99960.9996
KS0.35150.60200.80060.92300.97470.98840.9982
CM0.32750.62970.84620.95480.99080.99770.9985
AD0.30160.60560.83620.95290.99000.99780.9989
Log-normal (0, 0.5)T10.00270.00200.00210.00120.10440.77790.9974
T20.98180.99971.00000.99990.99991.00001.0000
T30.98710.99981.00001.00001.00001.00001.0000
TP0.04530.35130.84080.98690.99961.00001.0000
KS0.31410.80190.97840.99790.99980.99991.0000
CM0.22030.75500.98110.99860.99991.00001.0000
AD0.17890.74530.98300.99920.99991.00001.0000
Log-normal (0, 1.0)T10.03760.06420.08480.10390.10370.09340.1576
T20.78670.77790.75590.72900.70920.66920.2460
T30.83180.83330.81600.79180.76370.69160.2606
TP0.13750.19230.23900.26570.28140.31590.4363
KS0.11750.12040.14030.16260.17110.17950.2113
CM0.13930.14240.15010.16460.18400.19850.2214
AD0.14920.14280.15950.17780.19210.20440.2379
Lomax (1, 4.0)T10.00010.00010.00020.00070.00900.07280.2148
T20.00010.00010.00010.00010.00320.04010.2905
T30.00010.00010.00010.00010.00020.03610.2948
TP0.79730.63380.45520.29430.18630.12560.1059
KS0.12120.13910.16330.18870.21670.24430.2718
CM0.12160.14420.17210.20460.23850.27110.2970
AD0.12930.14940.17640.20970.24560.28210.3185

Empirical power for various alternative models, n = 30, α = 0.10 and 105 iterations.

Bold values indicate the largest estimated power in each case.

Figure 3

Figure 3

Histogram of the complete data and fitted exponential pdf curve of the data in Table 6.

Figure 4

Figure 4

Heatmap of the empirical powers listed in Table 3 for n = 10, α = 0.10 and 105 iterations.

Figure 5

Figure 5

Heatmap of the empirical powers listed in Table 4 for n = 20, α = 0.10 and 105 iterations.

Figure 6

Figure 6

Heatmap of the empirical powers listed in Table 5 for n = 30, α = 0.10 and 105 iterations.

Figure 7

Figure 7

The greyscale used in Figures 46.

The test statistics T3 and TP outperformed the classical EDF-based statistics for groups and , respectively for the monotonic increase and non-monotonic hazard function alternatives for all sample sizes considered here. The test statistic T2 also had the best power in some cases in groups and . However, in the group alternative for monotonic decreasing hazard functions, the EDF-based test statistic AD performed better than the other tests. In Table 5, for log-normal (0, 0.5) alternative and n = 30, the empirical powers are equal to 1.00 for most tests when the censoring proportion F(T) is at least 60%. This shows the consistency of the test statistics considered here. Moreover, as one would expect the empirical power values of all the tests considered here increase when the sample size n increases and/or when the censoring proportion F(T) increases.

In summary, for the monotonically increasing and non-monotonic hazard rate alternatives, we recommend using the test statistics T3 and TP. For the Lomax model alternative, we recommend TP for a small amount of censoring proportion and the AD statistic for large values of F(T).

6. Numerical example

In this section, we study a numerical example to illustrate our proposed procedure and test statistics. The data concerning the times to breakdown of an insulating fluid tested at 34 kilovolts for n = 19 insulating fluids (see Nelson [28], Table 1.1, page 105).

Suppose we decided to terminate the experiment at time T = 15 so any data larger than 15 is censored. The complete and the Type-I censored data are summarized in Table 6.

Table 6

Complete data0.190.780.961.312.783.164.154.674.856.50
7.358.018.2712.0631.7532.5233.9136.7172.89
Type-I censored data0.190.780.961.312.783.164.154.674.856.50
7.358.018.2712.06

Insulating fluid tested at 34 kilovolts data.

The value of d is found to be d = 14 and using Equation (2), the MLE of θ is . Hence, F(T) = F(15) = 1 − exp(−15/10) = 0.78.

The values of the test statistics and the associated p-values are given in Table 7. The p-values are sufficiently large for all test statistics and thereby the null hypothesis of exponentiality is not significant and the exponential model fits the data. The histogram of the complete data and the fitted exponential pdf curve with scale parameter θ = 10 are depicted in Figure 3.

Table 7

CriterionT1T2T3TP
Test statistic0.24380.00800.07489.0941
p-value0.62240.82570.81100.6066

Test values of test statistics and their corresponding p-values for the data in Table 6 when testing for the exponential model.

7. Concluding remarks

In this paper, we proposed some new goodness-of-fit tests for exponentiality when the available data are Type-I censored. We employed two methods for this purpose: the first was based on the distance between the observed order statistics and its theoretical mean under the assumption of exponentiality.

The second method was based on the values of quantiles of uniform order statistics, which are known to follow the beta distribution, as is the fact that under the assumption of the null hypothesis, most of the quantiles pi's should be close to 0.5. We proposed test statistics based on the weighted mean of the logarithm of pi.

Among the four test statistics presented in this article, the test statistic T3, based on order statistics, exhibits the most powerful test followed by the test statistic T4, which is based on quantiles.

The large sample properties of the proposed estimators will be examined in a separate future study through Monte Carlo simulation.

Statements

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

Sections 1–4 were prepared by RP. Sections 5, 6 were prepared by OA-H (60%) and RP (40%). All authors contributed to the article and approved the submitted version.

Acknowledgments

We are grateful to the associate editor and reviewers for providing useful comments and suggestions on an earlier draft, which led to this improved version of the article. We thank Open Access funding provided by the Qatar National Library.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

  • 1.

    BalakrishnanNCohenAC. Order Statistics and Inference: Estimation Methods.Boston, MA: Academic Press (1991).

  • 2.

    CohenAC. Truncated and Censored Samples: Theory and Methods.New York, NY: Marcel Dekker (1991).

  • 3.

    BalakrishnanNBasuAP. (Eds.). The Exponential Distributions: Theory, Methods, and Applications.Newark, NJ: Taylor and Francis (1995).

  • 4.

    PearsonK. X. On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling. Philos Mag Ser. (1900) 50:15775. 10.1080/14786440009463897

  • 5.

    D'AgostinoRBStephensMA. (Eds.) Goodness-of-Fit Techniques. New York, NY: Marcel Dekker (1986).

  • 6.

    Huber-CarolCBalakrishnanNNikulinMSMesbahM. (Eds.). Goodness-of-Fit Tests and Model ValidityBoston, MA: Birkhäuser, (2002).

  • 7.

    NikulinMSChimitovaEV. Chi-Squared Goodness-Of-Fit Tests for Censored Data. Hoboken, NJ: John Wiley & Sons (2017).

  • 8.

    StephensMA. Tests based on EDF statistics, In: Goodness-of-Fit TechniquesD'AgostinoRBStephensMA, editors. New York, NY: Marcel Dekker (1986).

  • 9.

    PakyariRBalakrishnanN. Testing exponentiality based on Type-I censored data. J Stat Comput Simulat.83:2369378. 10.1080/00949655.2012.691974

  • 10.

    PakyariRResalati NiaK. Testing goodness-of-fit for some lifetime distributions with conventional Type-I censoring. Commun Stat Simulat Comput. (2017) 46:29983009. 10.1080/03610918.2015.1069347

  • 11.

    BalakrishnanNNgHKTKannanN. Goodness-of-fit tests based on spacings for progressively Type-II censored data from a general location-scale distribution. IEEE Trans. Reliabil. (2004) 53:34956. 10.1109/TR.2004.833317

  • 12.

    BalakrishnanNRadAHArghamiNR. Testing exponentiality based on Kullback-Leibler information with progressively Type-II censored data. IEEE Trans Reliabil. (2007) 56:3017. 10.1109/TR.2007.895308

  • 13.

    DoringMCramerE. On the power of goodness-of-fit tests for the exponential distribution under progressive Type-II censoring. J Stat Comput Simulat. (2019) 89:29973034. 10.1080/00949655.2019.1648468

  • 14.

    LimJParkS. Censored kullback-leibler information and goodness-of-fit test with type II censored data. IEEE Trans Reliabil. (2007) 75:63342. 10.1080/02664760701592000

  • 15.

    LinCTHuangYLBalakrishnanN. A new method for goodness-of-fit testing based on Type-II right censored samples. J Appl Stat. (2008) 34:105164. 10.1109/TR.2008.2005860

  • 16.

    NoughabiHA. Testing exponentiality based on Kullback–Leibler information for progressively Type II censored data. Commun Stat Simulat Comput. (2017) 46:7624-638. 10.1080/03610918.2016.1248569

  • 17.

    PakyariRBalakrishnanN. A general purpose approximate goodness-of-fit test for progressively Type-II censored data. (2012) IEEE Trans Reliabil. 61:238-242. 10.1109/TR.2012.2182811

  • 18.

    PakyariR. Goodness-of-fit testing based on Gini Index of spacings for progressively Type-II censored data. Commun Stat Simulat Comput. (2021) 10.1080/03610918.2021.1930052

  • 19.

    PakyariRBakliziA. On Goodness-of-fit testing for burr Type X distribution under progressively Type-II censoring. Comput Stat. (2022) 37:224965. 10.1007/s00180-022-01197-5

  • 20.

    ParkSPakyariR. Cumulative residual Kullbak-Leibler information with the progressively Type-II censored data. Stat Probabil Lett. (2015) 106:28794. 10.1016/j.spl.2015.07.029

  • 21.

    QiWAllenASLiYJ. Family-based association tests for rare variants with censored traits. PLoS ONE. (2019) 14:117. 10.1371/journal.pone.0210870

  • 22.

    ArnoldBCBalakrishnanNNagarajaHN. A First Course in Order statistics.New York, NY: John Wiley & Sons (1992).

  • 23.

    DavidHANagarajaHN. Order Statistics. 3rd ed. Hoboken, NJ: John Wiley & Sons (2003).

  • 24.

    PettittANStephensMA. Modified Cramér-von Mises statistics for censored data. Biometrika (1976) 63:2918.

  • 25.

    JohnsonNLKotzSBalakrishnanN. Continuous Univariate Distributions-Vol. 1. 2nd ed. New York, NY: John Wiley & Sons (1994).

  • 26.

    JohnsonNLKotzSBalakrishnanN. Continuous Univariate Distributions-Vol. 2.2nd ed. New York, NY: John Wiley & Sons (1995).

  • 27.

    KleiberCKotzS. Statistical Size Distributions in Economics and Actuarial Sciences.Hoboken, NJ: John Wiley & Sons (2003).

  • 28.

    NelsonW. Applied Life Data Analysis.New York, NY: John Wiley & Sons (1982).

Summary

Keywords

beta distribution, binomial distribution, exponential distribution, goodness-of-fit testing, maximum likelihood estimator, order statistics, quantiles, Type-I censoring

Citation

Pakyari R and Al-Hamed OM (2023) On testing exponentiality under Type-I censoring. Front. Appl. Math. Stat. 9:1113477. doi: 10.3389/fams.2023.1113477

Received

01 December 2022

Accepted

06 January 2023

Published

14 February 2023

Volume

9 - 2023

Edited by

Paul Horn, Cincinnati Children's Hospital Medical Center, United States

Reviewed by

Yiying Zhang, Southern University of Science and Technology, China; Fuxia Cheng, Illinois State University, United States

Updates

Copyright

*Correspondence: Reza Pakyari ✉

This article was submitted to Statistics and Probability, a section of the journal Frontiers in Applied Mathematics and Statistics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics