Another Look at the Shape Parameter of the Weibull Distribution
(Beta)
It is well known that the shape parameter of the Weibull distribution,
beta (β), represents the failure rate behavior. If beta is less than 1, then the failure
rate decreases with time; if beta is greater than 1, then the failure rate increases
with time. When beta is equal to 1, the failure rate is constant. Assume
that we have two failure data sets. From one data set, the estimated beta is 5; from
another data set, the estimated beta is 3. Both have increasing failure rates. If
the two data sets were combined together, would the estimated beta for the overall
data set be a value between 3 and 5, or would it be a value outside of this range? Could it
be less than 1, which would imply that the combined data set has a failure rate
that is decreasing with time? In the article, we will use examples to show the
relationship between these betas.
In order to answer the above questions, we will look at the issue from another
perspective. Assume there is the one set of life data that is fitted using a Weibull
distribution. This data set has a shape parameter β. If we separate the data
into two groups and fit each group with a separate Weibull distribution, we
get β1 and β2 as the shape parameters of the two
groups. If we suppose that β1 < β2,
will β1 < β < β2 always be true, or is
there some other relationship between these beta values?
Suppose that a testing engineer obtained the life data shown in Table 1. According to
previous experience, the 2-parameter Weibull distribution should be used to fit the data.
The analysis using Weibull++
shows that β = 0.9812 and η = 73.6036, as shown in Figure 1.
Table
1: Failure Data
State
(F or S) |
Time to State |
Item Index |
F |
2 |
1 |
S |
3 |
2 |
F |
5 |
3 |
S |
7 |
4 |
F |
11 |
5 |
S |
13 |
6 |
S |
17 |
7 |
S |
19 |
8 |
F |
23 |
9 |
F |
29 |
10 |
S |
31 |
11 |
F |
37 |
12 |
S |
41 |
13 |
F |
43 |
14 |
S |
47 |
15 |
S |
53 |
16 |
F |
59 |
17 |
S |
61 |
18 |
S |
69 |
19 |
F |
75 |
20 |

Figure 1: Data Analysis for
the Entire Data Set
To illustrate the subgroups parameters, we extract different subgroups
from the full data set. We could use different methods to accomplish this.
For the first method, we separate data from the middle point of the entire group. We
classify the early failure items as "Subgroup I" and the later failure
items as "Subgroup II." These two subgroups are listed in Table 2.
Table
2: Method I Subgroup Data
|
Subgroup I (Early) |
Subgroup II (Later) |
Item
Index |
State
(F or S) |
Time to
State |
State
(F or S) |
Time to
State |
1 |
F |
2 |
S |
31 |
2 |
S |
3 |
F |
37 |
3 |
F |
5 |
S |
41 |
4 |
S |
7 |
F |
43 |
5 |
F |
11 |
S |
47 |
6 |
S |
13 |
S |
53 |
7 |
S |
17 |
F |
59 |
8 |
S |
19 |
S |
61 |
9 |
F |
23 |
S |
69 |
10 |
F |
29 |
F |
75 |
The analysis using the 2-parameter Weibull distribution shows that Subgroup
I (Early) has β1 = 1.105 and η1 = 24.0872, while
Subgroup II (Later) has β2 =3.6717 and η2 = 70.8740.
A second method to extract different subgroups is to separate the data according to the failure
index. Entries with odd index numbers are put in Subgroup I and
entries with even index numbers are
assigned to Subgroup II. These two subgroups are listed in Table 3.
Table
3: Method II Subgroup Data
|
Subgroup I (Even) |
Subgroup II (Odd) |
Item
Index |
State
(F or S) |
Time to
State |
State
(F or S) |
Time to
State |
1 |
S |
3 |
F |
2 |
2 |
S |
7 |
F |
5 |
3 |
S |
13 |
F |
11 |
4 |
S |
19 |
S |
17 |
5 |
F |
29 |
F |
23 |
6 |
F |
37 |
S |
31 |
7 |
F |
43 |
S |
41 |
8 |
S |
53 |
S |
47 |
9 |
S |
61 |
F |
59 |
10 |
F |
75 |
S |
69 |
The analysis using the 2-parameter Weibull distribution shows that Subgroup
I (Odd) has β1' = 0.7419 and η1' = 62.4596, while
Subgroup II (Even) has β2' =2.4834
and η2' = 61.4978.
As we can see, for the first method, β < β1
< β2, and for the second method, β1' <
β < β2'. So it depends on how the subgroup was extracted
from the original data. From the above examples we also can see that there is
no fixed relationship between these betas.
Now let’s examine the definition of the standard deviation of the Weibull
distribution:

where
is the gamma
function evaluated at the value of
.
Table 4 lists the standard deviation values for the overall group and for each
subgroup.
Table
4: Distribution Parameters for Different Subgroups
|
Whole
Group |
Subgroup
Early |
Subgroup
Later |
Subgroup
Odd |
Subgroup
Even |
β |
0.981 |
1.105 |
3.6717 |
0.7419 |
2.4834 |
η |
73.6036 |
24.0872 |
70.8740 |
62.4596 |
61.4978 |
1/β |
1.0941 |
0.9050 |
0.2724 |
1.3479 |
0.4027 |
σT (standard
deviation) |
75.63642 |
21.0307 |
19.3781 |
102.75802 |
23.4846 |
We can see that the relationship between the 1/β of the whole group and
each of the subgroups are similar to the relationships of their standard deviations
σT. In other words, 1/β roughly
reflects the standard deviation of the distribution. For example, in our
first classification method I, as 1/β for the whole group is greater than either
subgroup; so is its σT is greater
than either subgroup.
It will be even clearer if we use the logarithmic transformation of the raw data to fit
a Gumbel distribution. As we know, the logarithm transform of Weibull data follows
the Gumbel distribution with μ = ln(η) and
, where μ is the
location parameter and σ is the scale parameter of the Gumbel distribution. The
standard deviation for the Gumbel distribution is given by:

From the above equation, we can see how 1/β reflects the spread of a data
set.
Conclusions
In this article, we answered the question about the relationship between the beta
for the entire data set and the betas for subgroups of that data set. As we can
see, this relationship is not fixed, and can change. Moreover, 1/β can be
interpreted as something similar to the standard deviation of a data
set. So 1/β not only represents the behavior of the failure rate, it also
reflects the spread of a data set.
|