The bootstrap method for Monte Carlo integration inference

Asamh Saleh M. Al Luhayb

doi:10.1016/j.jksus.2023.102768

View/Download PDF

Buy Reprints

PDF

Translate this page into:

Original article

08 2023

:35;

102768

doi:

10.1016/j.jksus.2023.102768

The bootstrap method for Monte Carlo integration inference

Asamh Saleh M. Al Luhayb

Department of Mathematics, College of Science, Qassim University, P.O. Box 6644, Buraydah 51452, Saudi Arabia

Received: 2022-5-9, Accepted: 2023-6-13,

Disclaimer:
This article was originally published by Elsevier and was migrated to Scientific Scholar after the change of Publisher.

Peer review under responsibility of King Saud University.

Abstract

In this paper, the use of bootstrap method with Monte Carlo integration is introduced for one dimension. This approach is based on generating observations from a known distribution for the bootstrap samples, then apply the Monte Carlo method on each bootstrap sample to estimate the integral of interest. The empirical distribution, or the bootstrap distribution, of the estimation results can be used as a good proxy for the distribution of the integral of interest. Based on the bootstrap distribution, the standard error of the estimate of the integral of interest can be derived. Also, the percentile and Normal confidence intervals with confidence level $(1 - α) %$ can be derived as well. The bootstrap method with Monte Carlo integration is easy to implement and straightforward to provide well results. Moreover, it provides small variance for the estimate of the integral of interest. Four examples with different functions and different domains are used to present the performance of the proposed method. From the study, we find that the method provides nearly identical results for the standard errors, regardless of the distributions used for generating observations for the bootstrap samples.

Keywords

Parametric bootstrap method

Monte Carlo integration

Percentile confidence interval

Normal confidence interval

Standard error

Simulation

Statistical model

Show Related Articles from PubMed

1 Introduction

In the real world applications, the standard or classical mathematical methods have been widely used to compute the exact integral of a function g defined on an interval $[a, b]$ . This can be achieved if the situation is simple; however, if the situation is complicated, the computation is quite hard or impossible in some cases. Therefore, the approximation methods can be good choices to compute the integrals for the complicated functions with minor errors. One of these approximation methods is the Monte Carlo integration method, which is described in many references, see e.g. (Kalos and Whitlock, 2009; Rizzo, 2019; Yang, 2014 ) for more details.

This approximation method for integrals is built with respect to the sampling distribution. It is crucial to choose a suitable distribution, which should be close to the function g, to generate random observations. The suitable distribution can lead to have a better approximation for the integral with small variance. Many references in the literature discussed the importance sampling or variance reduction, see e.g. (Hammersley and Morton, 1956; Oh and Berger, 1992; Rizzo, 2019; Rubinstein and Marcus, 1985; Tokdar and Kass, 2010; Van Dijk and Kloek, 1983 ) for detailed presentations.

Surely, the importance sampling is crucial for the Monte Carlo integration to derive a well approximation with small variance, but the importance sampling requires a long computational time and some information about the shape of g. These requirements motivate to use the bootstrap method with the Monte Carlo integration. Based on the bootstrap distribution, it is possible to derive well approximations for integrals with small standard errors. Moreover, the percentile and Normal confidence intervals of any integral can be derived with high accuracy.

This paper is organized as follows: Section 2 presents the Monte Carlo integration for one dimension along with some descriptions. In Section 3, the bootstrap method is described with explanations of computing the standard error and deriving the percentile and Normal confidence intervals for the integral of interest $θ$ . Section 4 presents the performance of bootstrap with Monte Carlo integration. The last section presents some concluding remarks.

2 Monte Carlo integration

The Monte Carlo method is a well-known concept used to derive the approximate integrals for complicated functions defined on certain domains. This method is built based on the probability theorem (Chung and Zhong, 2001; Jaynes, 2003; Loeve, 2017 ), where the integral of a function g can be computed by taking the expectation of function $g, E {g}$ . For univariate data, suppose that X is a random variable following a probability density function f. The expectation of $g (X)$ can be written mathematically as follows

(1)

E {g (X)} = \int_{D_{f}} g (x) f (x) dx

where

D_{f}

is the support of probability density function f.

Let $θ = \int_{a}^{b} g (x) dx$ and the random variables $X_{1}, X_{2}, \dots, X_{n}$ are independent and identically distributed from the Uniform distribution with parameters a and b. Further suppose that $x_{1}, x_{2}, \dots, x_{n}$ are the corresponding observations to the random variables $X_{1}, X_{2}, \dots, X_{n}$ . The estimated result of $θ$ can be found by

(2)

\hat{θ} = \frac{b - a}{n} \sum_{i = 1}^{n} g (x_{i})

It should be noted that the estimated value

\hat{θ}

converges to

E {g (X)} = \frac{θ}{b - a}

n \to \infty

with probability 1 based on the strong law of large numbers theorem, which is presented in Etemadi (1981); Fazekas and Klesov (2001). To make the Monte Carlo integration easy to follow and apply, we list its algorithm in three steps as follows:

Generate n observations from the Uniform distribution with parameters a and b.
Find $g (x_{i})$ for all $i = 1, 2, \dots, n$ .
Compute $\hat{θ} = \frac{b - a}{n} \sum_{i = 1}^{n} g (x_{i})$ .

The Uniform distribution can be replaced by any known distribution to generate the observations in the first step, but we choose the Uniform distribution here for simplicity in application.

3 The bootstrap method

The bootstrap method is a resampling concept proposed to measure the accuracy of a statistical estimate and to make inferences about unknown population parameters, e.g. mean, median, variance and confidence interval. In the literature, it has been widely used due to its simplicity to apply and efficiency to provide well estimates. With high accuracy, the bootstrap distribution for any statistic of interest can mimic the sampling distribution, where the sampling distribution is not always easy to obtain in real applications.

To implement the bootstrap method, parametric and nonparametric models are used to create multiple bootstrap samples, then the statistic of interest is computed based on each bootstrap sample. The empirical distribution of the results can be used as a proxy distribution for the sampling distribution and this allows making inferences about the statistic of interest. For more detailed presentation, it is beneficial to see the book of “An Introduction to The Bootstrap” by Efron and Tibshirani (1993) and the book of “Bootstrap Methods and Their Application” by Davison and Hinkley (1997).

To use the bootstrap method with Monte Carlo integration, we first need to introduce some notations. Suppose that the random quantities $X_{1}, X_{2}, \dots, X_{n}$ are independent and identically distributed following the probability distribution f and supported on $[a, b]$ . Let $x_{1}, x_{2}, \dots, x_{n}$ be the observations corresponding to these random quantities. Furthermore, let the integral of interest be $θ = \int_{a}^{b} g (x) dx$ . Now, it is easy to present the algorithm of the bootstrap method with Monte Carlo integration through the following steps.

Generate n observations from the probability distribution f.
Find $g (x_{i})$ for all $i = 1, 2, \dots, n$ .
Compute ${\hat{θ}}^{*} = \frac{b - a}{n} \sum_{i = 1}^{n} g (x_{i})$ .
Perform steps (1), (2) and (3) B times; this leads to ${\hat{θ}}^{* 1}, {\hat{θ}}^{* 2}, \dots, {\hat{θ}}^{* B}$ .

For better performance, the value of B is suggested to be large; e.g. $B = 1000$ (Al Luhayb, 2021; Al Luhayb et al., 2023; Efron, 1979, 1981 ). To derive a bootstrap estimate of $θ$ , we compute the average of ${\hat{θ}}^{* 1}, {\hat{θ}}^{* 2}, \dots, {\hat{θ}}^{* B}$ , and to provide a bootstrap standard error estimate for $\overline{{\hat{θ}}_{boot}^{*}} = \frac{1}{B} \sum_{j = 1}^{B} {\hat{θ}}^{* j}$ , we compute the standard deviation of ${\hat{θ}}^{* 1}, {\hat{θ}}^{* 2}, \dots, {\hat{θ}}^{* B}$ by Efron and Tibshirani (1986)

(3)

SE (\overline{{\hat{θ}}_{boot}^{*}}) = \sqrt{\frac{\sum_{j = 1}^{B} {({\hat{θ}}^{* j})}^{2} - \frac{{(\sum_{j = 1}^{B} {\hat{θ}}^{* j})}^{2}}{B}}{B - 1}}

For the $(1 - α) %$ percentile confidence interval, we order the values ${\hat{θ}}^{* 1}, {\hat{θ}}^{* 2}, \dots, {\hat{θ}}^{* B}$ from least to largest, then take the $(\frac{α}{2})$ th and $(1 - \frac{α}{2})$ th ordered values, where the former is the lower bound and the latest is the upper bound. This can be written as follows

(4)

θ \in ({\hat{θ}}_{(\frac{α}{2})}^{*}, {\hat{θ}}_{(1 - \frac{α}{2})}^{*})

For the $(1 - α) %$ Normal confidence interval, it is needed to compute $\overline{{\hat{θ}}_{boot}^{*}}$ and $SE (\overline{{\hat{θ}}_{boot}^{*}})$ , then we use the following equation (Hazra, 2017)

(5)

θ \in (\overline{{\hat{θ}}_{boot}^{*}} - Z_{(1 - α)} SE (\overline{{\hat{θ}}_{boot}^{*}}), \overline{{\hat{θ}}_{boot}^{*}} + Z_{(1 - α)} SE (\overline{{\hat{θ}}_{boot}^{*}}))

where

Z_{(1 - α)}

is the

(1 - α)

percentile of the standard Normal distribution.

4 Simulation studies

In this section, we present different bounded integrals needed to be computed analytically based on the bootstrap method with Monte Carlo integration. We choose different domains and functions as shown in Fig. 1. Also, we determine the Uniform and Normal distributions, restricted to the domain of the integral of interest, to generate observations for the bootstrap samples. To compute the bootstrap estimate of integral $θ$ with the standard error and the percentile and Normal confidence intervals, we set B equal to 1000 and the sample size of each bootstrap sample n equal to 1000. This means that the iteration of bootstrap is 1000 as $B = 1000$ . Table 1 presents different bounded integrals with their analytical results, and these integrals will be estimated based on the bootstrap method with Monte Carlo integration to make comparisons with the true results. By this strategy, we can make investigations on the performance of our method.

The shape of each function g ( x ) presented in Table 1.

Table 1 The exact values of different integrals.

Example	$θ$ =
1	$\int_{- 2}^{2} x^{2} dx = 5.333$
2	$\int_{0}^{\frac{π}{2}} - \sin (x) dx = - 1$
3	$\int_{0}^{5} 3 x^{5} + 2 x^{3} + 1 dx = 8130$
4	$\int_{0}^{\frac{π}{2}} x^{2} - \sin (x) dx = 0.292$

Tables 2–5 present the bootstrap estimates $\hat{θ}$ along with the standard errors of $\hat{θ}$ and the $90 %$ percentile and Normal confidence intervals of $θ$ for all examples presented in Table 1. It is obvious that the bootstrap estimates are nearly identical to the exact results. This is the power of using the bootstrap method with Monte Carlo integration, which is more needed for complicated integrals that is impossible to be computed theoretically. Through the bootstrap procedure, we use Uniform and Normal distributions with different parameters, but restricted to the integral’s bounds, and this leads to have nearly identical standard errors. From this observation, we can assure that the bootstrap distribution can be a good proxy distribution for the integral $θ$ ; there is no need to estimate the shape of $g (x)$ for importance sampling, or variance reduction. This helps to conserve time to running codes and it is possible to do analysis with less information. Also, based on the bootstrap method, the $90 %$ percentile and Normal confidence intervals can be easily derived. From Tables 2–5 , we note that the $90 %$ percentile confidence intervals are all wider than the $90 %$ Normal confidence intervals.

Table 2 The bootstrap estimate of

θ

in Example 1 along with the standard error and the percentile and Normal confidence intervals with confidence level

90 %

Bootstrap estimates	Uniform(-2, 2)	Normal(0, 5) (restricted on [-2, 2])
$\hat{θ}$	5.339	5.228
Standard Error	0.158	0.151
Percentile confidence interval	(5.093, 5.596)	(4.982, 5.463)
Normal confidence interval	(5.136, 5.541)	(5.036, 5.421)

Table 3 The bootstrap estimate of

θ

in Example 2 along with the standard error and the percentile and Normal confidence intervals with confidence level

90 %

Bootstrap estimates	Uniform(0, $\frac{π}{2}$ )	Normal(0,5) (restricted on [0, $\frac{π}{2}$ ])
$\hat{θ}$	−1.000	−0.993
Standard Error	0.015	0.015
Percentile confidence interval	(−1.026, −0.976)	(−1.020, −0.968)
Normal confidence interval	(−1.020, −0.981)	(−1.013, −0.973)

Table 4 The bootstrap estimate of

θ

in Example 3 along with the standard error and the percentile and Normal confidence intervals with confidence level

90 %

Bootstrap estimates	Uniform(0, 5)	Normal(3,5) (restricted on [0, 5])
$\hat{θ}$	8146.495	8155.398
Standard Error	385.488	387.729
Percentile confidence interval	(7514.391, 8783.074)	(7550.591, 8808.027)
Normal confidence interval	(7652.471, 8640.518)	(7658.503, 8652.292)

Table 5 The bootstrap estimate of

θ

in Example 4 along with the standard error and the percentile and Normal confidence intervals with confidence level

90 %

Bootstrap estimates	Uniform(0, $\frac{π}{2}$ )	Normal(3,5) (restricted on [0, $\frac{π}{2}$ ])
$\hat{θ}$	0.293	0.317
Standard Error	0.024	0.024
Percentile confidence interval	(0.254, 0.332)	(0.280, 0.355)
Normal confidence interval	(0.262, 0.323)	(0.286, 0.347)

5 Concluding remarks

In this paper, we illustrated the bootstrap method with Monte Carlo integration for one dimension, and the method was used through multiple examples. From the examples, it can be concluded that the bootstrap method with Monte Carlo integration is a good approach to compute well approximate integrals for different functions in different domains with small variances, regardless to the distribution being used to generate observations for the bootstrap samples. Based on the bootstrap estimates, the $(1 - α) %$ percentile and Normal confidence intervals for $θ$ can be derived with high accuracy. Our method can be beneficial for integrals that may be difficult or impossible to compute. Also, the method is easy to implement and straightforward, which is only relying on sampling from a known distribution, then taking the evaluation. This is repeated multiple times and this should be large, e.g. $B = 1000$ . To put the method into a practical use, we included the R codes in the appendix. To run the codes, it requires about ten seconds, which is nothing in real applications and this is one of the advantages.

As a future research, the method will be generalized for multiple dimensions with more complicated functions and domains. To achieve this generalization, we may use the copula concept, which is able to take the dependence structure between the variables into account, to generate observations for the bootstrap samples. For more detailed presentations about the copula concept, it is advised to see Coolen-Maturi et al. (2016); Muhammad (2016); Muhammad et al. (2016); Muhammad et al. (2018); Sklar (1959).

Acknowledgement

The researcher would like to thank the Deanship of Scientific Research, Qassim University for funding the publication of this project. Further thanks go to the reviewers for the valuable comments and suggestions.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

Al Luhayb A.S.M., . Smoothed bootstrap methods for right-censored data and bivariate data. Durham University; 2021. PhD thesis
Al Luhayb A.S.M., Coolen F.P.A., Coolen-Maturi T., . Smoothed bootstrap for right-censored data. Communications in Statistics - Theory and Methods. 2023;0(0):1-25.
[CrossRef] [Google Scholar]
Chung K.L., Zhong K., . A Course in Probability Theory. Academic Press; 2001.
Coolen-Maturi T., Coolen F.P.A., Muhammad N., . Predictive inference for bivariate data: Combining nonparametric predictive inference for marginals with an estimated copula. J. Stat. Theory Practice. 2016;10:515-538.
[Google Scholar]
Davison A.C., Hinkley D.V., . Bootstrap Methods and Their Application. New York, NY: Cambridge University Press; 1997.
Efron B., . Bootstrap methods: Another look at the jackknife. Ann. Stat.. 1979;7:1-26.
[Google Scholar]
Efron B., . Censored data and the bootstrap. J. Am. Stat. Assoc.. 1981;76:312-319.
[Google Scholar]
Efron B., Tibshirani R., . Bootstrap methods for standard errors, confidence intervals, and other measures of statistical accuracy. Stat. Sci.. 1986;1:54-77.
[Google Scholar]
Efron B., Tibshirani R.J., . An Introduction to The Bootstrap. Boca Raton, FL: Chapman & Hall; 1993.
Etemadi N., . An elementary proof of the strong law of large numbers. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete. 1981;55(1):119-122.
[Google Scholar]
Fazekas I., Klesov O., . A general approach to the strong law of large numbers. Theory Probab. Appl.. 2001;45(3):436-449.
[Google Scholar]
Hammersley J., Morton K., . A new monte carlo technique: antithetic variates. In: Mathematical Proceedings of the Cambridge Philosophical Society. Cambridge University Press; 1956. p. :449-475.
[Google Scholar]
Hazra A., . Using the confidence interval confidently. J. Thoracic Dis.. 2017;9(10):4125.
[Google Scholar]
Jaynes E.T., . Probability Theory: The Logic of Science. Cambridge University Press; 2003.
Kalos M.H., Whitlock P.A., . Monte Carlo Methods. John Wiley & Sons; 2009.
Loeve M., . Probability Theory. Courier Dover Publications; 2017.
Muhammad N., . Predictive inference with copulas for bivariate data. UK: Durham University; 2016. PhD thesis
Muhammad, N., Coolen, F.P.A., Coolen-Maturi, T., 2016. Predictive inference for bivariate data with nonparametric copula. In: The American Institute of Physics (AIP) Conference Proceedings, vol. 1750, pp. 0600,041–0600,048. https://doi.org/10.1063/1.4954609, https://aip.scitation.org/doi/abs/10.1063/1.4954609, https://arxiv.org/abs/https://aip.scitation.org/doi/pdf/10.1063/1.4954609.
Muhammad N., Coolen-Maturi T., Coolen F.P.A., . Nonparametric predictive inference with parametric copulas for combining bivariate diagnostic tests. Stati. Optim. Informat. Comput.. 2018;6:398-408.
[Google Scholar]
Oh M.S., Berger J.O., . Adaptive importance sampling in monte carlo integration. J. Stat. Comput. Simul.. 1992;41(3–4):143-168.
[Google Scholar]
Rizzo, M.L., 2019. Monte carlo integration and variance reduction. In: Statistical Computing with R. Chapman and Hall/CRC, pp. 147–182.
Rubinstein R.Y., Marcus R., . Efficiency of multivariate control variates in monte carlo simulation. Oper. Res.. 1985;33(3):661-677.
[Google Scholar]
Sklar A.W., . Fonctions de re’partitiona‘ n-dimensions et leurs marges. Publications de l’Institut de Statistique de l’Universite’ de Paris. 1959;8:229-231.
[Google Scholar]
Tokdar S.T., Kass R.E., . Importance sampling: a review. Wiley Interdiscip. Revi.: Comput. Stat.. 2010;2(1):54-60.
[Google Scholar]
Van Dijk, H.K., Kloek, T., 1983. Experiments with some alternatives for simple importance sampling in monte carlo integration. Tech. rep.
Yang X.S., . Introduction to Computational Mathematics. World Scientific Publishing Company; 2014.

Appendix A

Appendix B

Supplementary material

Supplementary data associated with this article can be found, in the online version, at https://doi.org/10.1016/j.jksus.2023.102768.

Appendix B

Supplementary material

The following are the Supplementary data to this article:

Supplementary data 1

Download

Introduction

Monte Carlo integration

The bootstrap method

Simulation studies

Concluding remarks

Fulltext Views
127

PDF downloads
83

View/Download PDF
Download Citations

BibTeX
RIS

Show Sections

[1] Al Luhayb A.S.M., . Smoothed bootstrap methods for right-censored data and bivariate data. Durham University; 2021. PhD thesis

[2] Al Luhayb A.S.M., Coolen F.P.A., Coolen-Maturi T., . Smoothed bootstrap for right-censored data. Communications in Statistics - Theory and Methods. 2023;0(0):1-25.
[CrossRef] [Google Scholar]

[3] Chung K.L., Zhong K., . A Course in Probability Theory. Academic Press; 2001.

[4] Coolen-Maturi T., Coolen F.P.A., Muhammad N., . Predictive inference for bivariate data: Combining nonparametric predictive inference for marginals with an estimated copula. J. Stat. Theory Practice. 2016;10:515-538.
[Google Scholar]

[5] Davison A.C., Hinkley D.V., . Bootstrap Methods and Their Application. New York, NY: Cambridge University Press; 1997.

[6] Efron B., . Bootstrap methods: Another look at the jackknife. Ann. Stat.. 1979;7:1-26.
[Google Scholar]

[7] Efron B., . Censored data and the bootstrap. J. Am. Stat. Assoc.. 1981;76:312-319.
[Google Scholar]

[8] Efron B., Tibshirani R., . Bootstrap methods for standard errors, confidence intervals, and other measures of statistical accuracy. Stat. Sci.. 1986;1:54-77.
[Google Scholar]

[9] Efron B., Tibshirani R.J., . An Introduction to The Bootstrap. Boca Raton, FL: Chapman & Hall; 1993.

[10] Etemadi N., . An elementary proof of the strong law of large numbers. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete. 1981;55(1):119-122.
[Google Scholar]

[11] Fazekas I., Klesov O., . A general approach to the strong law of large numbers. Theory Probab. Appl.. 2001;45(3):436-449.
[Google Scholar]

[12] Hammersley J., Morton K., . A new monte carlo technique: antithetic variates. In: Mathematical Proceedings of the Cambridge Philosophical Society. Cambridge University Press; 1956. p. :449-475.
[Google Scholar]

[13] Hazra A., . Using the confidence interval confidently. J. Thoracic Dis.. 2017;9(10):4125.
[Google Scholar]

[14] Jaynes E.T., . Probability Theory: The Logic of Science. Cambridge University Press; 2003.

[15] Kalos M.H., Whitlock P.A., . Monte Carlo Methods. John Wiley & Sons; 2009.

[16] Loeve M., . Probability Theory. Courier Dover Publications; 2017.

[17] Muhammad N., . Predictive inference with copulas for bivariate data. UK: Durham University; 2016. PhD thesis

[18] Muhammad, N., Coolen, F.P.A., Coolen-Maturi, T., 2016. Predictive inference for bivariate data with nonparametric copula. In: The American Institute of Physics (AIP) Conference Proceedings, vol. 1750, pp. 0600,041–0600,048. https://doi.org/10.1063/1.4954609, https://aip.scitation.org/doi/abs/10.1063/1.4954609, https://arxiv.org/abs/https://aip.scitation.org/doi/pdf/10.1063/1.4954609.

[19] Muhammad N., Coolen-Maturi T., Coolen F.P.A., . Nonparametric predictive inference with parametric copulas for combining bivariate diagnostic tests. Stati. Optim. Informat. Comput.. 2018;6:398-408.
[Google Scholar]

[20] Oh M.S., Berger J.O., . Adaptive importance sampling in monte carlo integration. J. Stat. Comput. Simul.. 1992;41(3–4):143-168.
[Google Scholar]

[21] Rizzo, M.L., 2019. Monte carlo integration and variance reduction. In: Statistical Computing with R. Chapman and Hall/CRC, pp. 147–182.

[22] Rubinstein R.Y., Marcus R., . Efficiency of multivariate control variates in monte carlo simulation. Oper. Res.. 1985;33(3):661-677.
[Google Scholar]

[23] Sklar A.W., . Fonctions de re’partitiona‘ n-dimensions et leurs marges. Publications de l’Institut de Statistique de l’Universite’ de Paris. 1959;8:229-231.
[Google Scholar]

[24] Tokdar S.T., Kass R.E., . Importance sampling: a review. Wiley Interdiscip. Revi.: Comput. Stat.. 2010;2(1):54-60.
[Google Scholar]

[25] Van Dijk, H.K., Kloek, T., 1983. Experiments with some alternatives for simple importance sampling in monte carlo integration. Tech. rep.

[26] Yang X.S., . Introduction to Computational Mathematics. World Scientific Publishing Company; 2014.

The bootstrap method for Monte Carlo integration inference

Abstract

Keywords

Parametric bootstrap method

Monte Carlo integration

Percentile confidence interval

Normal confidence interval

Standard error

Simulation

Statistical model

1 Introduction

2 Monte Carlo integration

3 The bootstrap method

4 Simulation studies

5 Concluding remarks

Acknowledgement

Declaration of Competing Interest

References

Appendix A

Appendix B

Supplementary material

Appendix B

Supplementary material

Suggested read for related articles: