Importance sampling for a robust and efficient multilevel Monte Carlo estimator for stochastic reaction networks
Overview
Bibliography:
Chiheb Ben Hammouda, Nadhir Ben Rached, and Raul Tempone. "Importance sampling for a robust and efficient multilevel Monte Carlo estimator for stochastic reaction networks." Statistics and Computing (2020): 1-25.
Authors:
Chiheb Ben Hammouda, Nadhir Ben Rached, Raul Tempone
Keywords:
Multilevel Monte Carlo. Continuous-time Markov chains. Stochastic reaction networks. Stochastic biological systems. Importance sampling
Year:
2020
Abstract:
The multilevel Monte Carlo (MLMC) method for continuous-time Markov chains, first introduced by Anderson and Higham (SIAM Multiscal Model. Simul. 10(1), 2012), is a highly efficient simulation technique that can be used to estimate various statistical quantities for stochastic reaction networks (SRNs), in particular for stochastic biological systems. Unfortunately, the robustness and performance of the multilevel method can be affected by the high kurtosis, a phenomenon observed at the deep levels of MLMC, which leads to inaccurate estimates of the sample variance. In this work, we address cases where the high-kurtosis phenomenon is due to catastrophic coupling (characteristic of pure jump processes where coupled consecutive paths are identical in most of the simulations, while differences only appear in a tiny proportion) and introduce a pathwise-dependent importance sampling (IS) technique that improves the robustness and efficiency of the multilevel method. Our theoretical results, along with the conducted numerical experiments, demonstrate that our proposed method significantly reduces the kurtosis of the deep levels of MLMC, and also improves the strong convergence rate from $\beta=1$ for the standard case (without IS), to $\beta=1+\delta$, where $0<\delta<1$ is a user-selected parameter in our IS algorithm. Due to the complexity theorem of MLMC, and given a pre-selected tolerance, $\text{TOL}$, this results in an improvement of the complexity from $\Ordo{\text{TOL}^{-2} \log(\text{TOL})^2}$ in the standard case to $\Ordo{\text{TOL}^{-2}}$, which is the optimal complexity of the MLMC estimator. We achieve all these improvements with a negligible additional cost since our IS algorithm is only applied a few times across each simulated path.