Publications Search

We present results from the Bayesian calibration of hydrological parameters of the Community Land Model (CLM), which is often used in climate simulations and Earth system models. A statistical inverse problem is formulated for three hydrological parameters, conditional on observations of latent heat surface fluxes over 48 months. Our calibration method uses polynomial and Gaussian process surrogates of the CLM, and solves the parameter estimation problem using a Markov chain Monte Carlo sampler. Posterior probability densities for the parameters are developed for two sites with different soil and vegetation covers. Our method also allows us to examine the structural error in CLM under two error models. We find that surrogate models can be created for CLM in most cases. The posterior distributions are more predictive than the default parameter values in CLM. Climatologically averaging the observations does not modify the parameters' distributions significantly. The structural error model reveals a correlation time-scale which can be used to identify the physical process that could be contributing to it. While the calibrated CLM has a higher predictive skill, the calibration is under-dispersive.

More Details

TYPE SAND Report YEAR 2014

DOI OSTI

Finite Difference Stencils Robust to Silent Data Corruption

Ray, Jaideep; Mayo, Jackson R.; Armstrong, Robert C.

Abstract not provided.

More Details

TYPE Conference YEAR 2014

OSTI

Compressed Sensing: Hope or Hype?

Anderson, Hyrum A.; Ray, Jaideep; Helms, Jovana

Abstract not provided.

More Details

TYPE Presentation YEAR 2014

OSTI

A multiresolution spatial parameterization for the estimation of fossil-fuel carbon dioxide emissions via atmospheric inversions

Atmospheric Chemistry and Physics

Ray, Jaideep; Van Bloemen Waanders, Bart

Abstract not provided.

More Details

TYPE Journal Article YEAR 2013

OSTI

Bayesian calibration of a ??e turbulence model for predictive jet-in-crossflow simulations

Ray, Jaideep; Lefantzi, Sophia; Arunajatesan, Srinivasan; Dechant, Lawrence

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Kalman-filtered compressive sensing for high resolution estimation of anthropogenic greenhouse gas emissions from sparse measurements

Ray, Jaideep; Lee, Jina; Lefantzi, Sophia; Van Bloemen Waanders, Bart

The estimation of fossil-fuel CO2 emissions (ffCO2) from limited ground-based and satellite measurements of CO2 concentrations will form a key component of the monitoring of treaties aimed at the abatement of greenhouse gas emissions. The limited nature of the measured data leads to a severely-underdetermined estimation problem. If the estimation is performed at fine spatial resolutions, it can also be computationally expensive. In order to enable such estimations, advances are needed in the spatial representation of ffCO2 emissions, scalable inversion algorithms and the identification of observables to measure. To that end, we investigate parsimonious spatial parameterizations of ffCO2 emissions which can be used in atmospheric inversions. We devise and test three random field models, based on wavelets, Gaussian kernels and covariance structures derived from easily-observed proxies of human activity. In doing so, we constructed a novel inversion algorithm, based on compressive sensing and sparse reconstruction, to perform the estimation. We also address scalable ensemble Kalman filters as an inversion mechanism and quantify the impact of Gaussian assumptions inherent in them. We find that the assumption does not impact the estimates of mean ffCO2 source strengths appreciably, but a comparison with Markov chain Monte Carlo estimates show significant differences in the variance of the source strengths. Finally, we study if the very different spatial natures of biogenic and ffCO2 emissions can be used to estimate them, in a disaggregated fashion, solely from CO2 concentration measurements, without extra information from products of incomplete combustion e.g., CO. We find that this is possible during the winter months, though the errors can be as large as 50%.

More Details

TYPE SAND Report YEAR 2013

DOI OSTI

Tuning a RANS k-e model for jet-in-crossflow simulations

Ray, Jaideep; Arunajatesan, Srinivasan; Dechant, Lawrence

We develop a novel calibration approach to address the problem of predictive ke RANS simulations of jet-incrossflow. Our approach is based on the hypothesis that predictive ke parameters can be obtained by estimating them from a strongly vortical flow, specifically, flow over a square cylinder. In this study, we estimate three ke parameters, C%CE%BC, Ce2 and Ce1 by fitting 2D RANS simulations to experimental data. We use polynomial surrogates of 2D RANS for this purpose. We conduct an ensemble of 2D RANS runs using samples of (C%CE%BC;Ce2;Ce1) and regress Reynolds stresses to the samples using a simple polynomial. We then use this surrogate of the 2D RANS model to infer a joint distribution for the ke parameters by solving a Bayesian inverse problem, conditioned on the experimental data. The calibrated (C%CE%BC;Ce2;Ce1) distribution is used to seed an ensemble of 3D jet-in-crossflow simulations. We compare the ensemble's predictions of the flowfield, at two planes, to PIV measurements and estimate the predictive skill of the calibrated 3D RANS model. We also compare it against 3D RANS predictions using the nominal (uncalibrated) values of (C%CE%BC;Ce2;Ce1), and find that calibration delivers a significant improvement to the predictive skill of the 3D RANS model. We repeat the calibration using surrogate models based on kriging and find that the calibration, based on these more accurate models, is not much better that those obtained with simple polynomial surrogates. We discuss the reasons for this rather surprising outcome.

More Details

TYPE SAND Report YEAR 2013

DOI OSTI

A multiresolution spatial parametrization for the estimation of fossil-fuel carbon dioxide emissions via atmospheric inversions

Ray, Jaideep; Lee, Jina; Lefantzi, Sophia; Van Bloemen Waanders, Bart

The estimation of fossil-fuel CO2 emissions (ffCO2) from limited ground-based and satellite measurements of CO2 concentrations will form a key component of the monitoring of treaties aimed at the abatement of greenhouse gas emissions. To that end, we construct a multiresolution spatial parametrization for fossil-fuel CO2 emissions (ffCO2), to be used in atmospheric inversions. Such a parametrization does not currently exist. The parametrization uses wavelets to accurately capture the multiscale, nonstationary nature of ffCO2 emissions and employs proxies of human habitation, e.g., images of lights at night and maps of built-up areas to reduce the dimensionality of the multiresolution parametrization. The parametrization is used in a synthetic data inversion to test its suitability for use in atmospheric inverse problem. This linear inverse problem is predicated on observations of ffCO2 concentrations collected at measurement towers. We adapt a convex optimization technique, commonly used in the reconstruction of compressively sensed images, to perform sparse reconstruction of the time-variant ffCO2 emission field. We also borrow concepts from compressive sensing to impose boundary conditions i.e., to limit ffCO2 emissions within an irregularly shaped region (the United States, in our case). We find that the optimization algorithm performs a data-driven sparsification of the spatial parametrization and retains only of those wavelets whose weights could be estimated from the observations. Further, our method for the imposition of boundary conditions leads to a 10computational saving over conventional means of doing so. We conclude with a discussion of the accuracy of the estimated emissions and the suitability of the spatial parametrization for use in inverse problems with a significant degree of regularization.

More Details

TYPE SAND Report YEAR 2013

DOI OSTI

Coherence metric for optimal compressive sensing

Lee, Jina; Lefantzi, Sophia; Ray, Jaideep

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Nowcasting influenza outbreaks using open-source media report

Ray, Jaideep

We construct and verify a statistical method to nowcast influenza activity from a time-series of the frequency of reports concerning influenza related topics. Such reports are published electronically by both public health organizations as well as newspapers/media sources, and thus can be harvested easily via web crawlers. Since media reports are timely, whereas reports from public health organization are delayed by at least two weeks, using timely, open-source data to compensate for the lag in %E2%80%9Cofficial%E2%80%9D reports can be useful. We use morbidity data from networks of sentinel physicians (both the Center of Disease Control's ILINet and France's Sentinelles network) as the gold standard of influenza-like illness (ILI) activity. The time-series of media reports is obtained from HealthMap (http://healthmap.org). We find that the time-series of media reports shows some correlation ( 0.5) with ILI activity; further, this can be leveraged into an autoregressive moving average model with exogenous inputs (ARMAX model) to nowcast ILI activity. We find that the ARMAX models have more predictive skill compared to autoregressive (AR) models fitted to ILI data i.e., it is possible to exploit the information content in the open-source data. We also find that when the open-source data are non-informative, the ARMAX models reproduce the performance of AR models. The statistical models are tested on data from the 2009 swine-flu outbreak as well as the mild 2011-2012 influenza season in the U.S.A.

More Details

TYPE SAND Report YEAR 2013

DOI OSTI

Bayesian calibration of the Community Land Model using surrogates

Ray, Jaideep; Sargsyan, Khachik

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Assimilating HealthMap Data to Nowcast Epidemics

Ray, Jaideep

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

An approach for estimating the uncertainty in ParaDiS predictions

Ray, Jaideep; Najm, Habib N.

Abstract not provided.

More Details

TYPE SAND Report YEAR 2012

DOI OSTI

A stopping criterion for Markov chains when generating independent random graphs

Ray, Jaideep; Comandur, Seshadhri

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

Solving inverse problems with quantified uncertainty

Ray, Jaideep; Van Bloemen Waanders, Bart; Mckenna, Sean A.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

Estimation of Anthropogenic CO2 Emissions from Sparse Measurements

Ray, Jaideep; Van Bloemen Waanders, Bart; Mckenna, Sean A.

Abstract not provided.

More Details

TYPE Presentation YEAR 2012

OSTI

Decreasing the temporal complexity for

Proposed for publication in Computer methods in applied mechanics and engineering.

Van Bloemen Waanders, Bart; Ray, Jaideep

Abstract not provided.

More Details

TYPE Journal Article YEAR 2012

OSTI

Are we there yet? When to stop a Markov chain while generating random graphs

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Ray, Jaideep; Pinar, Ali; Comandur, Seshadhri

Markov chains are convenient means of generating realizations of networks with a given (joint or otherwise) degree distribution, since they simply require a procedure for rewiring edges. The major challenge is to find the right number of steps to run such a chain, so that we generate truly independent samples. Theoretical bounds for mixing times of these Markov chains are too large to be practically useful. Practitioners have no useful guide for choosing the length, and tend to pick numbers fairly arbitrarily. We give a principled mathematical argument showing that it suffices for the length to be proportional to the number of desired number of edges. We also prescribe a method for choosing this proportionality constant. We run a series of experiments showing that the distributions of common graph properties converge in this time, providing empirical evidence for our claims. © 2012 Springer-Verlag.

More Details

TYPE Conference YEAR 2012

Scopus OSTI

Spatial and temporal data fusion for biosurveillance

Ray, Jaideep; Safta, Cosmin

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Structural Models Used In Real-time Biosurveillance Outbreak Detection and Outbreak Curve Isolation from Noisy Background Morbidity Levels

Proposed for publication in Journal of the American Medical Informatics Association.

Ray, Jaideep; Safta, Cosmin

Abstract not provided.

More Details

TYPE Journal Article YEAR 2012

OSTI

Bayesian estimation of multiscale structures in a binary medium from sparse observations

Ray, Jaideep; Lefantzi, Sophia; Mckenna, Sean A.; Van Bloemen Waanders, Bart

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Estimating a thinning ratio for a Markov chain of graphs

Ray, Jaideep; Pinar, Ali P.; Comandur, Seshadhri

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Estimation of multiscale fields representing anthropogenic CO2 emissions from sparse observations

Ray, Jaideep; Van Bloemen Waanders, Bart; Mckenna, Sean A.

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Generating independent graphs with prescribed joint degree distribution using a Markov chain sampler

Ray, Jaideep; Pinar, Ali P.; Comandur, Seshadhri

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

BAYESIAN ESTIMATION OF MULTISCALE STRUCTURES IN A BINARY MEDIUM FROM SPARSE OBSERVATIONS

Ray, Jaideep; Lefantzi, Sophia; Mckenna, Sean A.; Van Bloemen Waanders, Bart

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Low dimensional models for the estimation of anthropogenic CO2 from atmospheric observations

Mckenna, Sean A.; Ray, Jaideep

Abstract not provided.

More Details

TYPE Presentation YEAR 2011

OSTI

Application of surrogate models to enable real-time characterization of epidemics

Ray, Jaideep; Safta, Cosmin; Lefantzi, Sophia

Abstract not provided.

More Details

TYPE Conference YEAR 2011

OSTI

Efficient uncertainty quantification methodologies for high-dimensional climate land models

Sargsyan, Khachik; Safta, Cosmin; Berry, Robert D.; Ray, Jaideep; Debusschere, Bert; Najm, Habib N.

In this report, we proposed, examined and implemented approaches for performing efficient uncertainty quantification (UQ) in climate land models. Specifically, we applied Bayesian compressive sensing framework to a polynomial chaos spectral expansions, enhanced it with an iterative algorithm of basis reduction, and investigated the results on test models as well as on the community land model (CLM). Furthermore, we discussed construction of efficient quadrature rules for forward propagation of uncertainties from high-dimensional, constrained input space to output quantities of interest. The work lays grounds for efficient forward UQ for high-dimensional, strongly non-linear and computationally costly climate models. Moreover, to investigate parameter inference approaches, we have applied two variants of the Markov chain Monte Carlo (MCMC) method to a soil moisture dynamics submodel of the CLM. The evaluation of these algorithms gave us a good foundation for further building out the Bayesian calibration framework towards the goal of robust component-wise calibration.

More Details

TYPE SAND Report YEAR 2011

DOI OSTI

A comparison of single and multi-chain methods for estimating parameters of the Community Land Model

Ray, Jaideep

Abstract not provided.

More Details

TYPE Conference YEAR 2011

OSTI

Bayesian inference of multiscale structures in porous media

Lefantzi, Sophia; Mckenna, Sean A.; Ray, Jaideep; Van Bloemen Waanders, Bart

Abstract not provided.

More Details

TYPE Conference YEAR 2011

OSTI

Bayesian data assimilation for stochastic multiscale models of transport in porous media

Lefantzi, Sophia; Klise, Katherine A.; Salazar, Luke; Mckenna, Sean A.; Van Bloemen Waanders, Bart; Ray, Jaideep

We investigate Bayesian techniques that can be used to reconstruct field variables from partial observations. In particular, we target fields that exhibit spatial structures with a large spectrum of lengthscales. Contemporary methods typically describe the field on a grid and estimate structures which can be resolved by it. In contrast, we address the reconstruction of grid-resolved structures as well as estimation of statistical summaries of subgrid structures, which are smaller than the grid resolution. We perform this in two different ways (a) via a physical (phenomenological), parameterized subgrid model that summarizes the impact of the unresolved scales at the coarse level and (b) via multiscale finite elements, where specially designed prolongation and restriction operators establish the interscale link between the same problem defined on a coarse and fine mesh. The estimation problem is posed as a Bayesian inverse problem. Dimensionality reduction is performed by projecting the field to be inferred on a suitable orthogonal basis set, viz. the Karhunen-Loeve expansion of a multiGaussian. We first demonstrate our techniques on the reconstruction of a binary medium consisting of a matrix with embedded inclusions, which are too small to be grid-resolved. The reconstruction is performed using an adaptive Markov chain Monte Carlo method. We find that the posterior distributions of the inferred parameters are approximately Gaussian. We exploit this finding to reconstruct a permeability field with long, but narrow embedded fractures (which are too fine to be grid-resolved) using scalable ensemble Kalman filters; this also allows us to address larger grids. Ensemble Kalman filtering is then used to estimate the values of hydraulic conductivity and specific yield in a model of the High Plains Aquifer in Kansas. Strong conditioning of the spatial structure of the parameters and the non-linear aspects of the water table aquifer create difficulty for the ensemble Kalman filter. We conclude with a demonstration of the use of multiscale stochastic finite elements to reconstruct permeability fields. This method, though computationally intensive, is general and can be used for multiscale inference in cases where a subgrid model cannot be constructed.

More Details

TYPE SAND Report YEAR 2011

DOI OSTI

Real-time Characterization of Partially Observed Epidemics using Surrogate Models

Mathematical Biosciences

Safta, Cosmin; Ray, Jaideep; Sargsyan, Khachik; Lefantzi, Sophia

Abstract not provided.

More Details

TYPE Journal Article YEAR 2011

OSTI

Deriving a model for influenza epidemics from historical data

Ray, Jaideep

In this report we describe how we create a model for influenza epidemics from historical data collected from both civilian and military societies. We derive the model when the population of the society is unknown but the size of the epidemic is known. Our interest lies in estimating a time-dependent infection rate to within a multiplicative constant. The model form fitted is chosen for its similarity to published models for HIV and plague, enabling application of Bayesian techniques to discriminate among infectious agents during an emerging epidemic. We have developed models for the progression of influenza in human populations. The model is framed as a integral, and predicts the number of people who exhibit symptoms and seek care over a given time-period. The start and end of the time period form the limits of integration. The disease progression model, in turn, contains parameterized models for the incubation period and a time-dependent infection rate. The incubation period model is obtained from literature, and the parameters of the infection rate are fitted from historical data including both military and civilian populations. The calibrated infection rate models display a marked difference in which the 1918 Spanish Influenza pandemic differed from the influenza seasons in the US between 2001-2008 and the progression of H1N1 in Catalunya, Spain. The data for the 1918 pandemic was obtained from military populations, while the rest are country-wide or province-wide data from the twenty-first century. We see that the initial growth of infection in all cases were about the same; however, military populations were able to control the epidemic much faster i.e., the decay of the infection-rate curve is much higher. It is not clear whether this was because of the much higher level of organization present in a military society or the seriousness with which the 1918 pandemic was addressed. Each outbreak to which the influenza model was fitted yields a separate set of parameter values. We suggest 'consensus' parameter values for military and civilian populations in the form of normal distributions so that they may be further used in other applications. Representing the parameter values as distributions, instead of point values, allows us to capture the uncertainty and scatter in the parameters. Quantifying the uncertainty allows us to use these models further in inverse problems, predictions under uncertainty and various other studies involving risk.

More Details

TYPE SAND Report YEAR 2011

DOI OSTI

Estimation of finescale conductivity fields from multiscale observations

Ray, Jaideep; Van Bloemen Waanders, Bart; Lefantzi, Sophia

Abstract not provided.

More Details

TYPE Conference YEAR 2011

OSTI

Real-time characterization of partially observed epidemics using surrogate models

Safta, Cosmin; Ray, Jaideep; Sargsyan, Khachik; Lefantzi, Sophia

We present a statistical method, predicated on the use of surrogate models, for the 'real-time' characterization of partially observed epidemics. Observations consist of counts of symptomatic patients, diagnosed with the disease, that may be available in the early epoch of an ongoing outbreak. Characterization, in this context, refers to estimation of epidemiological parameters that can be used to provide short-term forecasts of the ongoing epidemic, as well as to provide gross information on the dynamics of the etiologic agent in the affected population e.g., the time-dependent infection rate. The characterization problem is formulated as a Bayesian inverse problem, and epidemiological parameters are estimated as distributions using a Markov chain Monte Carlo (MCMC) method, thus quantifying the uncertainty in the estimates. In some cases, the inverse problem can be computationally expensive, primarily due to the epidemic simulator used inside the inversion algorithm. We present a method, based on replacing the epidemiological model with computationally inexpensive surrogates, that can reduce the computational time to minutes, without a significant loss of accuracy. The surrogates are created by projecting the output of an epidemiological model on a set of polynomial chaos bases; thereafter, computations involving the surrogate model reduce to evaluations of a polynomial. We find that the epidemic characterizations obtained with the surrogate models is very close to that obtained with the original model. We also find that the number of projections required to construct a surrogate model is O(10)-O(10{sup 2}) less than the number of samples required by the MCMC to construct a stationary posterior distribution; thus, depending upon the epidemiological models in question, it may be possible to omit the offline creation and caching of surrogate models, prior to their use in an inverse problem. The technique is demonstrated on synthetic data as well as observations from the 1918 influenza pandemic collected at Camp Custer, Michigan.

More Details

TYPE SAND Report YEAR 2011

DOI OSTI