Publications Search

Are we there yet? When to stop a Markov chain while generating random graphs

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Ray, Jaideep; Pinar, Ali; Comandur, Seshadhri

Markov chains are convenient means of generating realizations of networks with a given (joint or otherwise) degree distribution, since they simply require a procedure for rewiring edges. The major challenge is to find the right number of steps to run such a chain, so that we generate truly independent samples. Theoretical bounds for mixing times of these Markov chains are too large to be practically useful. Practitioners have no useful guide for choosing the length, and tend to pick numbers fairly arbitrarily. We give a principled mathematical argument showing that it suffices for the length to be proportional to the number of desired number of edges. We also prescribe a method for choosing this proportionality constant. We run a series of experiments showing that the distributions of common graph properties converge in this time, providing empirical evidence for our claims. © 2012 Springer-Verlag.

More Details

TYPE Conference YEAR 2012

Scopus OSTI

Spatial and temporal data fusion for biosurveillance

Ray, Jaideep; Safta, Cosmin

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Structural Models Used In Real-time Biosurveillance Outbreak Detection and Outbreak Curve Isolation from Noisy Background Morbidity Levels

Proposed for publication in Journal of the American Medical Informatics Association.

Ray, Jaideep; Safta, Cosmin

Abstract not provided.

More Details

TYPE Journal Article YEAR 2012

OSTI

Bayesian estimation of multiscale structures in a binary medium from sparse observations

Ray, Jaideep; Lefantzi, Sophia; Mckenna, Sean A.; Van Bloemen Waanders, Bart

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Estimating a thinning ratio for a Markov chain of graphs

Ray, Jaideep; Pinar, Ali P.; Comandur, Seshadhri

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Estimation of multiscale fields representing anthropogenic CO2 emissions from sparse observations

Ray, Jaideep; Van Bloemen Waanders, Bart; Mckenna, Sean A.

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Generating independent graphs with prescribed joint degree distribution using a Markov chain sampler

Ray, Jaideep; Pinar, Ali P.; Comandur, Seshadhri

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

BAYESIAN ESTIMATION OF MULTISCALE STRUCTURES IN A BINARY MEDIUM FROM SPARSE OBSERVATIONS

Ray, Jaideep; Lefantzi, Sophia; Mckenna, Sean A.; Van Bloemen Waanders, Bart

Abstract not provided.

More Details

TYPE Conference YEAR 2012

OSTI

Low dimensional models for the estimation of anthropogenic CO2 from atmospheric observations

Mckenna, Sean A.; Ray, Jaideep

Abstract not provided.

More Details

TYPE Presentation YEAR 2011

OSTI

Application of surrogate models to enable real-time characterization of epidemics

Ray, Jaideep; Safta, Cosmin; Lefantzi, Sophia

Abstract not provided.

More Details

TYPE Conference YEAR 2011

OSTI

Efficient uncertainty quantification methodologies for high-dimensional climate land models

Sargsyan, Khachik; Safta, Cosmin; Berry, Robert D.; Ray, Jaideep; Debusschere, Bert; Najm, Habib N.

In this report, we proposed, examined and implemented approaches for performing efficient uncertainty quantification (UQ) in climate land models. Specifically, we applied Bayesian compressive sensing framework to a polynomial chaos spectral expansions, enhanced it with an iterative algorithm of basis reduction, and investigated the results on test models as well as on the community land model (CLM). Furthermore, we discussed construction of efficient quadrature rules for forward propagation of uncertainties from high-dimensional, constrained input space to output quantities of interest. The work lays grounds for efficient forward UQ for high-dimensional, strongly non-linear and computationally costly climate models. Moreover, to investigate parameter inference approaches, we have applied two variants of the Markov chain Monte Carlo (MCMC) method to a soil moisture dynamics submodel of the CLM. The evaluation of these algorithms gave us a good foundation for further building out the Bayesian calibration framework towards the goal of robust component-wise calibration.

More Details

TYPE SAND Report YEAR 2011

DOI OSTI

A comparison of single and multi-chain methods for estimating parameters of the Community Land Model

Ray, Jaideep

Abstract not provided.

More Details

TYPE Conference YEAR 2011

OSTI

Bayesian inference of multiscale structures in porous media

Lefantzi, Sophia; Mckenna, Sean A.; Ray, Jaideep; Van Bloemen Waanders, Bart

Abstract not provided.

More Details

TYPE Conference YEAR 2011

OSTI

Bayesian data assimilation for stochastic multiscale models of transport in porous media

Lefantzi, Sophia; Klise, Katherine A.; Salazar, Luke; Mckenna, Sean A.; Van Bloemen Waanders, Bart; Ray, Jaideep

We investigate Bayesian techniques that can be used to reconstruct field variables from partial observations. In particular, we target fields that exhibit spatial structures with a large spectrum of lengthscales. Contemporary methods typically describe the field on a grid and estimate structures which can be resolved by it. In contrast, we address the reconstruction of grid-resolved structures as well as estimation of statistical summaries of subgrid structures, which are smaller than the grid resolution. We perform this in two different ways (a) via a physical (phenomenological), parameterized subgrid model that summarizes the impact of the unresolved scales at the coarse level and (b) via multiscale finite elements, where specially designed prolongation and restriction operators establish the interscale link between the same problem defined on a coarse and fine mesh. The estimation problem is posed as a Bayesian inverse problem. Dimensionality reduction is performed by projecting the field to be inferred on a suitable orthogonal basis set, viz. the Karhunen-Loeve expansion of a multiGaussian. We first demonstrate our techniques on the reconstruction of a binary medium consisting of a matrix with embedded inclusions, which are too small to be grid-resolved. The reconstruction is performed using an adaptive Markov chain Monte Carlo method. We find that the posterior distributions of the inferred parameters are approximately Gaussian. We exploit this finding to reconstruct a permeability field with long, but narrow embedded fractures (which are too fine to be grid-resolved) using scalable ensemble Kalman filters; this also allows us to address larger grids. Ensemble Kalman filtering is then used to estimate the values of hydraulic conductivity and specific yield in a model of the High Plains Aquifer in Kansas. Strong conditioning of the spatial structure of the parameters and the non-linear aspects of the water table aquifer create difficulty for the ensemble Kalman filter. We conclude with a demonstration of the use of multiscale stochastic finite elements to reconstruct permeability fields. This method, though computationally intensive, is general and can be used for multiscale inference in cases where a subgrid model cannot be constructed.

More Details

TYPE SAND Report YEAR 2011

DOI OSTI

Real-time Characterization of Partially Observed Epidemics using Surrogate Models

Mathematical Biosciences

Safta, Cosmin; Ray, Jaideep; Sargsyan, Khachik; Lefantzi, Sophia

Abstract not provided.

More Details

TYPE Journal Article YEAR 2011

OSTI

Deriving a model for influenza epidemics from historical data

Ray, Jaideep

In this report we describe how we create a model for influenza epidemics from historical data collected from both civilian and military societies. We derive the model when the population of the society is unknown but the size of the epidemic is known. Our interest lies in estimating a time-dependent infection rate to within a multiplicative constant. The model form fitted is chosen for its similarity to published models for HIV and plague, enabling application of Bayesian techniques to discriminate among infectious agents during an emerging epidemic. We have developed models for the progression of influenza in human populations. The model is framed as a integral, and predicts the number of people who exhibit symptoms and seek care over a given time-period. The start and end of the time period form the limits of integration. The disease progression model, in turn, contains parameterized models for the incubation period and a time-dependent infection rate. The incubation period model is obtained from literature, and the parameters of the infection rate are fitted from historical data including both military and civilian populations. The calibrated infection rate models display a marked difference in which the 1918 Spanish Influenza pandemic differed from the influenza seasons in the US between 2001-2008 and the progression of H1N1 in Catalunya, Spain. The data for the 1918 pandemic was obtained from military populations, while the rest are country-wide or province-wide data from the twenty-first century. We see that the initial growth of infection in all cases were about the same; however, military populations were able to control the epidemic much faster i.e., the decay of the infection-rate curve is much higher. It is not clear whether this was because of the much higher level of organization present in a military society or the seriousness with which the 1918 pandemic was addressed. Each outbreak to which the influenza model was fitted yields a separate set of parameter values. We suggest 'consensus' parameter values for military and civilian populations in the form of normal distributions so that they may be further used in other applications. Representing the parameter values as distributions, instead of point values, allows us to capture the uncertainty and scatter in the parameters. Quantifying the uncertainty allows us to use these models further in inverse problems, predictions under uncertainty and various other studies involving risk.

More Details

TYPE SAND Report YEAR 2011

DOI OSTI

Estimation of finescale conductivity fields from multiscale observations

Ray, Jaideep; Van Bloemen Waanders, Bart; Lefantzi, Sophia

Abstract not provided.

More Details

TYPE Conference YEAR 2011

OSTI

Real-time characterization of partially observed epidemics using surrogate models

Safta, Cosmin; Ray, Jaideep; Sargsyan, Khachik; Lefantzi, Sophia

We present a statistical method, predicated on the use of surrogate models, for the 'real-time' characterization of partially observed epidemics. Observations consist of counts of symptomatic patients, diagnosed with the disease, that may be available in the early epoch of an ongoing outbreak. Characterization, in this context, refers to estimation of epidemiological parameters that can be used to provide short-term forecasts of the ongoing epidemic, as well as to provide gross information on the dynamics of the etiologic agent in the affected population e.g., the time-dependent infection rate. The characterization problem is formulated as a Bayesian inverse problem, and epidemiological parameters are estimated as distributions using a Markov chain Monte Carlo (MCMC) method, thus quantifying the uncertainty in the estimates. In some cases, the inverse problem can be computationally expensive, primarily due to the epidemic simulator used inside the inversion algorithm. We present a method, based on replacing the epidemiological model with computationally inexpensive surrogates, that can reduce the computational time to minutes, without a significant loss of accuracy. The surrogates are created by projecting the output of an epidemiological model on a set of polynomial chaos bases; thereafter, computations involving the surrogate model reduce to evaluations of a polynomial. We find that the epidemic characterizations obtained with the surrogate models is very close to that obtained with the original model. We also find that the number of projections required to construct a surrogate model is O(10)-O(10{sup 2}) less than the number of samples required by the MCMC to construct a stationary posterior distribution; thus, depending upon the epidemiological models in question, it may be possible to omit the offline creation and caching of surrogate models, prior to their use in an inverse problem. The technique is demonstrated on synthetic data as well as observations from the 1918 influenza pandemic collected at Camp Custer, Michigan.

More Details

TYPE SAND Report YEAR 2011

DOI OSTI