Non-conformity Scores for High-Quality Uncertainty Quantification from Conformal Prediction

Adams, Jason R.; Berman, Brandon; Michalenko, Joshua J.; Deka, Rina

High-quality uncertainty quantification (UQ) is a critical component of enabling trust in deep learning (DL) models and is especially important if DL models are to be deployed in high-consequence applications. Conformal prediction (CP) methods represent an emerging nonparametric approach for producing UQ that is easily interpretable and, under weak assumptions, provides a guarantee regarding UQ quality. This report describes the research outputs of an Exploratory Express Laboratory Directed Research and Development (LDRD) project at Sandia National Laboratories. This project focused on how best to implement CP methods for DL models. This report introduces new methodology for obtaining high-quality UQ from DL models using CP methods, describes a novel system of assessing UQ quality, and provides experimental results that demonstrate the quality of the new methodology and utility of the UQ quality assessment system. Avenues for future research and discussion of potential impacts at Sandia and in the wider research community are also given.
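For readers unfamiliar with CP, the sketch below illustrates the general idea using standard split conformal prediction for regression, with absolute residuals as the non-conformity score. It is not the methodology introduced in the report. The function name `split_conformal_interval` and its arguments are hypothetical and chosen only for illustration.

```python
import math
import numpy as np

def split_conformal_interval(cal_pred, cal_true, test_pred, alpha=0.1):
    """Illustrative split conformal intervals (assumed setup, not the report's method).

    cal_pred, cal_true: model predictions and true targets on a held-out calibration set.
    test_pred: point predictions for new inputs.
    alpha: target miscoverage rate; intervals aim for at least 1 - alpha coverage
           when calibration and test points are exchangeable.
    """
    cal_pred = np.asarray(cal_pred, dtype=float)
    cal_true = np.asarray(cal_true, dtype=float)
    test_pred = np.asarray(test_pred, dtype=float)

    # Non-conformity score: absolute residual on the calibration set.
    scores = np.sort(np.abs(cal_true - cal_pred))
    n = scores.size

    # Finite-sample corrected rank: the ceil((n + 1) * (1 - alpha))-th smallest score.
    k = math.ceil((n + 1) * (1 - alpha))
    # If the rank exceeds n, the calibration set is too small and the interval is unbounded.
    q = scores[k - 1] if k <= n else np.inf

    return test_pred - q, test_pred + q
```

The choice of non-conformity score determines how wide and how adaptive the intervals are; the absolute residual used here yields intervals of constant width.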

TYPE LDRD Report YEAR 2024

DOI OSTI

Evaluating the Quality of Uncertainty Estimates Produced by Bayesian Neural Networks

Adams, Jason R.

Abstract not provided.

TYPE Presentation YEAR 2024

DOI OSTI

Recent Advances in Functional Data Analysis for Electronic Devices

Adams, Jason R.; Berman, Brandon; Buchheit, Thomas E.; Llosa-Vite, Carlos; Reza, Shahed

Abstract not provided.

TYPE Conference Presentation YEAR 2024

DOI OSTI

Recent Advances in Functional Data Analysis for Electronic Device Data

IEEE Electron Devices Technology and Manufacturing Conference: Strengthening the Globalization in Semiconductors, EDTM 2024

Adams, Jason R.; Berman, Brandon; Buchheit, Thomas E.; Llosa-Vite, Carlos; Reza, Shahed

Accurate understanding of the behavior of commercial-off-the-shelf electrical devices is important in many applications. This paper discusses methods for the principled statistical analysis of electrical device data. We present several recent successful efforts and describe two current areas of research that we anticipate will produce widely applicable methods. Because much electrical device data is naturally treated as functional, and because such data introduces some complications in analysis, we focus on methods for functional data analysis.

TYPE Conference Paper YEAR 2024

DOI OSTI Scopus

Improving and Assessing the Quality of Uncertainty Quantification in Deep Learning

Adams, Jason R.; Baiyasi, Rashad; Berman, Brandon; Darling, Michael C.; Ganter, Tyler; Michalenko, Joshua J.; Patel, Lekha; Ries, Daniel; Liang, Feng; Qian, Christopher; Roy, Krishna

Deep learning (DL) models have enjoyed increased attention in recent years because of their powerful predictive capabilities. While many successes have been achieved, standard deep learning methods suffer from a lack of uncertainty quantification (UQ). Although the development of methods for producing UQ from DL models is an active area of current research, little attention has been given to the quality of the UQ produced by such methods. In order to deploy DL models to high-consequence applications, high-quality UQ is necessary. This report details the research and development conducted as part of a Laboratory Directed Research and Development (LDRD) project at Sandia National Laboratories. The focus of this project is to develop a framework of methods and metrics for the principled assessment of UQ quality in DL models. This report presents an overview of UQ quality assessment in traditional statistical modeling and describes why this approach is difficult to apply in DL contexts. An assessment on relatively simple simulated data is presented to demonstrate that UQ quality can differ greatly between DL models trained on the same data. A method for simulating image data that can then be used for UQ quality assessment is described. A general method for simulating realistic data for the purpose of assessing a model’s UQ quality is also presented. A Bayesian uncertainty framework for understanding uncertainty and existing metrics is described. Research that came out of collaborations with two university partners is discussed, along with a software toolkit that is being developed to implement the UQ quality assessment framework and to serve as a general guide to incorporating UQ into DL applications.

TYPE LDRD Report YEAR 2023

DOI OSTI

A Statistical Assessment of Zener Diode Behavior Using Functional Data Analysis

Champon, Xiaoxia; Angeles, David; Buchheit, Thomas E.; Canfield, David P.; Tucker, J.D.; Adams, Jason R.

Abstract not provided.

TYPE Conference Poster YEAR 2023

DOI OSTI

Evaluation and Calibration of Epistemic Uncertainty

Qian, Christopher; Ganter, Tyler; Michalenko, Joshua J.; Liang, Feng; Adams, Jason R.

Abstract not provided.

TYPE Conference Poster YEAR 2023

DOI OSTI

A Statistical Assessment of Zener Diode Behavior Using Functional Data Analysis

7th IEEE Electron Devices Technology and Manufacturing Conference: Strengthen the Global Semiconductor Research Collaboration After the Covid-19 Pandemic, EDTM 2023

Champon, Xiaoxia; Angeles, David; Buchheit, Thomas E.; Canfield, David P.; Tucker, J.D.; Adams, Jason R.

This paper presents an assessment of electrical device measurements using functional data analysis (FDA) on a test case of Zener diode devices. We employ three techniques from FDA to quantify the variability in device behavior, primarily due to production lot, and demonstrate that this has a significant effect in our data set. We also argue for the expanded use of FDA methods in providing principled, quantitative analysis of electrical device data.

TYPE Conference Paper YEAR 2023

DOI OSTI Scopus

Mean Estimation and Nominal Device Selection with the Pairwise Midpoint Method

7th IEEE Electron Devices Technology and Manufacturing Conference: Strengthen the Global Semiconductor Research Collaboration After the Covid-19 Pandemic, EDTM 2023

Adams, Jason R.; Buchheit, Thomas E.; Al Mamun Mazumder, Abdullah; Moghal, Biazid K.; Fazle Rabbe, Fazle; Islam, Ahsanul; Reza, Shahed

Accurate characterization of electrical device behavior is a key component of developing accurate electrical models and assessing reliability. Measurements characterizing an electrical device can be produced from current-voltage (I-V) sweeps. We introduce the pairwise midpoint method (PMM) for estimating the mean of a functional data set and apply it to I-V sweeps from a Zener diode. Comparisons indicate that the PMM is a viable method for describing the mean behavior of a functional data set.

TYPE Conference Poster YEAR 2023

DOI OSTI Scopus

Mean Estimation and Nominal Device Selection with the Pairwise Midpoint Method

Adams, Jason R.; Buchheit, Thomas E.; Al Mazumder, Abdullahmamun; Moghal, Baizid K.; Rabbe, Md F.; Islam, Ahsanul; Reza, Shahed

Abstract not provided.

TYPE Conference Paper YEAR 2022

DOI OSTI

Supplementary Material for: The Evaluation and Calibration of Epistemic Uncertainty Estimates

Qian, Christopher; Ganter, Tyler; Michalenko, Joshua J.; Liang, Feng; Adams, Jason R.

Abstract not provided.

TYPE Conference Paper YEAR 2022

OSTI

The Evaluation and Calibration of Epistemic Uncertainty Estimates

Qian, Christopher; Ganter, Tyler; Michalenko, Joshua J.; Liang, Feng; Adams, Jason R.

Abstract not provided.

TYPE Conference Paper YEAR 2022

OSTI

Evaluation and Calibration of Epistemic Uncertainty Estimates

Qian, Christopher; Adams, Jason R.; Liang, Feng

Abstract not provided.

TYPE Conference Poster YEAR 2022

DOI OSTI

Assessing the Quality of Uncertainty Estimates in Deep Learning

Adams, Jason R.; Baiyasi, Rashad; Ganter, Tyler; Michalenko, Joshua J.; Ries, Daniel

Abstract not provided.

TYPE Conference Presentation YEAR 2022

DOI OSTI

Evaluating the quality of uncertainty quantification enabled deep learning models

Ries, Daniel; Adams, Jason R.; Ganter, Tyler; Michalenko, Joshua J.

Abstract not provided.

TYPE Conference Paper YEAR 2022

OSTI

Comparing the quality of neural network uncertainty estimates for classification problems

Proceedings - 21st IEEE International Conference on Machine Learning and Applications, ICMLA 2022

Ries, Daniel; Michalenko, Joshua J.; Ganter, Tyler; Baiyasi, Rashad; Adams, Jason R.

Traditional deep learning (DL) models are powerful classifiers, but many approaches do not provide uncertainties for their estimates. Uncertainty quantification (UQ) methods for DL models have received increased attention in the literature due to their usefulness in decision making, particularly for high-consequence decisions. However, there has been little research done on how to evaluate the quality of such methods. We use statistical methods of frequentist interval coverage and interval width to evaluate the quality of credible intervals, and expected calibration error to evaluate classification predicted confidence. These metrics are evaluated on Bayesian neural networks (BNN) fit using Markov Chain Monte Carlo (MCMC) and variational inference (VI), bootstrapped neural networks (NN), Deep Ensembles (DE), and Monte Carlo (MC) dropout. We apply these different UQ for DL methods to a hyperspectral image target detection problem and show the inconsistency of the different methods' results and the necessity of a UQ quality metric. To reconcile these differences and choose a UQ method that appropriately quantifies the uncertainty, we create a simulated data set with a fully parameterized probability distribution for a two-class classification problem. The gold standard MCMC performs the best overall, and the bootstrapped NN is a close second, requiring the same computational expense as DE. Through this comparison, we demonstrate that, for a given data set, different models can produce uncertainty estimates of markedly different quality. This in turn points to a great need for principled assessment methods of UQ quality in DL applications.
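The three metrics named in this abstract can be sketched as follows, using their common textbook definitions. This is an illustrative assumption, not the paper's code: the function names, the equal-width binning, and the default of 10 bins are all hypothetical choices.

```python
import numpy as np

def interval_coverage(lower, upper, truth):
    """Fraction of true values that fall inside their [lower, upper] intervals."""
    lower, upper, truth = (np.asarray(a, dtype=float) for a in (lower, upper, truth))
    return float(np.mean((truth >= lower) & (truth <= upper)))

def mean_interval_width(lower, upper):
    """Average interval width; narrower is better at equal coverage."""
    return float(np.mean(np.asarray(upper, dtype=float) - np.asarray(lower, dtype=float)))

def expected_calibration_error(confidence, correct, n_bins=10):
    """Expected calibration error with equal-width confidence bins (assumed binning).

    confidence: predicted confidence of the chosen class, in [0, 1].
    correct: 1 if the prediction was right, 0 otherwise.
    """
    confidence = np.asarray(confidence, dtype=float)
    correct = np.asarray(correct, dtype=float)

    # Interior bin edges; right=True places a value equal to an edge in the lower bin.
    interior_edges = np.linspace(0.0, 1.0, n_bins + 1)[1:-1]
    bin_index = np.digitize(confidence, interior_edges, right=True)

    ece = 0.0
    for b in range(n_bins):
        in_bin = bin_index == b
        if in_bin.any():
            # Gap between accuracy and mean confidence, weighted by the bin's share of samples.
            gap = abs(correct[in_bin].mean() - confidence[in_bin].mean())
            ece += in_bin.mean() * gap
    return float(ece)
```

Coverage close to the nominal level combined with a small mean width indicates useful intervals, while an expected calibration error near zero indicates that predicted confidence tracks observed accuracy.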

TYPE Conference Presentation YEAR 2022

DOI OSTI Scopus

Target Detection on Hyperspectral Images Using MCMC and VI Trained Bayesian Neural Networks

IEEE Aerospace Conference Proceedings

Ries, Daniel; Adams, Jason R.; Zollweg, Joshua

Neural networks (NN) have become almost ubiquitous in image classification, but in their standard form produce point estimates, with no measure of confidence. Bayesian neural networks (BNN) provide uncertainty quantification (UQ) for NN predictions and estimates through the posterior distribution. As NN are applied in more high-consequence applications, UQ is becoming a requirement. Automating systems can save time and money, but only if the operator can trust what the system outputs. BNN provide a solution to this problem by not only giving accurate predictions and estimates, but also an interval that includes reasonable values within a desired probability. Despite their positive attributes, BNN are notoriously difficult and time-consuming to train. Traditional Bayesian methods use Markov Chain Monte Carlo (MCMC), but this is often brushed aside as being too slow. The most common method is variational inference (VI) due to its fast computation, but there are multiple concerns with its efficacy. MCMC is the gold standard and, given enough time, will produce the correct result. VI, alternatively, is an approximation that converges asymptotically. Unfortunately (or fortunately), high-consequence problems often do not live in the land of asymptopia, so solutions like MCMC are preferable to approximations. We apply and compare MCMC- and VI-trained BNN in the context of target detection in hyperspectral imagery (HSI), where materials of interest can be identified by their unique spectral signature. This is a challenging field, due to the numerous permuting effects practical collection of HSI has on measured spectra. Both models are trained using out-of-the-box tools on a high-fidelity HSI target detection scene. Both MCMC- and VI-trained BNN perform well overall at target detection on a simulated HSI scene. Splitting the test set predictions into two classes, high-confidence and low-confidence predictions, presents a path to automation. For the MCMC-trained BNN, the high-confidence predictions have a 0.95 probability of detection with a false alarm rate of 0.05 when considering pixels with target abundance of 0.2. VI-trained BNN have a 0.25 probability of detection for the same, but their performance on high-confidence sets matched MCMC for abundances >0.4. However, the VI-trained BNN on this scene required significant expert tuning to get these results while MCMC worked immediately. On neither scene was MCMC prohibitively time-consuming, as is often assumed, but the networks we used were relatively small. This paper provides an example of how to utilize the benefits of UQ and also aims to increase awareness that different training methods can give different results for the same model. If sufficient computational resources are available, the best approach rather than the fastest or most efficient should be used, especially for high-consequence problems.

TYPE Conference Presentation YEAR 2022

DOI OSTI Scopus

Target Detection on Hyperspectral Images Using MCMC and VI Trained Bayesian Neural Networks

Ries, Daniel; Adams, Jason R.; Zollweg, Joshua

Abstract not provided.

TYPE Conference Paper YEAR 2021

DOI OSTI

Semi-supervised Bayesian Low-shot Learning

Adams, Jason R.; Goode, Katherine; Michalenko, Joshua J.; Lewis, Phillip; Ries, Daniel

Deep neural networks (NNs) typically outperform traditional machine learning (ML) approaches for complicated, non-linear tasks. It is expected that deep learning (DL) should offer superior performance for the important non-proliferation task of predicting explosive device configuration based upon observed optical signature, a task with which human experts struggle. However, supervised machine learning is difficult to apply in this mission space because most recorded signatures are not associated with the corresponding device description, or “truth labels.” This is challenging for NNs, which traditionally require many samples for strong performance. Semi-supervised learning (SSL), low-shot learning (LSL), and uncertainty quantification (UQ) for NNs are emerging approaches that could bridge the mission gaps of few labels and rare samples of importance. NN explainability techniques are important in gaining insight into the inferential feature importance of such a complex model. In this work, SSL, LSL, and UQ are merged into a single framework, a significant technical hurdle not previously demonstrated. Exponential Average Adversarial Training (EAAT) and Pairwise Neural Networks (PNNs) are chosen as the SSL and LSL methods, respectively. Permutation feature importance (PFI) for functional data is used to provide explainability via the Variable importance Explainable Elastic Shape Analysis (VEESA) pipeline. A variety of uncertainty quantification approaches are explored: Bayesian Neural Networks (BNNs), ensemble methods, concrete dropout, and evidential deep learning. Two final approaches, one utilizing ensemble methods and one utilizing evidential learning, are constructed and compared using a well-quantified synthetic 2D dataset along with the DIRSIG Megascene.

TYPE SAND Report YEAR 2021

DOI OSTI

Characterizing Electrical Device Behavior with Functional Tolerance Bounds

Adams, Jason R.; Tucker, J.D.; Angeles, David; Perez, Tabytha; Buchheit, Thomas E.; Reza, Shahed

Abstract not provided.

TYPE Conference Presentation YEAR 2021

DOI OSTI
