Publications Search

Geospatial-Temporal Semantic Graph Evaluation for Induced Seismicity Analysis

We assess how geospatial-temporal semantic graphs and our GeoGraphy code implementation might contribute to induced seismicity analysis. We focus on evaluating strengths and weaknesses of both 1) the fundamental concept of semantic graphs and 2) our current code implementation. With extensions and research effort, code implementation limitations can be overcome. The paper also describes the approach's relevance, including possible data input types, expected analytical outcomes, and how it can pair with other approaches and fit into a workflow.

TYPE SAND Report YEAR 2016

Time series discord detection in medical data using a parallel relational database

Proceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015

Woodbridge, Diane M.; Wilson, Andrew T.; Foulk, James W.; Goldstein, Richard H.

Recent advances in sensor technology have made continuous real-time health monitoring available in both hospital and non-hospital settings. Because high-frequency medical sensors generate very large volumes of data, storing and processing continuous medical data is an emerging big data area. In particular, detecting anomalies in real time is important for detecting and preventing patient emergencies. A time series discord is the subsequence that differs most from the rest of the time series subsequences, meaning that it has abnormal or unusual data trends. In this study, we implemented two versions of time series discord detection algorithms on a high-performance parallel database management system (DBMS) and applied them to 240 Hz waveform data collected from 9,723 patients. The initial brute-force version of the discord detection algorithm takes each possible subsequence and calculates its distance to the nearest non-self match to find the biggest discords in the time series. The heuristic version of the algorithm uses a combination of an array and a trie structure to order the time series data and improve time efficiency. The study results showed efficient data loading, decoding, and discord searches over a large amount of data, benefiting from the time series discord detection algorithm and the architectural characteristics of the parallel DBMS, including data compression, data pipelining, and task scheduling.
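The brute-force search described in this abstract can be illustrated with a minimal in-memory sketch. It is not the authors' parallel DBMS implementation: the function names, the plain Euclidean distance, and the rule that a non-self match is any subsequence that does not overlap the candidate are all assumptions made for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def brute_force_discord(series, m):
    """Return (start_index, distance) of the top discord of length m.

    For each subsequence, find the distance to its nearest non-self match
    (assumed here to be any subsequence that does not overlap it). The
    discord is the subsequence whose nearest non-self match is farthest away.
    """
    n = len(series) - m + 1  # number of subsequences of length m
    best_index, best_dist = -1, -1.0
    for p in range(n):
        nearest = math.inf
        for q in range(n):
            if abs(p - q) >= m:  # skip overlapping (self) matches
                d = euclidean(series[p:p + m], series[q:q + m])
                nearest = min(nearest, d)
        # Keep the subsequence with the largest nearest-neighbor distance.
        if nearest != math.inf and nearest > best_dist:
            best_index, best_dist = p, nearest
    return best_index, best_dist


# Hypothetical toy series: a repeating 0/1 pattern with one spike at index 9.
series = [0, 1] * 4 + [0, 9] + [0, 1] * 4
print(brute_force_discord(series, 2))  # (8, 8.0)
```

The nested loop makes the cost quadratic in the number of subsequences, which is why the abstract pairs this version with a heuristic ordering of the data.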

TYPE Conference Poster YEAR 2015

Computing quality scores and uncertainty for approximate pattern matching in geospatial semantic graphs

Statistical Analysis and Data Mining

Stracuzzi, David J.; Brost, Randolph; Phillips, Cynthia A.; Robinson, David G.; Wilson, Alyson G.; Woodbridge, Diane M.

Geospatial semantic graphs provide a robust foundation for representing and analyzing remote sensor data. In particular, they support a variety of pattern search operations that capture the spatial and temporal relationships among the objects and events in the data. However, in the presence of large data corpora, even a carefully constructed search query may return a large number of unintended matches. This work considers the problem of calculating a quality score for each match to the query, given that the underlying data are uncertain. We present a preliminary evaluation of three methods for determining both match quality scores and associated uncertainty bounds, illustrated in the context of an example based on overhead imagery data.

TYPE Journal Article YEAR 2015

Time Series Discord Detection in Medical Data using a Parallel Relational Database

Woodbridge, Diane M.; Foulk, James W.; Wilson, Andrew T.; Goldstein, Richard

Recent advances in sensor technology have made continuous real-time health monitoring available in both hospital and non-hospital settings. Because high-frequency medical sensors generate very large volumes of data, storing and processing continuous medical data is an emerging big data area. In particular, detecting anomalies in real time is important for detecting and preventing patient emergencies. A time series discord is the subsequence that differs most from the rest of the time series subsequences, meaning that it has abnormal or unusual data trends. In this study, we implemented two versions of time series discord detection algorithms on a high-performance parallel database management system (DBMS) and applied them to 240 Hz waveform data collected from 9,723 patients. The initial brute-force version of the discord detection algorithm takes each possible subsequence and calculates its distance to the nearest non-self match to find the biggest discords in the time series. The heuristic version of the algorithm uses a combination of an array and a trie structure to order the time series data and improve time efficiency. The study results showed efficient data loading, decoding, and discord searches over a large amount of data, benefiting from the time series discord detection algorithm and the architectural characteristics of the parallel DBMS, including data compression, data pipelining, and task scheduling.

TYPE Conference Poster YEAR 2015

Preliminary Results on Uncertainty Quantification for Pattern Analytics

Stracuzzi, David J.; Brost, Randolph; Chen, Maximillian G.; Malinas, Rebecca; Peterson, Matthew G.; Phillips, Cynthia A.; Robinson, David G.; Woodbridge, Diane M.

This report summarizes preliminary research into uncertainty quantification for pattern analytics within the context of the Pattern Analytics to Support High-Performance Exploitation and Reasoning (PANTHER) project. The primary focus of PANTHER was to make large quantities of remote sensing data searchable by analysts. The work described in this report adds nuance to both the initial data preparation steps and the search process. Search queries are transformed from "does the specified pattern exist in the data?" to "how certain is the system that the returned results match the query?" We show example results for both data processing and search, and discuss a number of possible improvements for each.

TYPE SAND Report YEAR 2015

Time Series Discord Detection in Medical Data using a Parallel Relational Database

Woodbridge, Diane M.; Foulk, James W.; Wilson, Andrew T.; Goldstein, Richard

Abstract not provided.

TYPE Conference Poster YEAR 2015

Image-Based Algorithms - Semantic Graph Algorithms

Brost, Randolph; Carroll, Michelle J.; Mclendon, William; Parekh, Ojas D.; Strip, David R.; Foulk, James W.; Woodbridge, Diane M.

Abstract not provided.

TYPE Conference Poster YEAR 2015

Thoughts on Multi-Modality Data Analysis

Woodbridge, Diane M.; Brost, Randolph

Abstract not provided.

TYPE Conference Poster YEAR 2015

A Computational Framework for Ontologically Storing and Analyzing Very Large Overhead Image Sets

Brost, Randolph; Mclendon, William; Parekh, Ojas D.; Foulk, James W.; Strip, David R.; Woodbridge, Diane M.

Abstract not provided.

TYPE Conference Poster YEAR 2014

A Computational Framework for Ontologically Storing and Analyzing Very Large Overhead Image Sets

Foulk, James W.; Brost, Randolph; Mclendon, William; Parekh, Ojas D.; Woodbridge, Diane M.; Strip, David R.

Abstract not provided.

TYPE Conference Poster YEAR 2014

Computing Quality Scores and Uncertainty for Approximate Pattern Matching in Geospatial Semantic Graphs

Stracuzzi, David J.; Brost, Randolph; Phillips, Cynthia A.; Robinson, David G.; Woodbridge, Diane M.

Abstract not provided.

TYPE Presentation YEAR 2014

Temporal Analysis and Change Detection via Geospatial-Temporal Semantic Graphs

Brost, Randolph; Mclendon, William; Parekh, Ojas D.; Rintoul, Mark D.; Woodbridge, Diane M.

Abstract not provided.

TYPE Conference YEAR 2014

Facility Search in Remote Sensing Data Using Geospatial Semantic Graphs

Brost, Randolph; Mclendon, William; Parekh, Ojas D.; Rintoul, Mark D.; Woodbridge, Diane M.

Abstract not provided.

TYPE Conference YEAR 2014

Spacecraft state-of-health (SOH) analysis via data mining

13th International Conference on Space Operations, SpaceOps 2014

Lindsay, Stephen R.; Woodbridge, Diane M.

Spacecraft state-of-health (SOH) analysis typically consists of limit-checking to compare incoming measurand values against their predetermined limits. While useful, this approach requires significant engineering insight along with the ability to evolve limit values over time as components degrade and their operating environment changes. In addition, it fails to take into account the effects of measurand combinations, as multiple values together could signify an imminent problem. A more powerful approach is to apply data mining techniques to uncover hidden trends and patterns as well as interactions among groups of measurands. In an internal research and development effort, software engineers at Sandia National Laboratories explored ways to mine SOH data from a remote sensing spacecraft. Because our spacecraft uses variable sample rates and packetized telemetry to transmit values for 30,000 measurands across 700 unique packet IDs, our data is characterized by a wide disparity of time and value pairs. We discuss how we summarized and aligned this data so that data mining algorithms could be applied to it efficiently. After the data preprocessing step, we apply supervised learning, including decision tree and principal component analysis, and unsupervised learning, including k-means and orthogonal partitioning clustering and one-class support vector machine, to four different spacecraft SOH scenarios. Our experimental results show that data mining is a low-cost, high-payoff approach to SOH analysis and provides an excellent way to exploit vast quantities of time-series data among groups of measurands in different scenarios. Our scenarios show that the supervised cases were particularly useful in identifying key contributors to anomalous events, and the unsupervised cases were well-suited for automated analysis of the system as a whole. The developed underlying models can be updated over time to accurately represent a changing operating environment and ultimately to extend the mission lifetime of our valuable space assets.
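The limit-checking baseline and the summarize-and-align preprocessing mentioned in this abstract can be sketched as follows. The abstract does not say how the data was summarized, so the fixed-width time windows, the per-window mean, and every function and measurand name below are hypothetical choices for illustration only.

```python
from collections import defaultdict
from statistics import mean


def summarize(samples, window):
    """Average irregular (time, value) samples into fixed-width time windows.

    Returns {window_index: mean value}; windows with no samples are absent.
    """
    bins = defaultdict(list)
    for t, v in samples:
        bins[int(t // window)].append(v)
    return {k: mean(vs) for k, vs in sorted(bins.items())}


def align(measurands, window):
    """Build one row per window with a column per measurand (None if missing)."""
    summaries = {name: summarize(s, window) for name, s in measurands.items()}
    windows = sorted({k for s in summaries.values() for k in s})
    return [
        {"window": k, **{name: s.get(k) for name, s in summaries.items()}}
        for k in windows
    ]


def limit_check(row, limits):
    """Return the names of measurands whose value falls outside (low, high)."""
    return [
        name for name, (low, high) in limits.items()
        if row.get(name) is not None and not (low <= row[name] <= high)
    ]


# Hypothetical telemetry: two measurands sampled at different, irregular times.
measurands = {
    "temp": [(0.0, 20.0), (0.5, 22.0), (1.2, 40.0)],
    "volt": [(0.1, 5.0), (2.3, 5.2)],
}
rows = align(measurands, window=1.0)
limits = {"temp": (0, 30), "volt": (4.5, 5.5)}
for row in rows:
    print(row["window"], limit_check(row, limits))
```

Once every measurand shares the same window index, each row can serve as a feature vector for the clustering or classification methods the abstract lists, with missing values handled however the chosen algorithm requires.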

TYPE Conference YEAR 2014
