Publications Search

Anomaly Detection in Video Using Compression

Proceedings of the International Conference on Multimedia Information Processing and Retrieval, MIPR

Smith, Michael R.; Gooding, Renee; Bisila, Jonathan; Ting, Christina

Deep neural networks (DNNs) achieve state-of-the-art performance in video anomaly detection. However, the usage of DNNs is limited in practice due to their computational overhead, generally requiring significant resources and specialized hardware. Further, despite recent progress, current evaluation criteria of video anomaly detection algorithms are flawed, preventing meaningful comparisons among algorithms. In response to these challenges, we propose (1) a compression-based technique referred to as Spatio-Temporal N-Gram Prediction by Partial Matching (STNG PPM) and (2) simple modifications to current evaluation criteria for improved interpretation and broader applicability across algorithms. STNG PMM does not require specialized hardware, has few parameters to tune, and is competitive with DNNs on multiple benchmark data sets in video anomaly detection.

More Details

TYPE Conference Proceeding YEAR 2024

DOI OSTI Scopus

Chaconne: A Statistical Approach to Nonlocal Compression for Supervised Learning, Semi-Supervised Learning, and Anomaly Detection

Foss, Alexander; Field, Richard V.; Ting, Christina; Shuler, Kurtis; Bauer, Travis L.; Zhao, Sihai D.; Cardenas-Torres, Eduardo

This project developed a novel statistical understanding of compression analytics (CA), which has challenged and clarified some core assumptions about CA, and enabled the development of novel techniques that address vital challenges of national security. Specifically, this project has yielded the development of novel capabilities including 1. Principled metrics for model selection in CA, 2. Techniques for deriving/applying optimal classification rules and decision theory to supervised CA, including how to properly handle class imbalance and differing costs of misclassification, 3. Two techniques for handling nonlocal information in CA, 4. A novel technique for unsupervised CA that is agnostic with regard to the underlying compression algorithm, 5. A framework for semisupervised CA when a small number of labels are known in an otherwise large unlabeled dataset. 6. The academic alliance component of this project has focused on the development of a novel exemplar-based Bayesian technique for estimating variable length Markov models (closely related to PPM [prediction by partial matching] compression techniques). We have developed examples illustrating the application of our work to text, video, genetic sequences, and unstructured cybersecurity log files.

More Details

TYPE LDRD Report YEAR 2023

DOI OSTI

Identifying and Explaining Anomalous Activity in Surveillance Video with Compression Algorithms

Smith, Michael R.; Bisila, Jonathan; Gooding, Renee; Ting, Christina

The primary purpose of this document is to outline the progress made on the LDRD titled “Identifying and Explaining Anomalous Activity in Surveillance Video with Compression Algorithms” in FY22 and FY23. In this LDRD, we explored the usage of compression-based analytics to identify anomalous activity in video. We developed a novel algorithm, Spatio-Temporal N-Gram PPM (STNG PPM) that accounts for spatially and temporally aware anomalies in video. We extracted features using motions vectors from video as well as operating on the raw features. STNG PPM is comparable to many deep learning approaches but does not require specialized hardware (GPUs) to run efficiently. We also examine the evaluation metrics and propose novel measures addressing faults in the current evaluation measures.

More Details

TYPE LDRD Report YEAR 2023

DOI OSTI

Statistical Properties of Compression Analytics

Shuler, Kurtis; Foss, Alexander; Ting, Christina; Bauer, Travis L.; Field, Richard V.

Abstract not provided.

More Details

TYPE Conference Presentation YEAR 2023

DOI OSTI

Building an R Shiny App to Measure the Impact of Transparency and Interactivity on Human Trust in AI

Tuft, Marie; Sorge, Marieke A.; Polski, Anna V.; Wisniewski, Kyra L.; Ting, Christina; Matzen, Laura E.

Abstract not provided.

More Details

TYPE Conference Presentation YEAR 2023

DOI OSTI

Applying Compression Metrics to Seismic Data to Assist Analysts

Zhou, Angela E.; Field, Richard V.; Matzen, Laura E.; Ting, Christina

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2023

DOI OSTI

Compression-based Analytics for Efficiently Identifying Events that Deviate from Standard Operating Procedures in Surveillance Video

Smith, Michael R.; Gooding, Renee; Ting, Christina; Bisila, Jonathan

Abstract not provided.

More Details

TYPE Conference Paper YEAR 2023

OSTI

Statistical Properties of Compression Analytics

Shuler, Kurtis; Foss, Alexander; Bauer, Travis L.; Field, Richard V.; Ting, Christina

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2023

DOI OSTI

Identifying Anomalous Activity in Surveillance Video with Compression-Based Analytics

Ting, Christina; Smith, Michael R.; Gooding, Renee; Bisila, Jonathan

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2023

DOI OSTI

A Model of Narrative Reinforcement on a Dual-Layer Social Network

Emery, Benjamin; Ting, Christina; Gearhart, Jared L.; Tucker, J.D.

More Details

TYPE SAND Report YEAR 2022

DOI OSTI

MIDAS: Modeling Individual Differences using Advanced Statistics

Wisniewski, Kyra L.; Matzen, Laura E.; Stites, Mallory C.; Ting, Christina; Tuft, Marie; Sorge, Marieke A.

This research explores novel methods for extracting relevant information from EEG data to characterize individual differences in cognitive processing. Our approach combines expertise in machine learning, statistics, and cognitive science, advancing the state-of-the art in all three domains. Specifically, by using cognitive science expertise to interpret results and inform algorithm development, we have developed a generalizable and interpretable machine learning method that can accurately predict individual differences in cognition. The output of the machine learning method revealed surprising features of the EEG data that, when interpreted by the cognitive science experts, provided novel insights to the underlying cognitive task. Additionally, the outputs of the statistical methods show promise as a principled approach to quickly find regions within the EEG data where individual differences lie, thereby supporting cognitive science analysis and informing machine learning models. This work lays methodological ground work for applying the large body of cognitive science literature on individual differences to high consequence mission applications.

More Details

TYPE SAND Report YEAR 2022

DOI OSTI

Instantiation of HCML Demonstrating Bayesian Predictive Modeling for Attentional Control

Bugg, Julie; Clifford, Joshua; Murchison, Nicole; Ting, Christina

The research team developed models of Attentional Control (AC) that are unique to existing modeling approaches in the literature. The goal was to enable the research team to (1) make predictions about AC and human performance in real-world scenarios and (2) to make predictions about individual characteristics based on human data. First, the team developed a proof-of-concept approach for representing an experimental design and human subjects data in a Bayesian model, then demonstrated an ability to draw inferences about conditions of interest relevant to real-world scenarios. Ultimately, this effort was successful, and we were able to make reasonable (meaning supported by behavioral data) inferences about conditions of interest to develop a risk model for AC (where risk is defined as a mismatch between AC and attentional demand). The team additionally defined a path forward for a human-constrained machine learning (HCML) approach to make predictions about an individual's state based on performance data. The effort represents a successful first step in both modeling efforts and serves as a basis for future work activities. Numerous opportunities for future work have been defined.

More Details

TYPE SAND Report YEAR 2022

DOI OSTI

Faster, featureless classification using compression analytics

Ting, Christina; Johnson, Nicholas; Onunkwo, Uzoma; Tucker, J.D.

Abstract not provided.

More Details

TYPE Conference Presentation YEAR 2021

DOI OSTI

A Projected Network Model of Online Disinformation Cascades

Emery, Benjamin; Ting, Christina; Johnson, Nicholas; Tucker, J.D.

Within the past half-decade, it has become overwhelmingly clear that suppressing the spread of deliberate false and misleading information is of the utmost importance for protecting democratic institutions. Disinformation has been found to come from both foreign and domestic actors, but the effects from either can be disastrous. From the simple encouragement of unwarranted distrust to conspiracy theories promoting violence, the results of disinformation have put the functionality of American democracy under direct threat. Present scientific challenges posed by this problem include detecting disinformation, quantifying its potential impact, and preventing its amplification. We present a model on which we can experiment with possible strategies toward the third challenge: the prevention of amplification. This is a social contagion network model, which is decomposed into layers to represent physical, ''offline'', interactions as well as virtual interactions on a social media platform. Along with the topological modifications to the standard contagion model, we use state-transition rules designed specifically for disinformation, and distinguish between contagious and non-contagious infected nodes. We use this framework to explore the effect of grassroots social movements on the size of disinformation cascades by simulating these cascades in scenarios where a proportion of the agents remove themselves from the social platform. We also test the efficacy of strategies that could be implemented at the administrative level by the online platform to minimize such spread. These top-down strategies include banning agents who disseminate false information, or providing corrective information to individuals exposed to false information to decrease their probability of believing it. We find an abrupt transition to smaller cascades when a critical number of random agents are removed from the platform, as well as steady decreases in the size of cascades with increasingly more convincing corrective information. Finally, we compare simulated cascades on this framework with real cascades of disinformation recorded on Whatsapp surrounding the 2019 Indian election. We find a set of hyperparameter values that produces a distribution of cascades matching the scaling exponent of the distribution of actual cascades recorded in the dataset. We acknowledge the available future directions for improving the performance of the framework and validation methods, as well as ways to extend the model to capture additional features of social contagion.

More Details

TYPE SAND Report YEAR 2021

DOI OSTI

Physiological Characterization of Language Comprehension

Matzen, Laura E.; Stites, Mallory C.; Ting, Christina; Howell, Breannan C.; Wisniewski, Kyra L.

In this project, our goal was to develop methods that would allow us to make accurate predictions about individual differences in human cognition. Understanding such differences is important for maximizing human and human-system performance. There is a large body of research on individual differences in the academic literature. Unfortunately, it is often difficult to connect this literature to applied problems, where we must predict how specific people will perform or process information. In an effort to bridge this gap, we set out to answer the question: can we train a model to make predictions about which people understand which languages? We chose language processing as our domain of interest because of the well- characterized differences in neural processing that occur when people are presented with linguistic stimuli that they do or do not understand. Although our original plan to conduct several electroencephalography (EEG) studies was disrupted by the COVID-19 pandemic, we were able to collect data from one EEG study and a series of behavioral experiments in which data were collected online. The results of this project indicate that machine learning tools can make reasonably accurate predictions about an individual?s proficiency in different languages, using EEG data or behavioral data alone.

More Details

TYPE SAND Report YEAR 2021

DOI OSTI

Human-Constrained Indicators of Gatekeeping Behavior as a Role in Information Suppression: Finding Invisible Information and the Significant Unsaid

Bandlow, Alisa; Murchison, Nicole; Ting, Christina; Wisniewski, Kyra L.; Zhou, Angela E.

To date, disinformation research has focused largely on the production of false information ignoring the suppression of select information. We term this alternative form of disinformation information suppression. Information suppression occurs when facts are withheld with the intent to mislead. In order to detect information suppression, we focus on understanding the actors who withhold information. In this research, we use knowledge of human behavior to find signatures of different gatekeeping behaviors found in text. Specifically, we build a model to classify the different types of edits on Wikipedia using the added text alone and compare a human-informed feature engineering approach to a featureless algorithm. Being able to computationally distinguish gatekeeping behaviors is a first step towards identifying when information suppression is occurring.

More Details

TYPE SAND Report YEAR 2021

DOI OSTI

Using Machine Learning to Predict Bilingual Language Proficiency from Reaction Time Priming Data

Proceedings of the 43rd Annual Meeting of the Cognitive Science Society: Comparative Cognition: Animal Minds, CogSci 2021

Matzen, Laura E.; Ting, Christina; Stites, Mallory C.

Studies of bilingual language processing typically assign participants to groups based on their language proficiency and average across participants in order to compare the two groups. This approach loses much of the nuance and individual differences that could be important for furthering theories of bilingual language comprehension. In this study, we present a novel use of machine learning (ML) to develop a predictive model of language proficiency based on behavioral data collected in a priming task. The model achieved 75% accuracy in predicting which participants were proficient in both Spanish and English. Our results indicate that ML can be a useful tool for characterizing and studying individual differences.

More Details

TYPE Conference Poster YEAR 2021

DOI OSTI Scopus

Using Machine Learning to Predict Bilingual Language Proficiency from Reaction Time Priming Data

Proceedings of the 43rd Annual Meeting of the Cognitive Science Society: Comparative Cognition: Animal Minds, CogSci 2021

Matzen, Laura E.; Ting, Christina; Stites, Mallory C.

Studies of bilingual language processing typically assign participants to groups based on their language proficiency and average across participants in order to compare the two groups. This approach loses much of the nuance and individual differences that could be important for furthering theories of bilingual language comprehension. In this study, we present a novel use of machine learning (ML) to develop a predictive model of language proficiency based on behavioral data collected in a priming task. The model achieved 75% accuracy in predicting which participants were proficient in both Spanish and English. Our results indicate that ML can be a useful tool for characterizing and studying individual differences.

More Details

TYPE Conference Paper YEAR 2021

OSTI Scopus

Faster classification using compression analytics

IEEE International Conference on Data Mining Workshops, ICDMW

Ting, Christina; Johnson, Nicholas; Onunkwo, Uzoma; Tucker, J.D.

Compression analytics have gained recent interest for application in malware classification and digital forensics. This interest is due to the fact that compression analytics rely on measured similarity between byte sequences in datasets without requiring prior feature extraction; in other words, these methods are featureless. Being featureless makes compression analytics particularly appealing for computer security applications, where good static features are either unknown or easy to circumvent by adversaries. However, previous classification methods based on compression analytics relied on algorithms that scaled with the size of each labeled class and the number of classes. In this work, we introduce an approach that, in addition to being featureless, can perform fast and accurate inference that is independent of the size of each labeled class. Our method is based on calculating a representative sample, the Fréchet mean, for each labeled class and using it at inference time. We introduce a greedy algorithm for calculating the Fréchet mean and evaluate its utility for classification across a variety of computer security applications, including authorship attribution of source code, file fragment type detection, and malware classification.

More Details

TYPE Conference Proceeding YEAR 2021

DOI OSTI Scopus

Applying Compression Distance Metrics to Seismic Data in Support of Global Nuclear Explosion Monitoring

Matzen, Laura E.; Ting, Christina; Field, Richard V.; Young, Christopher J.; Coram, Jamie L.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

DOI OSTI

Applying Compression-Based Metrics to Seismic Data in Support of Global Nuclear Explosion Monitoring

Matzen, Laura E.; Ting, Christina; Field, Richard V.; Morrow, J.D.; Brogan, Ronald; Young, Christopher J.; Zhou, Angela; Trumbo, Michael C.S.; Coram, Jamie L.

The analysis of seismic data for evidence of possible nuclear explosion testing is a critical global security mission that relies heavily on human expertise to identify and mark seismic signals embedded in background noise. To assist analysts in making these determinations, we adapted two compression distance metrics for use with seismic data. First, we demonstrated that the Normalized Compression Distance (NCD) metric can be adapted for use with waveform data and can identify the arrival times of seismic signals. Then we tested an approximation for the NCD called Sliding Information Distance (SLID), which can be computed much faster than NCD. We assessed the accuracy of the SLID output by comparing it to both the Akaike Information Criterion (AIC) and the judgments of expert seismic analysts. Our results indicate that SLID effectively identifies arrival times and provides analysts with useful information that can aid their analysis process.

More Details

TYPE SAND Report YEAR 2020

DOI OSTI

Efficient Generalized Boundary Detection Using a Sliding Information Distance

IEEE Transactions on Signal Processing

Field, Richard; Quach, Tu T.; Ting, Christina

We present a general machine learning algorithm for boundary detection within general signals based on an efficient, accurate, and robust approximation of the universal normalized information distance. Our approach uses an adaptive sliding information distance (SLID) combined with a wavelet-based approach for peak identification to locate the boundaries. Special emphasis is placed on developing an adaptive formulation of SLID to handle general signals with multiple unknown and/or drifting section lengths. Although specialized algorithms may outperform SLID when domain knowledge is available, these algorithms are limited to specific applications and do not generalize. SLID excels in these cases. We demonstrate the versatility and efficacy of SLID on a variety of signal types, including synthetically generated sequences of tokens, binary executables for reverse engineering applications, and time series of seismic events.

More Details

TYPE Journal Article YEAR 2020

DOI OSTI Scopus

Reordering Genomic Sequences for Enhanced Classification via Compression Analytics

Gooding, Renee; Ting, Christina; Caswell, Jacob; Field, Richard V.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2019

OSTI

Detailed Statistical Models of Host-Based Data for Detection of Malicious Activity

Foulk, James W.; Chen, Guenevere; Adams, Susan S.; Bryant, Ross D.; Haas, Jason J.; Johnson, Nicholas; Romanowich, Paul; Roy, Krishna; Shakamuri, Mayuri; Foulk, James W.; Ting, Christina

The cybersecurity research community has focused primarily on the analysis and automation of intrusion detection systems by examining network traffic behaviors. Expanding on this expertise, advanced cyber defense analysis is turning to host-based data to use in research and development to produce the next generation network defense tools. The ability to perform deep packet inspection of network traffic is increasingly harder with most boundary network traffic moving to HTTPS. Additionally, network data alone does not provide a full picture of end-to-end activity. These are some of the reasons that necessitate looking at other data sources such as host data. We outline our investigation into the processing, formatting, and storing of the data along with the preliminary results from our exploratory data analysis. In writing this report, it is our goal to aid in guiding future research by providing foundational understanding for an area of cybersecurity that is rich with a variety of complex, categorical, and sparse data, with a strong human influence component. Including suggestions for guiding potential directions for future research.

More Details

TYPE SAND Report YEAR 2019

DOI OSTI

Genomic Security Related Projects

Harmon, Brooke N.; Timlin, Jerilyn A.; Ting, Christina

More Details

TYPE Presentation YEAR 2019

OSTI

Publications

Search results