Publications Search

In recent years, infections and damage caused by malware have increased at exponential rates. At the same time, machine learning (ML) techniques have shown tremendous promise in many domains, often out performing human efforts by learning from large amounts of data. Results in the open literature suggest that ML is able to provide similar results for malware detection, achieving greater than 99% classifcation accuracy [49]. However, the same detection rates when applied in deployed settings have not been achieved. Malware is distinct from many other domains in which ML has shown success in that (1) it purposefully tries to hide, leading to noisy labels and (2) often its behavior is similar to benign software only differing in intent, among other complicating factors. This report details the reasons for the diffcultly of detecting novel malware by ML methods and offers solutions to improve the detection of novel malware.

More Details

TYPE SAND Report YEAR 2022

DOI OSTI

Machine Learning Classification and Reduction of CAD Parts for Rapid Design to Simulation

Owen, Steven J.; Carbajal, Armida J.; Peterson, Matthew G.; Ernst, Corey D.

Abstract not provided.

More Details

TYPE Conference Paper YEAR 2022

OSTI

Machine Learning Classification for Rapid CAD-to-Simulation

Owen, Steven J.; Ernst, Corey D.; Carbajal, Armida J.; Peterson, Matthew G.

Abstract not provided.

More Details

TYPE Conference Presentation YEAR 2022

DOI OSTI

Machine Learning Classification for Rapid CAD-to-Simulation

Owen, Steven J.; Carbajal, Armida J.; Ernst, Corey D.; Peterson, Matthew G.; Shead, Timothy M.

Abstract not provided.

More Details

TYPE Conference Paper YEAR 2021

OSTI

Going Beyond Signature Malware Detection by Learning Behaviors

Johnson, Nicholas; Domschot, Eva; Khanna, Kanad; Kegelmeyer, William P.; Lamb, Christopher; Ramyaa, Ramyaa; Smith, Michael R.; Verzi, Stephen J.; Zhou, Xin; Carbajal, Armida J.; Haus, Bridget; Ingram, Joe B.

Abstract not provided.

More Details

TYPE Conference Presentation YEAR 2021

DOI OSTI

Mind the Gap: On Bridging the Semantic Gap between Machine Learning and Malware Analysis

AISec 2020 - Proceedings of the 13th ACM Workshop on Artificial Intelligence and Security

Smith, Michael R.; Johnson, Nicholas; Ingram, Joe B.; Carbajal, Armida J.; Haus, Bridget I.; Domschot, Eva; Ramyaa, Ramyaa; Lamb, Christopher; Verzi, Stephen J.; Kegelmeyer, William P.

Machine learning (ML) techniques are being used to detect increasing amounts of malware and variants. Despite successful applications of ML, we hypothesize that the full potential of ML is not realized in malware analysis (MA) due to a semantic gap between the ML and MA communities-as demonstrated in the data that is used. Due in part to the available data, ML has primarily focused on detection whereas MA is also interested in identifying behaviors. We review existing open-source malware datasets used in ML and find a lack of behavioral information that could facilitate stronger impact by ML in MA. As a first step in bridging this gap, we label existing data with behavioral information using open-source MA reports-1) altering the analysis from identifying malware to identifying behaviors, 2)~aligning ML better with MA, and 3)~allowing ML models to generalize to novel malware in a zero/few-shot learning manner. We classify the behavior of a malware family not seen during training using transfer learning from a state-of-the-art model for malware family classification and achieve 57%-84% accuracy on behavioral identification but fail to outperform the baseline set by a majority class predictor. This highlights opportunities for improvement on this task related to the data representation, the need for malware specific ML techniques, and a larger training set of malware samples labeled with behaviors.

More Details

TYPE Conference Presentation YEAR 2020

DOI OSTI Scopus

Extending Theory-Guided Data Science to Consider Social Science Domains

Gunda, Thushara; Carbajal, Armida J.; Sanchez, Danielle N.

More Details

TYPE Conference Poster YEAR 2020

DOI OSTI

MalGen: On Bridging the Semantic Gap between Machine Learning and Malware Analysis

Smith, Michael R.; Carbajal, Armida J.; Domschot, Eva; Haus, Bridget I.; Ingram, Joe B.; Johnson, Nicholas; Kegelmeyer, William P.; Lamb, Christopher; Ramyaa, Ramyaa; Verzi, Stephen J.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Mind the Gap: On Bridging the Semantic Gap between Machine Learning and Malware Analysis

Smith, Michael R.; Johnson, Nicholas; Ingram, Joe B.; Carbajal, Armida J.; Haus, Bridget I.; Domschot, Eva; Ramyaa, Ramyaa; Lamb, Christopher; Verzi, Stephen J.; Kegelmeyer, William P.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

DOI OSTI

Mind the Gap: On Bridging the Semantic Gap between Machine Learning and Information Security

Smith, Michael R.; Johnson, Nicholas; Ingram, Joe B.; Carbajal, Armida J.; Ramyaa, Ramyaa; Domschot, Evelyn; Lamb, Christopher; Verzi, Stephen J.; Kegelmeyer, William P.

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Industrial Control Systems: Cyber Security Risk Candidate Methods Analysis

Dawson, Lon A.; Lamb, Christopher; Carbajal, Armida J.

In recognition of their mission and in response to continuously evolving cyber threats against nuclear facilities, Department of Energy - Nuclear Energy (DOE-NE) is building the Nuclear Energy Cyber security Research, Development, and Demonstration (RD&D) Program, which includes a cyber risk management thrust. This report supports the cyber risk management thrust objective which is to deliver "Standardized methodologies for credible risk-based identification, evaluation and prioritization of digital components." In a previous task, the Sandia National Laboratories (SNL) team presented evaluation criteria and a survey to review methods to determine the most suitable techniques. In this task we will identify and evaluate a series of candidate methodologies. In this report, 10 distinct methodologies are evaluated. The overall goal of this effort was to identify the current range of risk analysis techniques that were currently available, and how they could be applied, with an focus on industrial control systems (ICS). Overall, most of the techniques identified did fall into accepted risk analysis practices, though they generally addressed only one step of the multi-step risk management process. A few addressed multiple steps, but generally their treatment was superficial. This study revealed that the current state of security risk analysis in digital control systems was not comprehensive and did not support a science-based evaluation. The papers surveyed did use mathematical formulation to describe the addressed problems, and tied the models to some kind of experimental or experiential evidence as support. Most of the papers, however, did not use a rigorous approach to experimentally support the proposed models, nor did they have enough evidence supporting the efficacy of the models to statistically analyze model impact. Both of these issues stem from the difficulty and expense associated with collecting experimental data in this domain.

More Details

TYPE SAND Report YEAR 2018

DOI OSTI

Enhanced Training for Cyber Situational Awareness

Carbajal, Armida J.; Silva, Austin R.; Nauer, Kevin; Anderson, Benjamin; Reed, Theodore; Forsythe, James C.

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Enhanced Training for Cyber Situational Awareness

Carbajal, Armida J.; Silva, Austin R.; Nauer, Kevin; Anderson, Benjamin; Reed, Theodore; Forsythe, James C.

Abstract not provided.

More Details

TYPE Conference YEAR 2013

OSTI

Enhanced Training for Cyber Situational Awareness in Red versus Blue Team Exercises

Forsythe, James C.; Carbajal, Armida J.; Adams, Susan S.; Silva, Austin R.; Nauer, Kevin; Anderson, Benjamin

This report summarizes research conducted through the Sandia National Laboratories Enhanced Training for Cyber Situational Awareness in Red Versus Blue Team Exercises Laboratory Directed Research and Development project. The objective of this project was to advance scientific understanding concerning how to best structure training for cyber defenders. Two modes of training were considered. The baseline training condition (Tool-Based training) was based on current practices where classroom instruction focuses on the functions of a software tool with various exercises in which students apply those functions. In the second training condition (Narrative-Based training), classroom instruction addressed software functions, but in the context of adversary tactics and techniques. It was hypothesized that students receiving narrative-based training would gain a deeper conceptual understanding of the software tools and this would be reflected in better performance within a red versus blue team exercise.

More Details

TYPE SAND Report YEAR 2012

DOI OSTI

The Use of Design of Experiments to Determine the Impact of Conductive Particles on Electrical Properties of Alumina for Electronic Applications

Carbajal, Armida J.

Abstract not provided.

More Details

TYPE Conference YEAR 2011

OSTI

Publications

Search results