Publications Search

Krishnakumar, Raga; Briquez, Priscilla S.; Goldberger, Zoe; Hauert, Sylvie; Chang, Kevin; Kurtanich, Trevin; Alpar, Aaron T.; Repond, Gregoire; Wang, Yue; Gomes, Suzana; Siddarth, Prabha; Swartz, Melody A.; Hubbell, Jeffrey A.

Immune checkpoint immunotherapy (ICI) can re-activate immune reactions against neoantigens, leading to remarkable remission in cancer patients. Nevertheless, only a minority of patients are responsive to ICI, and approaches for prediction of responsiveness are needed to improve the success of cancer treatments. While the tumor mutational burden (TMB) correlates positively with responsiveness and survival of patients undergoing ICI, the influence of the subcellular localizations of the neoantigens remains unclear. Here, we demonstrate in both a mouse melanoma model and human clinical datasets of 1,722 ICI-treated patients that a high proportion of membrane-localized neoantigens, particularly at the plasma membrane, correlate with responsiveness to ICI therapy and improved overall survival across multiple cancer types. We further show that combining membrane localization and TMB analyses can enhance the predictability of cancer patient response to ICI. Our results may have important implications for establishing future clinical guidelines to direct the choice of treatment toward ICI.

More Details

TYPE Journal Article YEAR 2023

DOI OSTI Scopus

Electrochemically Tunable Mixed Valence Conduction in Ruthenium Hexacyanoruthenate

Robinson, Donald A.; Foster, Michael E.; Bennett, Christopher H.; Bhandarkar, Austin; Webster, Elizabeth R.; Celebi, Aleyna; Celebi, Nisa; Fuller, Elliot J.; Stavila, Vitalie; Spataru, Dan C.; Ashby, D.S.; Marinella, Matthew; Krishnakumar, Raga; Allendorf, Mark D.; Talin, Albert A.

Abstract not provided.

More Details

TYPE Conference Presentation YEAR 2022

DOI OSTI

All Models are Wrong, but Some(times) are Useful: Evaluating when Machine Learning Models are Useful for Detecting Novel Malware in the Wild

Smith, Michael R.; Krishnakumar, Raga; Lubars, Joseph; Verzi, Stephen J.; Zhou, Xin; Goyal, Akul

Abstract not provided.

More Details

TYPE Conference Paper YEAR 2022

OSTI

MalGen: Malware Generation with Specific Behaviors to Improve Machine Learning-based Detectors

Smith, Michael R.; Carbajal, Armida J.; Domschot, Eva; Johnson, Nicholas T.; Goyal, Akul; Lamb, Chris; Lubars, Joseph; Kegelmeyer, William P.; Krishnakumar, Raga; Quynn, Sophie; Ramyaa, Ramyaa; Verzi, Stephen J.; Zhou, Xin

In recent years, infections and damage caused by malware have increased at exponential rates. At the same time, machine learning (ML) techniques have shown tremendous promise in many domains, often out performing human efforts by learning from large amounts of data. Results in the open literature suggest that ML is able to provide similar results for malware detection, achieving greater than 99% classifcation accuracy [49]. However, the same detection rates when applied in deployed settings have not been achieved. Malware is distinct from many other domains in which ML has shown success in that (1) it purposefully tries to hide, leading to noisy labels and (2) often its behavior is similar to benign software only differing in intent, among other complicating factors. This report details the reasons for the diffcultly of detecting novel malware by ML methods and offers solutions to improve the detection of novel malware.

More Details

TYPE SAND Report YEAR 2022

DOI OSTI

Combined Imaging and RNA-Seq on a Microfluidic Platform for Viral Infection Studies

Krishnakumar, Raga; Sjoberg, Kurt C.; Fisher, Andrew N.; Doudoukjian, Gloria E.; Webster, Elizabeth R.

The goal of this work was to pioneer a novel, low-overhead protocol for simultaneously assaying cell-surface markers and intracellular gene expression in a single mammalian cell. The purpose of developing such a method is to be able to understand the mechanisms by which pathogens engage with individual mammalian cells, depending on their cell surface proteins, and how both host and pathogen gene expression changes are reflective of these mechanisms. The knowledge gained from such analyses of single cells will ultimately lead to more robust pathogen detection and countermeasures. Our method was aimed at streamlining both the upstream cell sample preparation using microfluidic methods, as well as the actual library making protocol. Specifically, we wanted to implement a random hexamer-based reverse transcription of all RNA within a single cell (as opposed to oligo dT-based which would only capture polyadenylated transcripts), and then use a CRISPR-based method called scDash to deplete ribosomal DNAs (since ribosomal RNAs make up the majority of the RNA in a mammalian cell). After significant troubleshooting, we demonstrate that we are able to prepare cDNA from RNA using the random hexamer primer, and perform the rDNA depletion. We also show that we can visualize individually stained cells, setting up the pipeline for connecting surface markers to RNA-sequencing profiles. Finally, we test a number of devices for various parts of the pipeline, including bead generation, optical barcoding and cell dispensing, and demonstrate that while some of these have potential, more work is needed to optimize this part of the pipeline.

More Details

TYPE SAND Report YEAR 2022

DOI OSTI

Proton Tunable Analog Transistor for Low Power Computing

Robinson, Donald A.; Foster, Michael E.; Bennett, Christopher H.; Bhandarkar, Austin; Fuller, Elliot J.; Stavila, Vitalie; Spataru, Dan C.; Krishnakumar, Raga; Cole-Filipiak, Neil C.; Schrader, Paul; Ramasesha, Krupa; Allendorf, Mark D.; Talin, Albert A.

This project was broadly motivated by the need for new hardware that can process information such as images and sounds right at the point of where the information is sensed (e.g. edge computing). The project was further motivated by recent discoveries by group demonstrating that while certain organic polymer blends can be used to fabricate elements of such hardware, the need to mix ionic and electronic conducting phases imposed limits on performance, dimensional scalability and the degree of fundamental understanding of how such devices operated. As an alternative to blended polymers containing distinct ionic and electronic conducting phases, in this LDRD project we have discovered that a family of mixed valence coordination compounds called Prussian blue analogue (PBAs), with an open framework structure and ability to conduct both ionic and electronic charge, can be used for inkjet-printed flexible artificial synapses that reversibly switch conductance by more than four orders of magnitude based on electrochemically tunable oxidation state. Retention of programmed states is improved by nearly two orders of magnitude compared to the extensively studied organic polymers, thus enabling in-memory compute and avoiding energy costly off-chip access during training. We demonstrate dopamine detection using PBA synapses and biocompatibility with living neurons, evoking prospective application for brain - computer interfacing. By application of electron transfer theory to in-situ spectroscopic probing of intervalence charge transfer, we elucidate a switching mechanism whereby the degree of mixed valency between N-coordinated Ru sites controls the carrier concentration and mobility, as supported by density functional theory (DFT) .

More Details

TYPE SAND Report YEAR 2022

DOI OSTI

Data Science and Machine Learning for Genome Security

Verzi, Stephen J.; Krishnakumar, Raga; Levin, Drew; Krofcheck, Daniel J.; Williams, Kelly P.

This report describes research conducted to use data science and machine learning methods to distinguish targeted genome editing versus natural mutation and sequencer machine noise. Genome editing capabilities have been around for more than 20 years, and the efficiencies of these techniques has improved dramatically in the last 5+ years, notably with the rise of CRISPR-Cas technology. Whether or not a specific genome has been the target of an edit is concern for U.S. national security. The research detailed in this report provides first steps to address this concern. A large amount of data is necessary in our research, thus we invested considerable time collecting and processing it. We use an ensemble of decision tree and deep neural network machine learning methods as well as anomaly detection to detect genome edits given either whole exome or genome DNA reads. The edit detection results we obtained with our algorithms tested against samples held out during training of our methods are significantly better than random guessing, achieving high F1 and recall scores as well as with precision overall.

More Details

TYPE SAND Report YEAR 2022

DOI OSTI

OperonSEQer: A set of machine-learning algorithms with threshold voting for detection of operon pairs using short-read RNA-sequencing data

PLoS Computational Biology

Krishnakumar, Raga; Ruffing, Anne R.

Operon prediction in prokaryotes is critical not only for understanding the regulation of endogenous gene expression, but also for exogenous targeting of genes using newly developed tools such as CRISPR-based gene modulation. A number of methods have used transcriptomics data to predict operons, based on the premise that contiguous genes in an operon will be expressed at similar levels. While promising results have been observed using these methods, most of them do not address uncertainty caused by technical variability between experiments, which is especially relevant when the amount of data available is small. In addition, many existing methods do not provide the flexibility to determine the stringency with which genes should be evaluated for being in an operon pair. We present OperonSEQer, a set of machine learning algorithms that uses the statistic and p-value from a non-parametric analysis of variance test (Kruskal-Wallis) to determine the likelihood that two adjacent genes are expressed from the same RNA molecule. We implement a voting system to allow users to choose the stringency of operon calls depending on whether your priority is high recall or high specificity. In addition, we provide the code so that users can retrain the algorithm and re-establish hyperparameters based on any data they choose, allowing for this method to be expanded as additional data is generated. We show that our approach detects operon pairs that are missed by current methods by comparing our predictions to publicly available long-read sequencing data. OperonSEQer therefore improves on existing methods in terms of accuracy, flexibility, and adaptability.

More Details

TYPE Journal Article YEAR 2022

DOI OSTI Scopus

Malware Generation with Specific Behaviors to Improve Machine Learning-based Detectors

Verzi, Stephen J.; Johnson, Nicholas; Khanna, Kanad; Zhou, Xin; Quynn, Sophie; Krishnakumar, Raga; Smith, Michael R.

Abstract not provided.

More Details

TYPE Conference Presentation YEAR 2021

DOI OSTI DOI OSTI

CERES: CRISPR Engineering for the Rapid Enhancement of Strains

Ruffing, Anne R.; Podlevsky, Joshua; Krishnakumar, Raga; Smallwood, Chuck R.; Dallo, Tessa; Torres, Xavier; Kolker, Stephanie; Morgan, John; King, Nathaphon Y.H.; Marsing, Melissa

Previous strain development efforts for cyanobacteria have failed to achieve the necessary productivities needed to support economic biofuel production. We proposed to develop CRISPR Engineering for Rapid Enhancement of Strains (CERES). We developed genetic and computational tools to enable future high-throughput screening of CRISPR interference (CRISPRi) libraries in the cyanobacterium Synechococcus sp. PCC 7002, including: (1) Operon- SEQer: an ensemble of algorithms for predicting operon pairs using RNA-seq data, (2) experimental characterization and machine learning prediction of gRNA design rules for CRISPRi, and (3) a shuttle vector for gene expression. These tools lay the foundation for CRISPR library screening to develop cyanobacterial strains that are optimized for growth or metabolite production under a wide range of environmental conditions. The optimization of cyanobacterial strains will directly advance U.S. energy and climate security by enabling domestic biofuel production while simultaneously mitigating atmospheric greenhouse gases through photoautotrophic fixation of carbon dioxide.

More Details

TYPE SAND Report YEAR 2021

DOI OSTI

Data Science for Characterization of Genome Noise/Mutation

Verzi, Stephen J.; Krishnakumar, Raga; Levin, Drew; Krofcheck, Daniel J.; Boskin, Callie; Williams, Kelly P.

Abstract not provided.

More Details

TYPE Presentation YEAR 2021

OSTI

Data Science for Detection of Genome Editing

Verzi, Stephen J.; Krishnakumar, Raga; Levin, Drew; Krofcheck, Daniel J.; Boskin, Callie; Williams, Kelly P.

Abstract not provided.

More Details

TYPE Presentation YEAR 2021

OSTI

Augmentation of Antibacterial Activity in Mesenchymal Stromal Cells Through Systems-Level Analysis and CRISPR-mediated Activation of CD14

Hirakawa, Matthew; Tjahjono, Nikki; Light, Yooli K.; Chintalapudi, Prem; Branda, Steven S.; Butler, Kimberly; Krishnakumar, Raga

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2021

DOI OSTI

Malware Generation with Specific Behaviors to Improve Machine Learning-based Detection

Proceedings - 2021 IEEE International Conference on Big Data, Big Data 2021

Bays, Nathan R.; Verzi, Stephen J.; Johnson, Nicholas T.; Khanna, Kanad; Zhou, Xin; Quynn, Sophie; Krishnakumar, Raga

We describe efforts in generating synthetic malware samples that have specified behaviors that can then be used to train a machine learning (ML) algorithm to detect behaviors in malware. The idea behind detecting behaviors is that a set of core behaviors exists that are often shared in many malware variants and that being able to detect behaviors will improve the detection of novel malware. However, empirically the multi-label task of detecting behaviors is significantly more difficult than malware classification, only achieving on average 84% accuracy across all behaviors as opposed to the greater than 95% multi-class or binary accuracy reported in many malware detection studies. One of the difficulties in identifying behaviors is that while there are ample malware samples, most data sources do not include behavioral labels, which means that generally there is insufficient training data for behavior identification. Inspired by the success of generative models in improving image processing techniques, we examine and extend a 1) conditional variational auto-encoder and 2) a flow-based generative model for malware generation with behavior labels. Initial experiments indicate that synthetic data is able to capture behavioral information and increase the recall of behaviors in novel malware from 32% to 45% without increasing false positives and to 52% with increased false positives.

More Details

TYPE Conference Paper YEAR 2021

OSTI Scopus

COVID-19 LDRD Project Summaries

Treece, Amy; Corbin, William; Caskey, Susan A.; Krishnakumar, Raga; Williams, Kelly P.; Branch, Darren W.; Harmon, Brooke N.; Polsky, Ronen; Bauer, Travis L.; Finley, Patrick D.; Jeffers, Robert; Safta, Cosmin; Makvandi, Monear; Laird, Carl; Domino, Stefan P.; Ho, Clifford K.; Grillet, Anne M.; Pacheco, Jose L.; Nemer, Martin; Rossman, Grant A.; Koplow, Jeffrey; Celina, Mathew C.; Jones, Brad H.; Burton, Patrick D.; Haggerty, Ryan P.; Jacobs-Gedrim, Robin B.; Thelen, Paul M.

Sandia National Laboratories currently has 27 COVID-related Laboratory Directed Research & Development (LDRD) projects focused on helping the nation during the pandemic. These LDRD projects cross many disciplines including bioscience, computing & information sciences, engineering science, materials science, nanodevices & microsystems, and radiation effects & high energy density science.

More Details

TYPE Other Report YEAR 2020

DOI OSTI

Gene editing and CRISPR in the clinic: Current and future perspectives

Bioscience Reports

Hirakawa, Matthew; Krishnakumar, Raga; Timlin, Jerilyn A.; Carney, James; Butler, Kimberly

Genome editing technologies, particularly those based on zinc-finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and CRISPR (clustered regularly interspaced short palindromic repeat DNA sequences)/Cas9 are rapidly progressing into clinical trials. Most clinical use of CRISPR to date has focused on ex vivo gene editing of cells followed by their re-introduction back into the patient. The ex vivo editing approach is highly effective for many disease states, including cancers and sickle cell disease, but ideally genome editing would also be applied to diseases which require cell modification in vivo. However, in vivo use of CRISPR technologies can be confounded by problems such as off-target editing, inefficient or off-target delivery, and stimulation of counterproductive immune responses. Current research addressing these issues may provide new opportunities for use of CRISPR in the clinical space. In this review, we examine the current status and scientific basis of clinical trials featuring ZFNs, TALENs, and CRISPR-based genome editing, the known limitations of CRISPR use in humans, and the rapidly developing CRISPR engineering space that should lay the groundwork for further translation to clinical application.

More Details

TYPE Journal Article YEAR 2020

DOI OSTI Scopus

Engineering mesenchymal stromal cells for anti-microbial therapy

Hirakawa, Matthew; Tjahjono, Nikki; Light, Yooli K.; Chintalapudi, Prem; Butler, Kimberly; Branda, Steven S.; Krishnakumar, Raga

Abstract not provided.

More Details

TYPE Conference Poster YEAR 2020

OSTI

Real-Time Selective Sequencing with RUBRIC: Read Until with Basecall and Reference-Informed Criteria

Scientific Reports

Bartsch, Michael S.; Krishnakumar, Raga; Sinha, Anupama; Patel, Kamlesh; Bird, Sara W.; Edwards, Harrison S.

The Oxford MinION, the first commercial nanopore sequencer, is also the first to implement molecule-by-molecule real-time selective sequencing or “Read Until”. As DNA transits a MinION nanopore, real-time pore current data can be accessed and analyzed to provide active feedback to that pore. Fragments of interest are sequenced by default, while DNA deemed non-informative is rejected by reversing the pore bias to eject the strand, providing a novel means of background depletion and/or target enrichment. In contrast to the previously published pattern-matching Read Until approach, our RUBRIC method is the first example of real-time selective sequencing where on-line basecalling enables alignment against conventional nucleic acid references to provide the basis for sequence/reject decisions. We evaluate RUBRIC performance across a range of optimizable parameters, apply it to mixed human/bacteria and CRISPR/Cas9-cut samples, and present a generalized model for estimating real-time selection performance as a function of sample composition and computing configuration.

More Details

TYPE Journal Article YEAR 2019

DOI OSTI Scopus