Publications & Software - Ashley Mae Conard, Ph.D.

*Denotes co-first authorship. †Denotes co-last authorship.

Evidence Aggregator, crocodile detective with linked patients table

2026

Evidence Aggregator: AI reasoning applied to rare disease diagnostics

H Twede, L Pais, S Bryen, E O'Heir, G Smith, R Paulsen, CA Austin-Tse, A Bloemendal, C Simons, S Saponas, M Wander, DG MacArthur, H Rehm, AM Conard

Genetics in Medicine, 2026

We developed the Evidence Aggregator (EvAgg), a generative AI tool designed for rare disease diagnosis that systematically extracts relevant information from the scientific literature for any human gene. EvAgg reduced analyst review time by 34% (p < 0.002) and increased the number of papers, variants, and cases evaluated per unit time, with 97% recall in identifying relevant papers.

Paper Preprint Code Blog

Tackling cancer with generative models, multimodal fusion figure

2026

Tackling the complexity of cancer with generative models

AM Conard, M Hughes, J Hall, N Tenenholtz, E Zimmermann, L Crawford, AP Amini, K Severson

Cell, April 2026

A perspective and framework for leveraging generative AI models to address the complexity and heterogeneity of cancer, enabling more precise therapeutic strategies.

Paper

Predicting evolutionary rate, sequence and evolutionary rate figure

2026

Predicting evolutionary rate as a pretraining task improves genome language model representations

ME Consens, KK Yang, J Hall, AM Conard, B Wang, L Crawford, A Moses, AX Lu

ICML 2026 / bioRxiv

We introduce pretraining tasks that predict evolutionary rates for genome language models, showing that models trained on both sequence and evolutionary rate outperform those trained on sequence alone across biologically grounded benchmarks.

preprint DOI

Biologically annotated neural network, Protein-Pathway-Phenotype architecture

2026

A biologically annotated neural network for proteomic discovery in Parkinson's disease

A Vijayaraghavan, L Crawford, S Krishnakant, AP Amini, AM Conard, A Olsen, L Chahine, K Severson

April 2026

A biologically annotated neural network framework leveraging proteomics data to identify novel biomarkers and mechanisms in Parkinson's disease.

Paper

2025

scGeneScope: A Treatment-Matched Single Cell Imaging and Transcriptomics Dataset and Benchmark for Treatment Response Modeling

J Dapello, M Nassar, R Eksi, B Wang, J Gagnon-Marchand, KT Gao, A Baharlouei, K Thrush-Evensen, N Riehs, AF Peterson, A Tolpadi, A Rajagopal, HE Miller, AM Conard, D Alvarez-Melis, R Stark, S Bianco, M Levine, AP Amini, AX Lu, N Fusi, R Pandya, V Pedoia, H El-Samad

NeurIPS Datasets and Benchmarks Track

scGeneScope is the first large-scale, high-quality, treatment-matched multiprofile dataset for single-cell biology, with over 627k scRNA-seq profiles and 716k Cell Painting images from identical chemical treatments across 28 diverse mechanisms of action. In our paper, we present this dataset and challenge the hype around foundation models by providing a realistic testbed for rigorous benchmarking of ML models (from linear models to foundation models) for drug discovery.

paper code data

2025

AI-Enhanced Sensemaking: Exploring the Design of a Generative AI-Based Assistant to Support Genetic Professionals

A Mastrianni, H Twede, A Sarcevic, J Wander, C Austin-Tse, S Saponas, H Rehm, AM Conard†, AK Hall†

ACM Transactions on Interactive Intelligent Systems

We co-designed a generative AI assistant with genetics professionals to support genome sequencing analysis for rare disease diagnosis. By identifying key challenges in sensemaking and reanalysis, we developed and prototyped AI features that help synthesize variant evidence and flag cases for reanalysis, ultimately aiming to increase diagnostic yield and reduce time to diagnosis.

Paper

2025

Evidence Aggregator: AI reasoning applied to rare disease diagnostics

H Twede, L Pais, S Bryen, E O'Heir, G Smith, R Paulsen, CA Austin-Tse, A Bloemendal, C Simons, S Saponas, M Wander, DG MacArthur, H Rehm†, AM Conard†

bioRxiv

We developed a large language model (LLM)-powered framework, EvAgg, to aggregate and synthesize rare disease literature and related content, enabling clinical genomic analysts to review patient cases more rapidly and thoroughly in research settings. EvAgg reduced case review time by 34% (p < 0.002) and significantly increased the throughput of papers, variants, and cases analyzed.

pdf code

2025

Addressing biomedical data challenges and opportunities to inform a large-scale data lifecycle for enhanced data sharing, interoperability, analysis, and collaboration across stakeholders

V Sriram, AM Conard, I Rosenberg, D Kim, TS Saponas, AK Hall

Scientific Reports 15 (1), 6291

We conducted a qualitative study to identify common challenges and data tasks across the biomedical discovery lifecycle by interviewing professionals from diverse roles in the field. Based on these insights, we proposed seven actionable recommendations to improve data quality, interoperability, and collaboration for precision medicine research.

Paper

2024

Dual DNA/RNA-binding factor regulates dynamics of hnRNP splicing condensates

M Ray, J Zaborowsky, P Mahableshwarkar, S Vaidyanathan, J Shum, R Viswanathan, A Huang, S-H Wang, V Johnson, N Wake, AM Conard, AE Conicella, R Puterbaugh, NL Fawzi, E Larschan

bioRxiv

Here we show that the transcription factor CLAMP not only binds DNA but also directly binds RNA and spliceosomal proteins through its prion-like domain, linking transcription to sex-specific alternative splicing. By regulating the dynamics of hnRNP splicing condensates, CLAMP ensures precise, sex-dependent splicing outcomes, revealing a new mechanism in which transcription factors act as master organizers of splicing decisions.

Paper

2024

Multioviz: an interactive platform for in silico perturbation and interrogation of gene regulatory networks

H Xie, L Crawford†, AM Conard†

BMC Bioinformatics 25 (1), 249

Multioviz is a user-friendly platform for visualizing and perturbing gene regulatory networks using multi-omics data. It enables researchers to test biological hypotheses in silico and identify molecular candidates for follow-up experiments, without requiring coding expertise.

Paper Code

2023

Sex-specific splicing occurs genome-wide during early Drosophila embryogenesis

M Ray, AM Conard, J Urban, P Mahableshwarkar, J Aguilera, A Huang, ...

eLife 12, e87865

Paper Code

2023

A spectrum of explainable and interpretable machine learning approaches for genomic studies

AM Conard*, A DenAdel*, L Crawford

WIREs Computational Statistics

We discuss the spectrum of machine learning model transparency, from black box to explainable to interpretable, highlighting methods tailored for genomic studies. We focus on how incorporating biological knowledge into model design can improve both predictive performance and scientific insight for precision medicine.

Paper

2022

It's About Time: Interpretable Methods and Associated Interactive Platforms to Uncover Regulatory Mechanisms from Temporal and Multi-Omics Data

AM Conard, C Lawrence, L Crawford, E Larschan

Brown University

We developed three interactive computational tools to uncover gene regulatory networks from temporal multi-omics data, focusing on transcription factor dynamics and sex-specific regulation. These platforms empower researchers to generate hypotheses, validate findings, and accelerate discovery, bringing us closer to personalized therapeutics.

Paper

2022

Sex-specific aging in animals: Perspective and future directions

AM Bronikowski, RP Meisel, PR Biga, JR Walters, JE Mank, E Larschan, GS Wilkinson, N Valenzuela, AM Conard, JP de Magalhães, JE Duan, AE Elias, T Gamble, RM Graze, KE Gribble, JA Kreiling, NC Riddle

Aging Cell

Paper

2021

TIMEOR: a web-based tool to uncover temporal regulatory mechanisms from multi-omics data

AM Conard, N Goodman, Y Hu, N Perrimon, R Singh, C Lawrence, ...

Nucleic Acids Research

Paper Web App Code

2021

Neuromolecular and behavioral effects of ethanol deprivation in Drosophila

NM D'Silva, KS McCullar, AM Conard, T Blackwater, R Azanchi, ...

bioRxiv, 2021.01.02.425101

Paper

2020

The transcription factor CLAMP is required for neurogenesis in Drosophila melanogaster

MA Tsiarli, JA Kentro, AM Conard, L Xu, E Nguyen, K O'Connor-Giles, ...

bioRxiv, 2020.10.09.333831

Paper

2019

Identification of Subclonal Drivers and Copy-Number Variants from Bulk and Single-Cell DNA Sequencing of Tumors

AM Conard, B Raphael

Brown University, Princeton University

Paper

2016

Highlights from the ISCB Student Council Symposia in 2016

A Jacobsen, B Siranosian, K Schwahn, AM Conard, N Aben, M Hassan, ...

F1000Research 5 (2852)

Paper

2015

Using a Big Data Database to Identify Pathogens in Protein Data Space

AM Conard, S Dodson, J Kepner, D Ricke

arXiv preprint arXiv:1501.05546

Paper

2014

Determining the winning SH3 coalition: how cooperative game theory reveals the importance of domain residues in peptide binding

AM Conard, E Cilia, T Lenaerts

Proceedings of the Benelux Bioinformatics Conference

Paper

Software

scGeneScope

The scGeneScope code enables benchmarking of treatment response models on our perturbation-paired single-cell RNA-seq and Cell Painting image dataset.

Python, Bash

link

The Evidence Aggregator

The Evidence Aggregator is a large language model (LLM)-powered framework that aggregates and synthesizes rare disease literature and related content.

Bash, Python

link

time2splice

time2splice is a method to find temporal and sex-specific alternative splicing from multi-omics data.

Bash, Python, R

link

TIMEOR

TIMEOR is a web server and Dockerized command-line tool to identify gene regulatory networks and assign mechanisms from temporal and multi-omics data.

Bash, Python, R, RShiny

link

PRIPS

A fast protein analysis algorithm using D4M, merging triplestore/NoSQL databases with associative array representations of proteomic sequences for fast big data analysis.

MATLAB

Property of MIT Lincoln Laboratory

Chemical Inventory Database

Web-based inventory management system used in academic departments. Users log in and scan barcodes for automatic item entry.

HTML, CSS, Parse Platform

Property of DePauw

Arduino-CSSI

Set of Arduino workshop modules and Fritzing diagrams to teach students how to program as part of the Google Computer Science Summer Institute.

Property of Google

Instrument Control

Online internal system to monitor product batch data extracted from Eli Lilly's Data Mart and Data Warehouse databases.

SQL, Discoverant, Business Objects

Property of Eli Lilly and Elanco