Research & Publications
Selected publications at ICML, ICLR, NeurIPS, AAAI, KDD, and other venues. Full list on Google Scholar and Semantic Scholar.
Agents
SpIDER: Spatially Informed Dense Embedding Retrieval for Software Issue Localization
ArXiv Preprint, 2025
Dense embedding retrieval for localizing code changes from natural language issue descriptions in large software repositories.
Foundation Models
Chronos-2: From Univariate to Universal Forecasting
ArXiv Preprint, 2025
Extends the Chronos forecasting model to handle multivariate, covariate-conditioned, and probabilistic forecasting in a single unified architecture.
Exploring Representations and Interventions in Time Series Foundation Models
International Conference on Machine Learning (ICML), 2025
First to show that time series foundation models learn interpretable concepts (trends, seasonality) despite self-supervised training, and that these representations can be targeted for intervention.
MOMENT: A Family of Open Time-series Foundation Models
International Conference on Machine Learning (ICML), 2024
One of the first open-source time series foundation models. 2.5M+ downloads on HuggingFace, 700+ GitHub stars.
Evaluation Science
TimeSeriesExamAgent: Creating Time Series Reasoning Benchmarks at Scale
International Conference on Learning Representations (ICLR), 2026
Benchmark for temporal reasoning in LLMs, with scalable task generation via LLM agents and item response theory.
AQuA: A Benchmarking Tool for Label Quality Assessment
Neural Information Processing Systems (NeurIPS), 2023 Datasets and Benchmarks Track
A comprehensive benchmarking tool for evaluating label error detection methods across diverse datasets and annotation types.
Healthcare & Education
JoLT: Jointly Learned Representations of Language and Time-Series for Clinical Time-Series Interpretation
Best Student Abstract
AAAI Conference on Artificial Intelligence (AAAI), 2024 Student Abstract. Also at NeurIPS 2023 DGM4H Workshop.
Learns joint language-time series embeddings so clinicians can query physiological signals with natural language descriptions.
Using Weakly Supervised Machine Learning to Label Atrial Fibrillation in Real-World Intensive Care Unit Telemetry Data
Circulation, 2022
Applies weak supervision to detect atrial fibrillation episodes in noisy ICU telemetry without expert-labeled training data.
Classifying Unstructured Clinical Notes via Automatic Weak Supervision
Machine Learning for Healthcare Conference (MLHC), PMLR, 2022
Automatically generates labeling functions from clinical notes to train classifiers without manual annotation.
Counterfactual Phenotyping with Censored Time-to-Events
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2022
Combines causal inference with survival analysis to discover patient phenotypes under counterfactual treatment scenarios.
Weak Supervision for Affordable Modeling of Electrocardiogram Data
AMIA Annual Symposium Proceedings, 2021
Applies weak supervision and data programming to build ECG classification models without manual expert labels.
Discriminating Cognitive Disequilibrium and Flow in Problem Solving: A Semi-Supervised Approach Using Involuntary Dynamic Behavioral Signals
AAAI Conference on Artificial Intelligence (AAAI), 2020
Uses semi-supervised learning on involuntary behavioral signals (eye gaze, facial expressions) to detect cognitive states during problem solving.