Skip to content

Research & Publications

Selected publications at ICML, ICLR, NeurIPS, AAAI, KDD, and other venues. Full list on Google Scholar and Semantic Scholar.

Agents

SpIDER: Spatially Informed Dense Embedding Retrieval for Software Issue Localization
ArXiv Preprint, 2025
Dense embedding retrieval for localizing code changes from natural language issue descriptions in large software repositories.
PDF
TimeSeriesGym: A Scalable Benchmark for (Time Series) Machine Learning Engineering Agents
ArXiv Preprint, 2025
A benchmarking environment for evaluating ML engineering agents on the full time-series modeling stack, from data exploration and modeling to testing and deployment.
PDF Code

Foundation Models

Chronos-2: From Univariate to Universal Forecasting
ArXiv Preprint, 2025
Extends the Chronos forecasting model to handle multivariate, covariate-conditioned, and probabilistic forecasting in a single unified architecture.
PDF
Exploring Representations and Interventions in Time Series Foundation Models
International Conference on Machine Learning (ICML), 2025
First to show that time series foundation models learn interpretable concepts (trends, seasonality) despite self-supervised training, and that these representations can be targeted for intervention.
PDF
MOMENT: A Family of Open Time-series Foundation Models
International Conference on Machine Learning (ICML), 2024
One of the first open-source time series foundation models. 2.5M+ downloads on HuggingFace, 700+ GitHub stars.
PDF Code HuggingFace Data

Evaluation Science

TimeSeriesExamAgent: Creating Time Series Reasoning Benchmarks at Scale
International Conference on Learning Representations (ICLR), 2026
Benchmark for temporal reasoning in LLMs, with scalable task generation via LLM agents and item response theory.
PDF
AQuA: A Benchmarking Tool for Label Quality Assessment
Neural Information Processing Systems (NeurIPS), 2023 Datasets and Benchmarks Track
A comprehensive benchmarking tool for evaluating label error detection methods across diverse datasets and annotation types.
PDF Code
Unsupervised Model Selection for Time-series Anomaly Detection Spotlight
International Conference on Learning Representations (ICLR), 2023
Shows that ensembles of unsupervised heuristics and weak supervision can accurately select anomaly detection models without any ground-truth labels.
PDF Code

Healthcare & Education

JoLT: Jointly Learned Representations of Language and Time-Series for Clinical Time-Series Interpretation Best Student Abstract
AAAI Conference on Artificial Intelligence (AAAI), 2024 Student Abstract. Also at NeurIPS 2023 DGM4H Workshop.
Learns joint language-time series embeddings so clinicians can query physiological signals with natural language descriptions.
Paper OpenReview
Using Weakly Supervised Machine Learning to Label Atrial Fibrillation in Real-World Intensive Care Unit Telemetry Data
Circulation, 2022
Applies weak supervision to detect atrial fibrillation episodes in noisy ICU telemetry without expert-labeled training data.
Paper
Classifying Unstructured Clinical Notes via Automatic Weak Supervision
Machine Learning for Healthcare Conference (MLHC), PMLR, 2022
Automatically generates labeling functions from clinical notes to train classifiers without manual annotation.
PDF
Counterfactual Phenotyping with Censored Time-to-Events
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2022
Combines causal inference with survival analysis to discover patient phenotypes under counterfactual treatment scenarios.
PDF
Weak Supervision for Affordable Modeling of Electrocardiogram Data
AMIA Annual Symposium Proceedings, 2021
Applies weak supervision and data programming to build ECG classification models without manual expert labels.
Paper
Discriminating Cognitive Disequilibrium and Flow in Problem Solving: A Semi-Supervised Approach Using Involuntary Dynamic Behavioral Signals
AAAI Conference on Artificial Intelligence (AAAI), 2020
Uses semi-supervised learning on involuntary behavioral signals (eye gaze, facial expressions) to detect cognitive states during problem solving.
Paper