Information scientist · AI agent advocate
Stephanie Jarmak — information scientist and AI agent advocate
I work on multi-agent orchestration and code intelligence: getting teams of agents to reliably understand and change large codebases, and evaluating whether they actually help. Currently at Sourcegraph, and a research affiliate with NASA SciX.
Currently
- AI agent advocate / applied research scientist Sourcegraph
- Research affiliate NASA Science Explorer (SciX)
Selected work
All projects →-
SciX Agent
An agentic research assistant over the NASA SciX / ADS corpus, bridging AI agents with scholarly search infrastructure.
PythonAgentsMCPRetrieval
-
Gas City
An orchestration-builder SDK for multi-agent coding workflows. I'm a maintainer.
GoAgentsOrchestration
-
CodeScaleBench
A benchmark suite for evaluating how AI coding agents use external context-retrieval tools on realistic developer tasks in large, enterprise-scale codebases.
C++EvaluationRetrieval
-
EnterpriseBench
A benchmark for evaluating how well coding agents understand and navigate code across large, distributed enterprise codebases.
PythonEvaluationAgents
-
CodeProbe
Benchmarks AI coding agents against your own codebase by mining evaluation tasks from its git history, so the suite can't be contaminated by training data.
PythonEvaluationAgents
-
mem
Build and benchmark agentic memory using a multi-agent orchestrator's own work traces as the evaluation corpus, where every unit of work has a verifiable outcome.
TypeScriptEvaluationAgent memory
-
AccountBot
A Slack-native sales assistant that runs a Claude tool-use loop over a curated target-account corpus, answering go-to-market questions with per-account research, segment cohorts, and campaign assets pulled live from BigQuery, a federated data proxy, and signed object stores.
PythonAgentsSlackLLM
Speaking
All talks →