Featured project · 2026·UW HCDE Capstone·Jan–Jun 2026

Trustworthy AI for Research Budgets

Integrating AI inside the university's research administration system. Deciding what the AI is for, how AI automates, and where humans make the final call.

Timeline

Jan–Jun 2026 · UW HCDE Capstone

My Role

Product Manager & UX Research Lead

Team

1 PM, 1 UXR, 2 Designers

Sponsor

UW Office of Research Information Services (ORIS), which owns SAGE

TL;DR

Problem

A grant manager carries ~50 awards (up to 75+ at peak) and builds each budget twice: once in Excel, once by hand in SAGE, across 8+ disconnected systems, ~2–3 hrs of copying per award. This is not a tooling problem but a coordination gap.

Strategy

Embedded the worksheet inside SAGE and put AI only on the in-between work, gated by AI vs. Human boundary.

Impact

2+ hrs saved per award
25–33% workload cut (projected 75–150 hrs/GM/yr)
Proved trustworthy AI in a high-stakes workflow

Problem solved

01. The work between the systems had become the work.

A UW grant manager carries ~50 active awards, up to 75+ at peak, and rebuilds each budget twice: ~2–3 hours of hand-copying across 8+ disconnected systems before the work only they can do (rates, exceptions, compliance, sign-off) even begins.

~50 active awards per GM on average, up to 75+ at peak
8+ disconnected surfaces per award: the NOA PDF, Excel, 5–6 rate sites, Workday, SAGE
~2–3 hours of hand-copying per award, before any judgment work begins

The scattered sources a grant manager juggles per award: file folders, an Excel budget, the Notice of Award PDF, and UW rate websites — Eight-plus disconnected surfaces per award: folders, Excel, the Notice of Award, and UW rate sites, each re-keyed by hand.

How I validated

02. One pattern recurred in all six interviews.

Ran 6 contextual inquiries with grant managers across 6 departments (67+ combined years, 200+ active awards), watching real work (which systems they open, how many tabs stay up, where they copy and paste) over asking them to describe it.

“GMs build every budget twice.”

Built once in Excel for the proposal, again in the portal after the award.

6/6 used Excel as a playground system · 5/6 did manual rate lookups across scattered sources.

The decision that mattered

03. Reframed what the internal admin platform is for.

University of Washington treated SAGE, the admin system, as a system of record, somewhere to store and verify finished budgets. We reframed it as a system of work: the surface where the budget actually gets built. That shift is what makes “bring the worksheet inside SAGE” a structural fix for double-entry, not just a new feature.

✗ rejected · standalone tool

Instead of introducing a standalone tool, I integrated into the existing user experience, to minimize friction while improving efficiency.

✓ selected · worksheet inside SAGE

Embed the worksheet in SAGE; put AI only on the in-between work.
Respected GMs' existing Excel-based workflow, using their mental models and minimizing re-training.

✓ selected · AI-powered budgeting with manual adjustment

AI fills the routine lines and does the math; the GM reviews and adjusts before anything is saved. Rates and dollar/percent math run on fixed rules, not AI guesses, and every AI-filled value links to its source so it can be checked.

What the AI is for, and where it stops

Use AI herethe routine in-between work

Reads the document and websites
Auto-fills budget data from the worksheet into the admin system
Flags mismatches to review, each linked back to its source

Keep humans herewhere the decision must stay human

Reviewing and adjusting every AI-filled value
The final call for budget submission

How I built · trust by design

04. Made the AI's work easy to check.

Trust fails two ways: distrust the AI and re-key everything by hand, or accept a wrong number without accountability. AI is here not to replace their expertise, but to assist users to make the smart decisions efficiently and effectively.

AI assists, the human decides.
The GM keeps every final call: review, sign-off, and submit. AI never commits anything on its own.
Turned trust guidelines into real UI.
Applied Microsoft HAX, Google PAIR, and the University of Idaho's AI4RA framework as concrete parts: confidence chips, a source label on every value, and a preview before anything saves.
Kept drafting and checking apart.
You build in the worksheet; you resolve mismatches in reconciliation. Every AI fix is shown and confirmed.

Description

Role / basis

Amount

Harry Potter

Main PI · 10% effort

$16,826

F10

Fringe benefits

Faculty blended rate

22.7%

C11

ARVO Annual Meeting

1 PI · 4 nights · Seattle

$3,281

▲ hover, tap, or tab to a green-underlined value

Resolve-mismatch view, a typed budget mismatch surfaced with a one-click, preview-then-confirm fix — The accountability moment: a typed mismatch with a preview-then-confirm fix

Before & after

05. One workspace. Zero round-trips.

Scattered tools get absorbed into SAGE: Excel, the NOA PDF, and the rate websites now live inside one surface. Plays on its own when it scrolls into view; hit ↻ Replay to watch again.

📊

Excel

build the budget

→copy

📄

NOA PDF

award letter

→re-key

🌐

Rate sites

5–6 sources

→re-enter

🌿

SAGE

the destination

🌿 SAGE · one workspace inside SAGE

✓ 📊 Excel✓ 📄 NOA PDF✓ 🌐 Rate sites

Auto-filled & source-labeled · built once, inside SAGE

4+ round-trips · several hours by hand

The same work, before and after

Before and after: the current multi-system budgeting workflow — build in Excel, submit, then resolve mismatches with repeated round-trips — versus the same work with the worksheet embedded in SAGE, where AI auto-fills and pre-populates and the grant manager verifies in one screen

Impact delivered

06. What it moved.

Validated in moderated usability sessions with grant managers.

To user

75–150hrs / GM / yr

Reclaimed per grant manager

projected · 2+ hrs/award × award volume

To user

25–33%

Workload cut

estimated across ~50 concurrent awards (P1)

To business

4+ → 1

Round-trips per budget

time-to-submission: several hrs → ≤30 min

To business

3,000+hrs / yr

Returned org-wide

across UW's 1,000+ active grants, at ≤30 min/project

Video

The concept film frames the scale: UW runs 1,000+ active research grants, with grant managers tracking 75+ projects each at 2 to 3 hours per budget. Trustworthy AI for Research Budgets cuts that to under 30 minutes per project, returning 3,000+ hours a year across UW's research portfolio.

Key takeaway

The hardest part of integrating AI isn't the model performance. It's narrowing what the AI is for.

Responsible AI isn't choosing the best model. It's the narrowing of what the AI is for, and calibrating trust so people catch the AI's mistakes without second-guessing what it gets right. We need to decide where AI meets human decision.

Keep exploring

You just viewed: Trustworthy AI for Research Budgets

Next project

Needle Space →

Pre-tagged cafe search for remote workers: meaning over keywords.

Observatory

See how I grow everyday →

Monthly digests, side projects, and what didn't make the case studies.