new predict method API, and reduced sampling in input space by odunbar · Pull Request #404 · CliMA/CalibrateEmulateSample.jl

odunbar · 2026-03-06T20:08:29Z

Purpose

Closes "Predicted covariance" is not always what the user thinks it is. Perhaps a better API? #400
Closes Add a machine learning tool for user-defined functions #380

Content

For the predict() for both Emulator and ForwardMapWrapper
- Added deprecation notice to transform_to_real=false kwarg
- Added add_obs_noise_cov=false kwarg to predict. when set true, this adds the regularization to the machine learning tool
- Added encode=nothing to the predict method, when set to "in", "out", "in_and_out" it will take in / push out encoded/decoded outputs.
- When encoding inputs, with ForwardMapWrapper, samples are decoded into the output space before application

API is now improved for user experience

# (User) typical to predict at new points:
y_pred, y_pred_cov = predict(em_or_fmw, new_inputs) 

# (Internal) call within encoded MCMC, with full uncertainty
g_pred, g_pred_cov = predict(em_or_fmw, new_inputs, encode="in_and_out", add_obs_noise_cov=true)

When retrieving MCMC posterior, if a sufficiently lossy encoder is used noise is injected into the null space from the prior (and correlation-aware)
Added utility get_encoder_from_schedule to return E,b from an encoder_schedule defining an affine encoding Ex + b
Added new docstrings and improved existing docstrings
Full unit testing.

Misc

encoding in input space does lead to lost performance in the (already-slow) ReverseDiff-mode differentiable MCMC samplers. But otherwise no noticeable differences
example testing. Works on Sinusoid example, [now testing on Darcy...]
Bugfix for arguments of _predict , , -> ;

Detail on current encoding setup

Current settings: full (F) vs reduced (R) space

Emulator space (R)
MCMC: initial conditions (R)
MCMC: computing likelihood (R)
MCMC: stored prior distribution (F)
MCMC: computing logpdf(prior,x) (F) (decoding current sample of subspace)
MCMC: output posterior samples (F) (decoding each posterior sample + add correlated noise in null space from the prior)

Darcy example

We compare, lossless encoding. Lossy encoding without inflation, Lossy encoding with inflation, and the truth. We look for the lossy encoding to by increased by using the prior variance. The D utility indicates the amount of contraction from prior to posterior (by comparison of determinants)

Lossless - (20D->50D), D utility O(10^11)

Lossy - inflate - (18D->13D), D utility O(10^11)

Lossy - no inflate - (18D->13D), D utility O(10^43)

true

I have read and checked the items on the review checklist.

Co-authored-by: ArneBouillon <45404227+ArneBouillon@users.noreply.github.com>

deprecate transform_to_real replace transform_to_real kwarg, and add add_obs_noise_cov

…terior currently)

…gaussian lifts correctly

…teps during tests

codecov · 2026-03-25T22:01:16Z

Codecov Report

❌ Patch coverage is 77.55102% with 33 lines in your changes missing coverage. Please review.
✅ Project coverage is 92.92%. Comparing base (c011a05) to head (faf88f5).
⚠️ Report is 5 commits behind head on main.

Files with missing lines	Patch %	Lines
src/Utilities.jl	36.36%	21 Missing ⚠️
src/MarkovChainMonteCarlo.jl	89.13%	5 Missing ⚠️
src/MachineLearningTools/GaussianProcess.jl	76.92%	3 Missing ⚠️
src/Utilities/elementwise_scaler.jl	0.00%	2 Missing ⚠️
src/Emulator.jl	97.50%	1 Missing ⚠️
src/MachineLearningTools/VectorRandomFeature.jl	85.71%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #404      +/-   ##
==========================================
- Coverage   94.20%   92.92%   -1.29%     
==========================================
  Files          10       10              
  Lines        1813     1950     +137     
==========================================
+ Hits         1708     1812     +104     
- Misses        105      138      +33

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

odunbar force-pushed the orad/new-predict-method branch from 42f37c7 to 105d172 Compare March 6, 2026 20:11

odunbar and others added 4 commits March 6, 2026 12:16

added ForwardMapWrapper, made compatible with MCMC and encoder utilities

892063d

format

a4dbaaa

Update src/Emulator.jl

eafcc06

Co-authored-by: ArneBouillon <45404227+ArneBouillon@users.noreply.github.com>

first update to predict

b02758f

deprecate transform_to_real replace transform_to_real kwarg, and add add_obs_noise_cov

odunbar force-pushed the orad/new-predict-method branch from 105d172 to b02758f Compare March 6, 2026 20:18

restore format

49dd3a6

odunbar changed the title ~~new predict method~~ new predict method API, and reduced sampling in input space Mar 6, 2026

odunbar added 16 commits March 16, 2026 11:04

initial encode/decode inputs

92259e7

add orth. compl. piece

af57c8f

format

a3c5d12

precompiles

d64728f

corrected logic for encode/decode flags

596663b

add maxlog to warning message

e15cbb2

add get_en/decoder_from_schedule

56aaed9

running version or sampler with encoded inputs (just decoding for pos…

0e2125f

…terior currently)

return full affine shift from get_encoder_from_schedule, and compute …

6c4622d

…gaussian lifts correctly

compute proposals correctly, resolve input dim checks

fe8c020

format

a822ddd

add guardrailsfor getting empty encoders

69f5500

reshape bug in test

89af12e

format

0d37dc0

due to reverse-diff slow downs with LinearMaps.jl, reduced the mcmc s…

ffdc01c

…teps during tests

change lengths

faf88f5

odunbar added 5 commits March 26, 2026 10:27

tests for utils

92e96b9

test retrieval of encoder

b888359

tests for the enriching

e47dd50

typo

149c218

format

bd8813f

odunbar added 2 commits March 27, 2026 11:23

updated and added new docstrings

71a02af

larger tol as it is borderline

bf0de84

odunbar requested a review from ArneBouillon March 27, 2026 20:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

new predict method API, and reduced sampling in input space#404

new predict method API, and reduced sampling in input space#404
odunbar wants to merge 28 commits intomainfrom
orad/new-predict-method

odunbar commented Mar 6, 2026 •

edited

Loading

Uh oh!

codecov bot commented Mar 25, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

odunbar commented Mar 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Content

Misc

Detail on current encoding setup

Darcy example

Uh oh!

codecov bot commented Mar 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

odunbar commented Mar 6, 2026 •

edited

Loading

codecov bot commented Mar 25, 2026 •

edited

Loading