Explicit error terms production code by janzill · Pull Request #1064 · ActivitySim/activitysim

janzill · 2026-04-03T23:52:53Z

This PR brings the explicit error term (EET) work to production standard. It contains the following major changes compared to the PoC implementation:
1. It adds testing and documentation for EET.
2. It decouples the sampling method from the simulation method and adds Poisson sampling (based on @m-richards's implementation in #1065), including tests and documentation. It also addresses runtime and memory issues with the sampling method "eet".
3. It removes several inconsistencies in the EET simulation branch, where edge cases could have led to unexpected changes in choices for individual choosers because error terms were not aligned.
4. It implements EET consistently for nested logit models.

Note that the default simulation method remains Monte Carlo, and all existing unit and integration tests are unchanged and pass. Users should therefore not see any differences in their model runs unless they explicitly opt in to EET by adding `use_explicit_error_terms: True` to the settings. EET is a drop-in replacement; with default settings it increased the runtime of a single demand model run by 3-10% for the models we have tested so far.
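
For readers new to the idea, the following is a minimal sketch of the difference between the two simulation methods for a plain multinomial logit. It is not ActivitySim code: the utilities, seed, and variable names are made up for illustration. Monte Carlo inverts the choice probabilities with a uniform draw, while explicit error terms add a Gumbel draw to each utility and take the maximum. Both should reproduce the same choice shares.

```python
import numpy as np

rng = np.random.default_rng(0)
utilities = np.array([1.0, 0.5, -0.2])  # made-up utilities for three alternatives
n_choosers = 200_000

# Analytic multinomial logit probabilities.
probs = np.exp(utilities) / np.exp(utilities).sum()

# Monte Carlo: one uniform draw per chooser, inverted through the CDF.
cdf = probs.cumsum()
u = rng.random(n_choosers)
mc_choices = np.minimum(np.searchsorted(cdf, u, side="right"), probs.size - 1)

# Explicit error terms: one Gumbel draw per chooser and alternative,
# then pick the alternative with the highest total utility.
eps = rng.gumbel(size=(n_choosers, utilities.size))
eet_choices = np.argmax(utilities + eps, axis=1)

mc_shares = np.bincount(mc_choices, minlength=probs.size) / n_choosers
eet_shares = np.bincount(eet_choices, minlength=probs.size) / n_choosers
print(probs.round(3), mc_shares.round(3), eet_shares.round(3))
```

The EET variant needs one draw per chooser and alternative instead of one per chooser, which is consistent with a moderate runtime increase.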

This is a large PR and I will add details in comments below so we can keep discussions focused.


janzill · 2026-05-21T04:32:57Z

Regarding tests:

For the integration tests with the two external models, we cannot compare outcomes between MC and EET because the household samples are too small.
We included EET functionality in the semcog tests and noticed a small difference between runs with 1 and 2 processes. Debugging revealed an edge case in probabilistic scheduling. This is independent of EET: the model just looks up probabilities from provided tables and therefore has no EET equivalent. It turns out that how failed trips are grouped together can lead to very small differences between single-process and multiprocessing runs for small test sets. This applies to MC as well. With a different base seed the edge case is not triggered, but it is worth discussing during the engineering meeting.
We noticed that the ARC tests are disabled and wanted to add them back in because, as far as I know, ARC is the only test model that uses trip_scheduling_choice. It turned out that the regress trips had small differences in trip schedules when running with MC. Git history shows the test was disabled in "BayDAG Contribution #16: Parking Locations in Trip Matrices" (#840). The test files were later touched in "Trip Scheduling Choice -- Same Results Single Process and Multi-Process" (#1005), but the test was never re-enabled. I cannot see why the test was disabled in the first place and might be missing some context, but I decided to update the MC regress file and re-enable the test, and to add one for EET.

janzill · 2026-05-21T04:33:55Z

Inconsistency fixes:

Improved consistency of error terms during sampling by aligning random draws to the universe of alternatives, not just zero-attraction alternatives.
Improved consistency of error terms for choices from sampled sets by aligning random draws to the universe of alternatives, not the position in the sampled set.
Improved consistency of two-zone pre-sampling by aligning MAZ choices to the universe of alternatives.
For shadow pricing, choosers now have consistent error terms for the sample and the final choice across shadow pricing iterations. Note that for the shadow pricing method simulation, fixing random numbers per chooser across iterations led to a markedly different solution (much longer distances) than without fixing them. This was independent of both the simulation method and the sampling method, and was purely due to reusing the same random numbers in the shadow pricing decisions. We therefore introduced a separate RNG channel to keep results in line with current release code. This RNG is only used if use_explicit_error_terms = True and the shadow pricing method is simulation.
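
To illustrate what aligning draws to the universe of alternatives means, here is a small sketch. It is not the ActivitySim implementation: `N_ALTS`, the seeding scheme, and both helper functions are hypothetical. The point is that when error terms are indexed by alternative id, a chooser keeps the same error term for a given alternative no matter which other alternatives end up in the sampled set.

```python
import numpy as np

N_ALTS = 10  # size of the full universe of alternatives (hypothetical)


def errors_by_universe(chooser_id, sampled_alts, base_seed=42):
    """Draw one error term per alternative in the universe, then select by alt id."""
    rng = np.random.default_rng([base_seed, chooser_id])
    all_eps = rng.gumbel(size=N_ALTS)
    return all_eps[sampled_alts]


def errors_by_position(chooser_id, sampled_alts, base_seed=42):
    """Draw one error term per position in the sampled set (not aligned)."""
    rng = np.random.default_rng([base_seed, chooser_id])
    return rng.gumbel(size=len(sampled_alts))


# Two different sampled sets for the same chooser; alternative 4 is in both,
# at position 1 in the first set and position 2 in the second.
sample_a = np.array([1, 4, 7])
sample_b = np.array([0, 1, 4, 7])

aligned_a = errors_by_universe(7, sample_a)[1]
aligned_b = errors_by_universe(7, sample_b)[2]
positional_a = errors_by_position(7, sample_a)[1]
positional_b = errors_by_position(7, sample_b)[2]

print(aligned_a == aligned_b)        # same error term for alternative 4
print(positional_a == positional_b)  # error term depends on the sample
```

The aligned variant draws more random numbers than it uses, which is one plausible source of the runtime and memory cost mentioned for the "eet" sampling method.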

janzill · 2026-05-21T04:36:01Z

Regarding Poisson sampling and disaggregate accessibilities:

For the MTC_extended and SANDAG models, we saw large differences (about 50%) in disaggregate accessibilities when running with Poisson sampling compared to the other two methods, which agree with each other. Disaggregate accessibilities are destination choice logsums, generated by sampling a subset of alternatives, and we traced the differences back to how the sampling correction factor is specified. The bottom line is that in the current specification Poisson is unbiased, whereas both MC and EET sampling are biased by log(sample_size). This is material (on the order of half the mean value) and leads to significant differences in downstream models that use disaggregate accessibilities, like auto_ownership and cdap in SANDAG. As mentioned in this week's meeting, we will present more details on this in the engineering meeting on 5/21.
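
The log(sample_size) offset can be reproduced in a toy setting. The sketch below is an illustration under stated assumptions, not the model specification: it assumes a correction term of the form log(pick_count / prob) for sampling with replacement and a Horvitz-Thompson style term log(1 / inclusion_prob) for Poisson sampling, with inclusion probabilities min(1, sample_size * prob). All utilities and sampling weights are made up.

```python
import numpy as np

rng = np.random.default_rng(1)
n_alts, sample_size, n_reps = 1000, 50, 200

v = rng.normal(size=n_alts)                    # made-up utilities
q = np.exp(v + 0.5 * rng.normal(size=n_alts))  # made-up sampling weights
q /= q.sum()                                   # sampling probabilities
true_logsum = np.log(np.exp(v).sum())


def logsum_with_replacement():
    """Sample with replacement; correct with log(pick_count / prob) (assumed form)."""
    counts = rng.multinomial(sample_size, q)
    picked = counts > 0
    corr = np.log(counts[picked] / q[picked])
    return np.log(np.exp(v[picked] + corr).sum())


def logsum_poisson():
    """Include each alternative independently; correct with log(1 / inclusion_prob)."""
    incl = np.minimum(1.0, sample_size * q)
    picked = rng.random(n_alts) < incl
    corr = np.log(1.0 / incl[picked])
    return np.log(np.exp(v[picked] + corr).sum())


bias_mc = np.mean([logsum_with_replacement() for _ in range(n_reps)]) - true_logsum
bias_poisson = np.mean([logsum_poisson() for _ in range(n_reps)]) - true_logsum
print(bias_mc, np.log(sample_size), bias_poisson)
```

A constant offset cancels in choice probabilities, so it only matters where the logsum itself is used, as with disaggregate accessibilities.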

janzill · 2026-05-21T04:36:51Z

Regarding nested logit:

The PoC implementation had a recursive structure, with choices made by walking down nests and choosing at each level. Because of the use of logsums, there were edge cases where unintuitive changes could happen (nest switching), and the recursive structure was slow. We replaced this with a method that draws nested logit error terms directly, solving both of these problems.
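
As an illustration of drawing nested logit error terms directly, here is a sketch of one textbook construction for a two-level nested logit. It is not necessarily the method used in this PR, and all names and numbers are hypothetical. Each nest gets a shared component lam * log(S), where S is a positive stable variate with index lam, and each alternative gets an independent Gumbel draw scaled by lam. The positive stable draw uses a standard representation often attributed to Kanter and assumes 0 < lam < 1.

```python
import numpy as np


def positive_stable(lam, size, rng):
    """Draw S > 0 with Laplace transform exp(-s**lam), for 0 < lam < 1."""
    u = rng.uniform(0.0, np.pi, size)
    e = rng.exponential(1.0, size)
    return (np.sin(lam * u) / np.sin(u) ** (1.0 / lam)) * (
        np.sin((1.0 - lam) * u) / e
    ) ** ((1.0 - lam) / lam)


def nested_logit_probs(v, nest_of, lam):
    """Analytic two-level nested logit probabilities."""
    terms = {}
    for m, l in lam.items():
        scaled = np.exp(v[nest_of == m] / l)
        terms[m] = (scaled / scaled.sum(), scaled.sum() ** l)
    total = sum(nest_term for _, nest_term in terms.values())
    probs = np.zeros_like(v)
    for m, (conditional, nest_term) in terms.items():
        probs[nest_of == m] = conditional * nest_term / total
    return probs


def simulate_choices(v, nest_of, lam, n_choosers, rng):
    """Draw correlated error terms for all alternatives and take the argmax."""
    eps = np.empty((n_choosers, v.size))
    for m, l in lam.items():
        members = np.flatnonzero(nest_of == m)
        nest_part = l * np.log(positive_stable(l, (n_choosers, 1), rng))
        alt_part = l * rng.gumbel(size=(n_choosers, members.size))
        eps[:, members] = nest_part + alt_part
    return np.argmax(v + eps, axis=1)


rng = np.random.default_rng(3)
v = np.array([0.5, 0.0, 0.2, -0.3])  # made-up utilities
nest_of = np.array([0, 0, 1, 1])     # nest membership per alternative
lam = {0: 0.5, 1: 0.8}               # nest scale parameters

n_choosers = 200_000
choices = simulate_choices(v, nest_of, lam, n_choosers, rng)
shares = np.bincount(choices, minlength=v.size) / n_choosers
expected = nested_logit_probs(v, nest_of, lam)
print(shares.round(3), expected.round(3))
```

Because each chooser makes a single argmax over all alternatives, there is no walk down the tree and no per-level choice that could flip independently of the others.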

janzill and others added 30 commits March 19, 2026 15:21

np array instead of for loop

5c506f1

memory reduction

710b3b0

no duplicate arrays

60a744a

bug fix: order of chooser_idx in interaction_simulate

f97185e

add tests, docstrings for logit

9d69dab

Add basic with/without EET test for interaction simulate

a9db131

Fix, complete tests for interaction sample, simulate

a7f2e8f

Linting

df574ee

Normalise number of choosers, alternatives and minimum tolerance for EET comparison tests

cf92aad

reshape, do not flatten for potential performance

73f45fe

undo stray comment

6440d3f

unify mc and eet reporting during choice making

1df7d0d

series not array

5e9847d

reinstate test, up number of draws for comparison

1213b56

numpy not loop

fa54204

interaction_sample test to catch index order bug

1b2b6dc

make test clearer

4e0c7e8

reset rng offset on iterate_location_choice (shadow pricing)

1aa85ba

Add tests for logit NL, ordering, and all models using EET

bd4211f

Add basic docs for EET

76cacb2

Add tests checking that choices made using eet and from probabilities are the same for eval_mnl and eval_nl

f628505

Linting, minor changes to test_simulate.py

362eef0

roll back changes to core tests to minimize noise

d0a6429

Implement Jan's suggestion of how to calculate household ids for testing cdap eet parity

65fd171

Add test for compute_nested_utilities. Computes nested utilities on a toy example and compares the result

f55b2cd

Linting

4debaad

Move nest_spec to a fixture in simulate.py tests

743c56f

Merge with Tom's linting changes

b62001e

Finish removing nest_spec definition in simulate.py tests

b021cee

Merge pull request #5 from outerl/tom/nested-logit-tests-and-docs

52f75fa

Extend logit tests, add model tests, add simulate tests, draft docs

janzill added 24 commits May 14, 2026 07:06

switch base seed to avoid trip_scheduling (probabilistic) to come up with edge cases for small sample test. note this also happens for non-eet, e.g. with base seed 1

5c961b7

forgot the corresponding golden trips

a325f45

test multiple_zone golden

4fcd239

removes requirement of interaction_sample_simulate to have alts_context

8933d6c

fix test by using eet as intended, remove stable_indexing in tour_dest for mc

270d450

clean up

01b38c7

conditional stable sample indexes

4b78e20

stable alts only for eet with poisson sampling for two-zone

ae278cc

trip maz-for-taz stable alts for eet with poisson sampling

de74dfc

clean up

77b80b3

stable two-zone alts for tour_od

bdfe757

tour_od stable alt cond

67fd055

resolve_smapling_method

249acfa

decouple sample and simulation methods

3ebe20c

separate RNG for shadow pricing to enable loc choiec rng reset for simulation method

dc52907

lint

d58d0c5

logging and comments

7f68160

debug logging for sample method, add warning for disagg acc and poisson

e6b75ea

doco

4803032

info, nto warning

d11f2be

compute settings, not model settings

7b7170b

compute settings, not model settings

b844372

variable name

196704a

Merge pull request #8 from outerl/jzill/eet_runtime_inc_nl_and_poisson

1567d5f

Poisson sampling, inconsistency fixes, eet runtime improvements, true nested logit eet

janzill mentioned this pull request May 21, 2026

2026-05-21 Engineering Team ActivitySim/meeting-notes#101

Open

janzill changed the title ~~Explicit error terms testing and documentation~~ Explicit error terms production code May 21, 2026

janzill wants to merge 151 commits into ActivitySim:explicit_error_terms from outerl:explicit_error_terms

3 participants
