mirrors/z3 - Libre-Chip.org


mirror of https://github.com/Z3Prover/z3 synced 2026-06-23 00:50:29 +00:00

Author	SHA1	Message	Date
Nikolaj Bjorner	07cea49e4b	Address PR review: push_path helper, lbool eval_cond, fix year - Add push_path(path, c, sign) that decomposes conjuncts/disjuncts - Add simplify_ite_rec(path, c, t, e) helper for cleaner recursion - Change eval_cond signature to return lbool (l_undef = undetermined) - Fix copyright year from 2025 to 2026 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-06-04 08:29:44 -07:00
Nikolaj Bjorner	ca238a9107	Address PR review: subsumption, is_value, simplify_ite fixes - Add lightweight structural is_subset for union/inter simplification - Use m.is_value instead of is_const_char for swap checks - Move eval_cond to beginning of simplify_ite_rec - Use path.shrink(sz) instead of copying extended_path - Fix normalize_reverse stuck case to return mk_reverse(r) - Expose subsumes() in public API Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-06-04 07:45:19 -07:00
Lev Nachmanson	400fe313d9	ci(nightly): always-on issue-oracle smoke test (~2 min, never fails the build) (#9688 ) Adds a tightly-bounded issue-oracle smoke test as a sibling of the existing `test_benchmarks.py` step in the nightly's `ubuntu-build` job. The step always runs as part of every nightly, can never fail the build, and completes in ~2 min. ## Why `Z3Prover/bench` ships a per-issue regression corpus (`inputs/issues/iss-N/`) plus a runner (`scripts/issues_check_oracle.py`) that diffs current z3 output against captured `<stem>.expected.out` byte streams. Wiring that into the nightly gives us a daily smoke signal that detects regressions on benchmarks distilled from real z3 issues — without requiring any z3 contributor to ever touch the bench repo. ## What A two-step block added right after the existing `Clone z3test` + `Test` steps in `ubuntu-build`: 1. Clone bench (sparse, ~800 MB of ~12 GB total) `git clone --depth 1 --filter=blob:none --sparse https://github.com/Z3Prover/bench bench` then `sparse-checkout set scripts inputs/issues`. 2. Run issue-oracle smoke test (~2 min) ```yaml continue-on-error: true run: \| timeout 90 python bench/scripts/issues_check_oracle.py \ --z3 build-dist/z3 \ --all bench/inputs/issues \ --max 200 --timeout 5 --wallclock 60 \ --jobs 0 --quiet \ --json-report issue-oracle-report.json ``` The JSON report is then uploaded as a workflow artifact (`issue-oracle-report`, 7-day retention) for inspection. 
### Wall-clock bounds (defense in depth) \| Bound \| Where \| Purpose \| \|---\|---\|---\| \| `--max 200` \| issues_check_oracle CLI \| walk only first 200 of ~2,700 `iss-` dirs (alphabetic; stable across nightlies) \| \| `--timeout 5` \| issues_check_oracle CLI \| per-file z3 cap \| \| `--wallclock 60` \| issues_check_oracle CLI \| hard global cap inside the script \| \| `timeout 90` \| shell wrapper \| belt-and-braces backstop, leaves 30 s headroom for the script to flush its JSON report before SIGTERM \| \| `continue-on-error: true` \| step gate \| absorbs every failure mode (missing z3, sparse-clone failure, outer timeout firing, etc.) so the smoke test can never red the nightly build \| ### Scope Only `ubuntu-build` and only one place in `nightly.yml`. The push/PR lanes (`ci.yml`, `Windows.yml`) and the other scheduled/dispatch lanes (`coverage.yml`, `memory-safety.yml`, `nightly-validation.yml`, `release.yml`, `wip.yml`, `daily-test-improver`) are intentionally left untouched so this gate runs exactly once per night. ## Local verification On Mac (16 cores, capped to 8 jobs by `--jobs 0` resolving to `min(jobs, cores)`): ``` [issues_check_oracle] 368 file-check(s) \| timeout=5s \| wallclock=60s === summary === total: 368 ok: 286 DIFF: 4 (per-file timeouts) skipped: 78 elapsed: 8.3s / 60s exit code: 0 ``` GHA Ubuntu (4 cores → 4 jobs) extrapolation: ~17 s typical, well under all wall-clock caps. 
### Adversarial cases (all leave the workflow green via step-level `continue-on-error: true`) \| Failure mode \| Result \| \|---\|---\| \| z3 binary missing \| each per-file run records `exec-error`, script summary-exits 0 → green \| \| Sparse clone fails (previous step's continue-on-error absorbs it) \| oracle finds no `bench/` → script `sys.exit(1)` → step's continue-on-error absorbs → green \| \| Wallclock fires \| script writes report with `wallclock_hit: true`, exits 0 → green \| \| Outer `timeout 90` fires \| SIGTERM → bash exits 124 → step's continue-on-error absorbs → green \| ## Companion bench-repo PR The data side of this (per-bench sidecar schema, `bug-K.json` + `<stem>.expected.out`, oracle rewrite) lands in `Z3Prover/bench` as PR [#2503](https://github.com/Z3Prover/bench/pull/2503). The nightly step here depends on that PR's `scripts/issues_check_oracle.py` and the migrated corpus. Both PRs should be merged together; bench can also merge first (the script handles a missing corpus gracefully). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-06-03 17:35:43 -07:00
Nikolaj Bjorner	3afd83103a	Address PR review comments: cache, simplify_ite_rec, itos - Cache now indexes by (ele, r) pair using obj_pair_map - Remove eval() function; operator()(ele, r) handles all cases - Rewrite simplify_ite_rec with path vector of signed conditions - Add range-based simplification: (lo <= x, false) + (x <= hi, false) eliminates ite(x = v, t, e) when v is outside [lo, hi] - Add is_itos case in derive_to_re: guards on n >= 0, digit range, and first character match - Port is_reverse normalization (previous commit) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-06-03 17:16:23 -07:00
Nikolaj Bjorner	a77155a5c4	Port reverse normalization into derive class Instead of treating reverse(r) as stuck (returning symbolic mk_derivative), normalize it by pushing reverse inward through the regex structure, then compute the derivative of the normalized result. Mirrors mk_re_reverse logic. Handles: concat, union, intersection, diff, ite, opt, complement, star, plus, loop, to_re (string literals, units, concats), and symmetric cases. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-06-03 15:29:30 -07:00
Nikolaj Bjorner	f8925ca6fa	Add simplify_ite_rec and eval for two-phase derivative - Add simplify_ite post-processing in operator() to simplify ITE conditions - Add simplify_ite_rec(cond, sign, r) for propagating condition truth values - Handles c == cond, x=ch1 vs x=ch2 with different constants - Add eval(ele, d) for efficient two-phase: symbolic derivative + concrete eval - mk_derivative uses two-phase pattern: m_derive(r) then m_derive.eval(ele, d) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-06-03 14:25:03 -07:00
Can Cebeci	b2401b87db	Remove redundant min_gen_match search (#9696 ) While working on https://github.com/Z3Prover/z3/pull/9405, I noticed that euf_mam.cpp code was slightly out of sync with mam.cpp and did some redundant work. Co-authored-by: Can Cebeci <t-cancebeci@microsoft.com>	2026-06-03 13:36:51 -07:00
Can Cebeci	14746d7fb6	Update used_enodes properly (#9695 ) https://github.com/Z3Prover/z3/pull/9405 made the trace report used_enodes incorrectly, since the previous code relied on update_max_generation to maintain the relevant data structure. This should fix it. Co-authored-by: Can Cebeci <t-cancebeci@microsoft.com>	2026-06-03 13:36:37 -07:00
Nikolaj Bjorner	7dc25e73d5	make reset private	2026-06-03 11:41:37 -07:00
Nikolaj Bjorner	9aca2edcfc	updates per PR comments	2026-06-03 11:32:32 -07:00
Nikolaj Bjorner	cb2cf913e3	move seq_derive and fix include paths, remove antimirov code	2026-06-03 11:04:19 -07:00
Nikolaj Bjorner	1f28fd0e6b	Add seq::derive class for symbolic regex derivatives Implement a new seq::derive class (seq_derive.h/cpp) that computes symbolic derivatives of regular expressions using ITE-trees, based on the RE# approach (Varatalu, Veanes, Ernits - POPL 2025). Key features: - Two-argument operator()(ele, r): computes derivative of regex r w.r.t. element ele (concrete character or de Bruijn variable for symbolic mode) - ACI canonicalization (flatten, stable_sort, dedup) for union/intersection - ITE-tree combinators for binary/unary operations - Info-based nullability with recursive fallback - Complement absorption rules - Depth-bounded recursion to prevent stack overflow Integration with seq_rewriter: - mk_derivative(ele, r) and mk_derivative(r) now delegate to m_derive - Removed dead mk_derivative_rec function - Added ITE hoisting in mk_re_star, mk_re_concat, mk_re_union0, mk_re_inter0, mk_re_complement - Added depth limiting in Antimirov derivative helpers Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-06-03 10:36:19 -07:00
Copilot	d64ce41b2e	Remove unused defined_names artifacts and simplify fingerprint_set::contains (#9702 ) Cleans up dead code left by the "remove side definitions" refactoring (`a0a3047`). - `smt_model_checker.cpp` — Remove `defined_names dn(m)` variable that was declared but never used - `smt_model_checker.h` — Drop the now-unnecessary `#include "ast/normal_forms/defined_names.h"` - `fingerprints.cpp` — Collapse redundant tail in `fingerprint_set::contains`: ```cpp // Before if (m_set.contains(d)) return true; return false; // After return m_set.contains(d); ``` --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>	2026-06-03 08:16:46 -07:00
Clément Pit-Claudel	1d706e875c	Handle SIGXCPU like a regular timeout (#9697 ) Z3's -T measures wall clock time, whereas `ulimit -t` measures CPU time. Currently, an expired ulimit timeout crashes Z3 without printing statistics; this patch makes it react cleanly (just as if it has encountered a regular timeout) to SIGXCPU, the signal that ulimit sends before sending SIGKILL.	2026-06-03 07:26:38 -07:00
Hari Govind V K	922f49e187	Fix MBP QEL soundness bug in datatype accessor elimination (#9571 ) (#9692 ) Two fixes for mbp_dt_tg::apply() when encountering an accessor whose argument has a different constructor in the model: 1. Don't call rm_accessor (which would assert a contradictory recognizer, making the formula false). This prevents the original bug where QEL returned 'false' for satisfiable formulas. 2. Branch on the model-assigned constructor for the accessor's argument. The correct output should include the literal introduced in (2). However, this fix does not produce it. Spacer is sound with this over-approximation, as long as the counter example does not depend on value of mismatched accessors (e.g. (tl nil)). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-06-03 07:23:21 -07:00
Nikolaj Bjorner	a0a3047e36	remove side definitions	2026-06-02 21:43:55 -07:00
Nikolaj Bjorner	ab259b6830	add depth guard	2026-06-02 14:53:50 -07:00
Nikolaj Bjorner	77f8b33794	re-enable unit tests	2026-06-02 10:39:41 -07:00
Nikolaj Bjorner	2dbe233f6a	fix condition that skipped mbqi Signed-off-by: Nikolaj Bjorner <nbjorner@microsoft.com>	2026-06-02 10:38:52 -07:00
Nikolaj Bjorner	eaf7562a1d	disable test in tptp, move to native lambdas	2026-06-02 10:38:51 -07:00
Nikolaj Bjorner	3e0a350411	Comment out ho_curried_application and ho_choice_expression tests Comment out two test functions for debugging purposes.	2026-06-02 08:47:43 -07:00
Nikolaj Bjorner	78a7b4d3a6	Update model_core.h	2026-06-01 19:47:40 -07:00
Nikolaj Bjorner	358378a6f0	remove tptp from all Signed-off-by: Nikolaj Bjorner <nbjorner@microsoft.com>	2026-06-01 19:36:18 -07:00
Nikolaj Bjorner	94b981024e	set up udoc relation to use datalog engine	2026-06-01 19:06:25 -07:00
Nikolaj Bjorner	c4366e57f8	Update udoc_relation.cpp	2026-06-01 17:22:06 -07:00
Copilot	947af23fc4	[code-simplifier] Align choice axiom naming in theory_array_full (#9660 ) This simplifies the recent `choice` axiom path in the SMT array solver for consistency with the SAT-side implementation. The change is purely structural: align local naming with the quantifier body it represents, inline a single-use literal, and remove stray whitespace in the array decl header. - Choice axiom cleanup - Rename the local implication term in `theory_array_full::instantiate_choice_axiom` from `ax` to `body` - Match the naming already used in `sat/smt/array_axioms.cpp::assert_choice_axiom` - Single-use literal inlining - Replace the temporary `literal l = mk_literal(q); assert_axiom(l);` with a direct call - Reduce noise without changing behavior - Header whitespace cleanup - Remove trailing whitespace in `src/ast/array_decl_plugin.h` ```c++ expr_ref body(m.mk_implies(px, pc), m); expr_ref q(m.mk_forall(1, &x_sort, &x_name, body), m); ctx.get_rewriter()(q); assert_axiom(mk_literal(q)); ``` --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: Nikolaj Bjorner <nbjorner@microsoft.com>	2026-06-01 16:03:42 -07:00
dependabot[bot]	b0536c3998	Bump github/gh-aw-actions from 0.76.1 to 0.77.0 (#9661 ) Bumps [github/gh-aw-actions](https://github.com/github/gh-aw-actions) from 0.76.1 to 0.77.0. Release notes, sourced from [github/gh-aw-actions's releases](https://github.com/github/gh-aw-actions/releases): v0.77.0: Sync of actions from [gh-aw](https://github.com/github/gh-aw) at `v0.77.0`. Commits: `b11be78` chore: sync actions from gh-aw@v0.77.0 ([#122](https://redirect.github.com/github/gh-aw-actions/issues/122)). See full diff in [compare view](https://github.com/github/gh-aw-actions/compare/v0.76.1...v0.77.0). Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-06-01 16:01:32 -07:00
Can Cebeci	8ddd435835	Fix misleading generation number in trace (#9687 ) Current implementation prints 0 when the cached generation is used	2026-06-01 16:00:59 -07:00
Nikolaj Bjorner	d025b34606	prepare for enodes over lambdas	2026-06-01 13:00:35 -07:00
Nikolaj Bjorner	705569df24	add include directive	2026-06-01 11:39:18 -07:00
Nikolaj Bjorner	ebdf031c8f	ensure engine is datalog for dl_table and dl_util tests	2026-05-31 15:32:23 -07:00
Nikolaj Bjorner	24e5a6ae3f	ensure base class has propagation Signed-off-by: Nikolaj Bjorner <nbjorner@microsoft.com>	2026-05-30 22:21:15 -07:00
Nikolaj Bjorner	a595e98707	fix regression: m_tmp_diseq has 0 arguments, you have to access the expression Signed-off-by: Nikolaj Bjorner <nbjorner@microsoft.com>	2026-05-30 18:57:21 -07:00
Nikolaj Bjorner	dbe986fdf7	move closure conversion to solver internalization - only the internalizer performs closure conversion - theory_array treats propagation of lambdas similar to stores - ho_matcher treats top-level flex patterns as first-order - pattern-inference fix to handle quantifiers (lambdas) in patterns that are computed	2026-05-30 18:41:37 -07:00
Nikolaj Bjorner	2cc4422018	use expr based access to enodes to allow for storing first-class lambdas	2026-05-30 15:13:08 -07:00
Lev Nachmanson	5f3088f3b5	CI: validate libz3.dylib architecture on macOS to prevent #9662 regression (#9669 )	2026-05-29 16:00:36 -07:00
Nikolaj Bjorner	30df8e7ece	build warnings	2026-05-29 10:17:46 -07:00
Nikolaj Bjorner	48bcee8e62	add lambda-t case in addition to p-lambda case	2026-05-29 01:18:34 -07:00
Copilot	b74e35f4fb	Fix mpz_manager leak in algebraic root comparison (#9654 ) A `root-obj`-driven unsat case was exiting with a leaked `mpz_manager` allocation even though solver output was correct. The leak came from temporary rational bounds created during algebraic-number comparison and not released before shutdown. - Root cause - `algebraic_numbers::compare_core()` materialized interval bounds as raw `mpq` temporaries. - Those temporaries could allocate backing `mpz` storage, but their lifetime was not tied to the manager, so the allocator retained leaked cells at process exit. - Change - Replace the raw `mpq` temporaries with `scoped_mpq` in `/src/math/polynomial/algebraic_numbers.cpp`. - This keeps the comparison logic unchanged while making temporary bound conversion use RAII-managed cleanup. - Effect - `root-obj` comparisons no longer leave `mpz_manager` allocations behind. - Solver behavior is unchanged; the fix is limited to temporary numeral lifetime management. ```c++ - mpq l_a, u_a, l_b, u_b; + scoped_mpq l_a(qm()), u_a(qm()), l_b(qm()), u_b(qm()); ``` Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>	2026-05-28 09:06:05 -07:00
Nikolaj Bjorner	0b56db7f07	fix #9657	2026-05-28 09:01:48 -07:00
Nikolaj Bjorner	b34a7b4319	use trail stack from context for ho-matcher	2026-05-28 07:57:07 -07:00
Nikolaj Bjorner	9d09a050e8	use max-top-generation Signed-off-by: Nikolaj Bjorner <nbjorner@microsoft.com>	2026-05-27 14:37:37 -07:00
Nikolaj Bjorner	17c6e0729b	control recursion depth for check function	2026-05-27 14:29:53 -07:00
Nikolaj Bjorner	5fe4d88d43	recognize ubv_to_int as part of BV logic	2026-05-27 13:08:54 -07:00
Copilot	51da9db615	Add SMT-LIB `choice` support via array `OP_CHOICE` and instantiate choice axioms in array solvers (#9649 ) This change wires SMT-LIB Hilbert choice parsing to a concrete array-theory operator and ensures both array backends enforce the expected semantic axiom. Previously, `(choice ((x T)) phi)` parsed as NYI and had no solver-side instantiation path. - Parser: lower `choice_k` into array `OP_CHOICE` - `pop_quant_frame(choice_k)` now builds `(choice p)` instead of throwing. - Added parser include/use of array utilities to construct the term directly from the generated lambda predicate. - Array decl plugin: add `OP_CHOICE` typing + surface syntax - Added declaration support for `choice` with signature: - `(Array T Bool) -> T` (encoded as `('a -> Bool) -> 'a` in HO view). - Added recognizer/util helpers (`is_choice`, `mk_choice`) and exposed `"choice"` in op names. - SMT array theory (`theory_array_full`): instantiate choice axiom - Added instantiation for each encountered `choice(p)`: - `forall x . p(x) => p(choice(p))` - Integrated into internalization/relevancy paths and statistics. - SAT/SMT array backend (`sat/smt/array_`): instantiate choice axiom - Added new axiom record kind for choice, internalization hook, assertion routine, and diagnostics/stat tracking. - Uses the same quantified implication schema as above. - Regression coverage - Extended SMT2 parser regression with an HO `choice` example to ensure parser/eval pipeline accepts and processes choice terms. Example of the now-supported input: ```smt2 (set-logic HO_ALL) (declare-sort U 0) (declare-fun P () (-> U Bool)) (assert (exists ((x U)) (P x))) (assert (= witness (choice ((x U)) (P x)))) ``` --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>	2026-05-27 10:05:06 -07:00
yhx-12243	690cdd3f25	fix(make-ts-wrapper): correct out buffer size for WASM (#9644 )	2026-05-27 10:03:39 -07:00
Copilot	eb4c3a0756	Update compare-stats anomaly reporter to read benchmark stats from `/z3/` (#9650 ) This updates the compare-stats anomaly reporter to retrieve benchmark data from `http://mtzguido.tplinkdns.com:8081/z3/` instead of the old `compare_stats.html` endpoint. The workflow prompt and generated lock file now consistently reference the root benchmark stats URL. - Workflow source - Repoint the benchmark source URL from `.../compare_stats.html` to `.../z3/` - Update prompt text to describe the source as benchmark statistics rather than a specific HTML file - Rename the temporary fetched artifact from `compare_stats.html` to `benchmark_stats.html` for consistency - Generated workflow - Regenerate `compare-stats-anomaly-reporter.lock.yml` so the compiled workflow matches the markdown source - Refresh the embedded workflow description and prompt payload to reference the new endpoint - Report output - Update the discussion template’s source link to point at the root benchmark stats page ```md Source URL: `http://mtzguido.tplinkdns.com:8081/z3/` curl -fsSL --max-time 60 "http://mtzguido.tplinkdns.com:8081/z3/" \ -o /tmp/gh-aw/agent/benchmark_stats.html ``` --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>	2026-05-27 09:57:20 -07:00
Copilot	1564e00215	smt2parser: realign `pop_app_frame` non-`expr_head` `else` block indentation (#9646 ) This updates a formatting regression introduced in the `pop_app_frame` non-`expr_head` path, where block indentation made control flow harder to read. The patch is whitespace-only and keeps parser behavior unchanged. - What changed - Reindented the `else` body in `src/parsers/smt2/smt2parser.cpp::pop_app_frame` so nested `if/else` structure is visually unambiguous. - Removed trailing spaces on the `m_ctx.mk_app(symbol("select"), ...)` lines in the same block. - Scope - No control-flow, data-flow, or API changes. - No changes outside `pop_app_frame`. ```cpp // Before else { local l; if (m_env.find(fr->m_f, l)) { ... } else { ... } } // After else { local l; if (m_env.find(fr->m_f, l)) { ... } else { ... } } ``` --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>	2026-05-27 09:35:54 -07:00
Copilot	b3fff5b399	Add compare-stats anomaly reporter workflow for 30h bug/crash triage (#9647 ) This adds an agentic workflow that analyzes `compare_stats.html` over a rolling 30-hour window and publishes a GitHub Discussion summarizing bugs, crashes, and anomalies. It explicitly captures unknown-outlier patterns where a benchmark is `unknown` while peers in the same set are mostly `sat`/`unsat`/`timeout`. - Workflow added - Introduces `.github/workflows/compare-stats-anomaly-reporter.md` (plus compiled `.lock.yml`). - Supports `workflow_dispatch` and scheduled execution. - Uses safe discussion output with auto-close of older reports for the same stream. - Data acquisition + robustness - Fetches `http://mtzguido.tplinkdns.com:8081/z3/compare_stats.html` with `curl` and `wget` fallback. - Adds integrity checks (non-empty HTML/table presence) and explicit incomplete-report behavior on fetch/parse failures. - 30-hour analysis semantics - Filters rows by timestamp candidates (`time`, `timestamp`, `date`, `run`, etc.) using UTC. - Falls back to full-table analysis when timestamps are unavailable, and marks the report accordingly. - Classification logic - Detects bug/crash signals from status/details (`crash`, `segfault`, `assert`, `abort`, `exception`, `error`, `failed`, `bug`). - Detects: - unknown-outlier anomalies (thresholded minority `unknown` in otherwise decisive SAT-family outcomes), - status divergences (conflicting non-timeout outcomes for same benchmark), - repeated hard-failure anomalies. - Discussion output shape - Produces a compact report with executive counts, bug/crash table, anomaly subsections, and raw extraction summary/limitations. ```yaml safe-outputs: create-discussion: title-prefix: "[Compare Stats] " category: "agentic workflows" close-older-discussions: true ``` --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>	2026-05-27 09:25:33 -07:00
Nikolaj Bjorner	1aa2158bf4	fix comment Signed-off-by: Nikolaj Bjorner <nbjorner@microsoft.com>	2026-05-27 09:02:39 -07:00
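
Several commits in this listing (1f28fd0e6b, f8925ca6fa, a77155a5c4, 3afd83103a) concern derivatives of regular expressions. As background only, the sketch below shows the classical Brzozowski derivative on a tiny regex AST. It is not the ITE-tree symbolic construction that `seq::derive` is described as implementing, and every name in it (`Empty`, `Eps`, `Char`, `Concat`, `Union`, `Star`, `Compl`, `nullable`, `derive`, `matches`) is hypothetical and chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Empty:
    """Matches no string."""


@dataclass(frozen=True)
class Eps:
    """Matches only the empty string."""


@dataclass(frozen=True)
class Char:
    c: str


@dataclass(frozen=True)
class Concat:
    left: object
    right: object


@dataclass(frozen=True)
class Union:
    left: object
    right: object


@dataclass(frozen=True)
class Star:
    body: object


@dataclass(frozen=True)
class Compl:
    body: object


def nullable(r):
    """Return True if regex r accepts the empty string."""
    if isinstance(r, (Eps, Star)):
        return True
    if isinstance(r, (Empty, Char)):
        return False
    if isinstance(r, Concat):
        return nullable(r.left) and nullable(r.right)
    if isinstance(r, Union):
        return nullable(r.left) or nullable(r.right)
    if isinstance(r, Compl):
        return not nullable(r.body)
    raise TypeError(f"unknown regex node: {r!r}")


def derive(ch, r):
    """Derivative of r w.r.t. the concrete character ch (no simplification)."""
    if isinstance(r, (Empty, Eps)):
        return Empty()
    if isinstance(r, Char):
        return Eps() if r.c == ch else Empty()
    if isinstance(r, Union):
        return Union(derive(ch, r.left), derive(ch, r.right))
    if isinstance(r, Concat):
        head = Concat(derive(ch, r.left), r.right)
        # If the left part can be empty, ch may also be consumed by the right part.
        return Union(head, derive(ch, r.right)) if nullable(r.left) else head
    if isinstance(r, Star):
        return Concat(derive(ch, r.body), r)
    if isinstance(r, Compl):
        return Compl(derive(ch, r.body))
    raise TypeError(f"unknown regex node: {r!r}")


def matches(r, s):
    """Match s by deriving character by character, then testing nullability."""
    for ch in s:
        r = derive(ch, r)
    return nullable(r)
```

With `ab_star = Star(Concat(Char("a"), Char("b")))`, `matches(ab_star, "abab")` holds while `matches(ab_star, "aba")` does not. The sketch omits the canonicalization (flatten, sort, dedup) and depth bounding that the commit messages mention, so derivative terms grow without simplification.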


22346 commits
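
Commit 1d706e875c in the listing above describes treating SIGXCPU (sent when a `ulimit -t` CPU-time limit expires) like a regular timeout instead of letting the process die. The Python sketch below illustrates that general pattern on a POSIX system; it is not Z3's implementation, and the names `CpuLimitExceeded` and `run_with_cpu_limit_handling` are hypothetical.

```python
import signal


class CpuLimitExceeded(Exception):
    """Raised when the soft CPU-time limit expires (SIGXCPU)."""


def _on_sigxcpu(signum, frame):
    # Convert the signal into an exception so it unwinds like a normal timeout.
    raise CpuLimitExceeded()


def run_with_cpu_limit_handling(work):
    """Run work(); report ("timeout", None) if SIGXCPU arrives while it runs.

    Must be called from the main thread, since Python only allows
    installing signal handlers there.
    """
    previous = signal.signal(signal.SIGXCPU, _on_sigxcpu)
    try:
        return ("ok", work())
    except CpuLimitExceeded:
        # A real tool would print its statistics here before exiting cleanly.
        return ("timeout", None)
    finally:
        # Restore whatever handler was installed before.
        signal.signal(signal.SIGXCPU, previous)
```

In practice the signal would come from the kernel once the soft `RLIMIT_CPU` limit is reached; for a quick local check it can be simulated with `signal.raise_signal(signal.SIGXCPU)` inside the work function.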