3
0
Fork 0
mirror of https://github.com/Z3Prover/z3 synced 2026-05-28 12:56:28 +00:00
z3/.github/workflows
Copilot b3fff5b399
Add compare-stats anomaly reporter workflow for 30h bug/crash triage (#9647)
This adds an agentic workflow that analyzes `compare_stats.html` over a
rolling 30-hour window and publishes a GitHub Discussion summarizing
bugs, crashes, and anomalies. It explicitly captures unknown-outlier
patterns where a benchmark is `unknown` while peers in the same set are
mostly `sat`/`unsat`/`timeout`.

- **Workflow added**
- Introduces `.github/workflows/compare-stats-anomaly-reporter.md` (plus
compiled `.lock.yml`).
  - Supports `workflow_dispatch` and scheduled execution.
- Uses safe discussion output with auto-close of older reports for the
same stream.

- **Data acquisition + robustness**
- Fetches `http://mtzguido.tplinkdns.com:8081/z3/compare_stats.html`
with `curl` and `wget` fallback.
- Adds integrity checks (non-empty HTML/table presence) and explicit
incomplete-report behavior on fetch/parse failures.

- **30-hour analysis semantics**
- Filters rows by timestamp candidates (`time`, `timestamp`, `date`,
`run`, etc.) using UTC.
- Falls back to full-table analysis when timestamps are unavailable, and
marks the report accordingly.

- **Classification logic**
- Detects bug/crash signals from status/details (`crash`, `segfault`,
`assert`, `abort`, `exception`, `error`, `failed`, `bug`).
  - Detects:
- unknown-outlier anomalies (thresholded minority `unknown` in otherwise
decisive SAT-family outcomes),
- status divergences (conflicting non-timeout outcomes for same
benchmark),
    - repeated hard-failure anomalies.

- **Discussion output shape**
- Produces a compact report with executive counts, bug/crash table,
anomaly subsections, and raw extraction summary/limitations.

```yaml
safe-outputs:
  create-discussion:
    title-prefix: "[Compare Stats] "
    category: "agentic workflows"
    close-older-discussions: true
```

---------

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
2026-05-27 09:25:33 -07:00
..
shared Upgrade agentic workflows to gh-aw v0.36.0 (#8122) 2026-01-08 11:50:35 -08:00
a3-python.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
a3-python.md Add noop report-as-issue: false to all agentic workflow frontmatter 2026-03-12 20:01:30 +00:00
academic-citation-tracker.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
academic-citation-tracker.md Fix agentic workflow compilation errors (gh-aw v0.68 compat) (#9275) 2026-04-11 10:19:45 -07:00
agentics-maintenance.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
android-build.yml Bump actions/upload-artifact from 7.0.0 to 7.0.1 (#9300) 2026-04-19 16:51:02 +02:00
api-coherence-checker.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
api-coherence-checker.md Fix agentic workflow compilation errors (gh-aw v0.68 compat) (#9275) 2026-04-11 10:19:45 -07:00
build-warning-fixer.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
build-warning-fixer.md Fix agentic workflow compilation errors (gh-aw v0.68 compat) (#9275) 2026-04-11 10:19:45 -07:00
build-z3-cache.yml Bump actions/cache from 5.0.4 to 5.0.5 (#9299) 2026-04-19 15:57:29 +02:00
ci.yml Make manylinux Python selection dynamic in CI and release workflows (#9502) 2026-05-12 12:42:04 -04:00
code-conventions-analyzer.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
code-conventions-analyzer.md Fix agentic workflow compilation errors (gh-aw v0.68 compat) (#9275) 2026-04-11 10:19:45 -07:00
code-simplifier.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
code-simplifier.md Add noop report-as-issue: false to code-simplifier workflow (#9397) 2026-04-26 18:28:30 +02:00
compare-stats-anomaly-reporter.lock.yml Add compare-stats anomaly reporter workflow for 30h bug/crash triage (#9647) 2026-05-27 09:25:33 -07:00
compare-stats-anomaly-reporter.md Add compare-stats anomaly reporter workflow for 30h bug/crash triage (#9647) 2026-05-27 09:25:33 -07:00
copilot-setup-steps.yml update aw to current version 2026-01-08 18:15:03 +00:00
coverage.yml Bump actions/upload-artifact from 7.0.0 to 7.0.1 (#9300) 2026-04-19 16:51:02 +02:00
cross-build.yml Bump actions/checkout from 5.0.1 to 6.0.2 (#9018) 2026-03-16 15:52:35 -07:00
csa-analysis.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
csa-analysis.md Fix agentic workflow compilation errors (gh-aw v0.68 compat) (#9275) 2026-04-11 10:19:45 -07:00
docs.yml Bump actions/upload-artifact from 7.0.0 to 7.0.1 (#9300) 2026-04-19 16:51:02 +02:00
issue-backlog-processor.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
issue-backlog-processor.md Fix Issue Backlog Processor: prevent context exhaustion by batching and requiring safe output (#9272) 2026-04-11 10:21:01 -07:00
mark-prs-ready-for-review.yml Bump actions/github-script from 8.0.0 to 9.0.0 (#9296) 2026-04-19 16:49:03 +02:00
memory-safety-report.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
memory-safety-report.md Fix agentic workflow compilation errors (gh-aw v0.68 compat) (#9275) 2026-04-11 10:19:45 -07:00
memory-safety.yml Bump actions/upload-artifact from 7.0.0 to 7.0.1 (#9300) 2026-04-19 16:51:02 +02:00
msvc-static-build-clang-cl.yml Bump actions/checkout from 5.0.1 to 6.0.2 (#9018) 2026-03-16 15:52:35 -07:00
msvc-static-build.yml Bump actions/checkout from 5.0.1 to 6.0.2 (#9018) 2026-03-16 15:52:35 -07:00
nightly-validation.yml Add riscv64 wheel builds to nightly and release PyPI publishing (#9153) 2026-03-28 15:26:59 -07:00
nightly.yml Publish Pyodide wheels from nightly and release workflows (#9588) 2026-05-21 11:17:23 -07:00
nuget-build.yml Bump nuget/setup-nuget from 3 to 4 (#9350) 2026-04-21 19:26:55 +02:00
ocaml.yaml Bump actions/cache from 5.0.4 to 5.0.5 (#9299) 2026-04-19 15:57:29 +02:00
ostrich-benchmark.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
ostrich-benchmark.md fix(ostrich-benchmark): add safeoutputs keepalive noop calls before long benchmark run (#9313) 2026-04-16 03:22:33 +02:00
pyodide.yml Bump actions/checkout from 5.0.1 to 6.0.2 (#9018) 2026-03-16 15:52:35 -07:00
qf-s-benchmark.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
qf-s-benchmark.md fix(qf-s-benchmark): add safeoutputs keepalive noop after build, reduce cap 500→300 (#9290) 2026-04-12 18:26:55 -07:00
release-notes-updater.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
release-notes-updater.md Fix agentic workflow compilation errors (gh-aw v0.68 compat) (#9275) 2026-04-11 10:19:45 -07:00
release.yml Publish Pyodide wheels from nightly and release workflows (#9588) 2026-05-21 11:17:23 -07:00
smtlib-benchmark-finder.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
smtlib-benchmark-finder.md Add smtlib-benchmark-finder agentic workflow (#9629) 2026-05-26 15:28:11 -07:00
specbot-crash-analyzer.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
specbot-crash-analyzer.md Fix agentic workflow compilation errors (gh-aw v0.68 compat) (#9275) 2026-04-11 10:19:45 -07:00
tactic-to-simplifier.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
tactic-to-simplifier.md Fix agentic workflow compilation errors (gh-aw v0.68 compat) (#9275) 2026-04-11 10:19:45 -07:00
tptp-benchmark.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
tptp-benchmark.md Add weekly TPTP front-end benchmark workflow (#9523) 2026-05-13 06:05:08 -04:00
wasm-release.yml Bump mymindstorm/setup-emsdk from 15 to 16 (#9297) 2026-04-19 15:57:13 +02:00
wasm.yml Bump mymindstorm/setup-emsdk from 15 to 16 (#9297) 2026-04-19 15:57:13 +02:00
Windows.yml Bump microsoft/setup-msbuild from 2 to 3 (#9109) 2026-03-23 16:33:25 -07:00
wip.yml Bump actions/checkout from 5.0.1 to 6.0.2 (#9018) 2026-03-16 15:52:35 -07:00
workflow-suggestion-agent.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
workflow-suggestion-agent.md Fix agentic workflow compilation errors (gh-aw v0.68 compat) (#9275) 2026-04-11 10:19:45 -07:00
zipt-code-reviewer.lock.yml Bump github/gh-aw-actions from 0.74.8 to 0.76.1 (#9630) 2026-05-26 18:10:06 -07:00
zipt-code-reviewer.md Fix agentic workflow compilation errors (gh-aw v0.68 compat) (#9275) 2026-04-11 10:19:45 -07:00