Add ASan/UBSan memory safety CI workflow

Adds a workflow that builds and tests Z3 with AddressSanitizer and UndefinedBehaviorSanitizer on every push to catch runtime memory errors and undefined behavior. Runs unit tests, SMT-LIB2 benchmarks, and regression tests under both sanitizers. Includes a Copilot agentic workflow to generate summary reports as GitHub Discussions. Triggered on push (path-filtered to src/) and manual dispatch.
2026-05-14 14:15:29 +00:00 · 2026-03-04 16:31:29 +00:00 · 2026-03-04 16:31:29 +00:00 · 123bb620d4
commit 123bb620d4
parent d40c4721d8
2 changed files with 457 additions and 0 deletions
--- a/.github/workflows/memory-safety-report.md
+++ b/.github/workflows/memory-safety-report.md
@ -0,0 +1,205 @@
+---
+description: >
+  Generates a detailed Memory Safety report for Z3 by analyzing ASan/UBSan
+  sanitizer logs from the memory-safety workflow, posting findings as a
+  GitHub Discussion.
+
+on:
+  workflow_run:
+    workflows: ["Memory Safety Analysis"]
+    types: [completed]
+  workflow_dispatch:
+
+timeout-minutes: 30
+
+permissions:
+  actions: read
+  contents: read
+  discussions: write
+
+network: defaults
+
+tools:
+  cache-memory: true
+  github:
+    toolsets: [default]
+  bash: [":*"]
+  glob: {}
+  view: {}
+
+safe-outputs:
+  create-discussion:
+    title-prefix: "[Memory Safety] "
+    category: "Agentic Workflows"
+    close-older-discussions: true
+  missing-tool:
+    create-issue: true
+
+steps:
+  - name: Checkout repository
+    uses: actions/checkout@v5
+    with:
+      persist-credentials: false
+
+---
+
+# Memory Safety Analysis Report Generator
+
+## Job Description
+
+Your name is ${{ github.workflow }}. You are an expert memory safety analyst for the Z3 theorem prover repository `${{ github.repository }}`. Your task is to download, analyze, and report on the results from the Memory Safety Analysis workflow, covering runtime sanitizer (ASan/UBSan) findings.
+
+## Your Task
+
+### 1. Download Artifacts from the Triggering Workflow Run
+
+If triggered by `workflow_run`, download the artifacts from the completed Memory Safety Analysis run:
+
+```bash
+# Get the triggering run ID
+RUN_ID="${{ github.event.workflow_run.id }}"
+
+# If manual dispatch, find the latest Memory Safety Analysis run
+if [ -z "$RUN_ID" ] || [ "$RUN_ID" = "" ]; then
+  echo "Manual dispatch — finding latest Memory Safety Analysis run..."
+  gh run list --workflow="Memory Safety Analysis" --limit=1 --json databaseId --jq '.[0].databaseId'
+fi
+```
+
+Download all artifacts:
+
+```bash
+mkdir -p /tmp/reports
+gh run download "$RUN_ID" --dir /tmp/reports 2>&1 || echo "Some artifacts may not be available"
+ls -la /tmp/reports/
+```
+
+### 2. Analyze Sanitizer Reports
+
+Parse the ASan and UBSan report files:
+
+```bash
+# Check ASan results
+if [ -d /tmp/reports/asan-reports ]; then
+  cat /tmp/reports/asan-reports/summary.md
+  ls /tmp/reports/asan-reports/
+fi
+
+# Check UBSan results
+if [ -d /tmp/reports/ubsan-reports ]; then
+  cat /tmp/reports/ubsan-reports/summary.md
+  ls /tmp/reports/ubsan-reports/
+fi
+```
+
+For each sanitizer finding, extract:
+- **Error type** (heap-buffer-overflow, heap-use-after-free, stack-buffer-overflow, signed-integer-overflow, null-pointer-dereference, etc.)
+- **Source location** (file, line, column)
+- **Stack trace** (first 5 frames)
+- **Allocation/deallocation site** (for memory errors)
+
+### 3. Compare with Previous Results
+
+Check cache memory for previous run results:
+- Total findings from last run (ASan + UBSan)
+- List of previously known issues
+- Identify new findings (regressions) vs. resolved findings (improvements)
+
+### 4. Generate the Discussion Report
+
+Create a comprehensive GitHub Discussion with this structure:
+
+```markdown
+# Memory Safety Analysis Report
+
+**Date**: YYYY-MM-DD
+**Commit**: `<short SHA>` on branch `<branch>`
+**Triggered by**: push / workflow_dispatch
+**Workflow Run**: [#<run_id>](link)
+
+## Executive Summary
+
+| Category | ASan | UBSan | Total |
+|----------|------|-------|-------|
+| Buffer Overflow | Y | - | Z |
+| Use-After-Free | Y | - | Z |
+| Double-Free | Y | - | Z |
+| Null Dereference | - | - | Z |
+| Integer Overflow | - | Y | Z |
+| Undefined Behavior | - | Y | Z |
+| Other | Y | Z | Z |
+| **Total** | **Y** | **Z** | **N** |
+
+## Trend
+
+- New findings since last run: N
+- Resolved since last run: N
+- Unchanged: N
+
+## Critical Findings (Immediate Action Needed)
+
+[List any high-severity findings: buffer overflows, use-after-free, double-free]
+
+## Important Findings (Should Fix)
+
+[List medium-severity: null derefs, integer overflows]
+
+## Low-Severity / Informational
+
+[List warnings: potential issues]
+
+## ASan Findings
+
+[Each finding with error type, location, and stack trace snippet]
+
+## UBSan Findings
+
+[Each finding with error type, location, and explanation]
+
+## Top Affected Files
+
+| File | Findings |
+|------|----------|
+| src/... | N |
+
+## Recommendations
+
+1. [Actionable recommendations based on the findings]
+2. [Patterns to address]
+
+<details>
+<summary>Raw Data</summary>
+
+[Compressed summary of all data for future reference]
+
+</details>
+```
+
+### 5. Update Cache Memory
+
+Store the current run's results in cache memory for future comparison:
+- Total count by category
+- List of file:line pairs with findings
+- Run metadata (commit SHA, date, run ID)
+
+### 6. Handle Edge Cases
+
+- If the triggering workflow failed entirely, report that analysis could not complete and include any partial results.
+- If no artifacts are available, report that and suggest running the workflow manually.
+- If zero findings across all tools, create a discussion noting the clean bill of health.
+
+## Guidelines
+
+- **Be thorough**: Analyze every available artifact and log file.
+- **Be accurate**: Distinguish between ASan and UBSan findings.
+- **Be actionable**: For each finding, include enough context to locate and understand the issue.
+- **Track trends**: Use cache memory to identify regressions and improvements over time.
+- **Prioritize**: Critical memory safety issues (buffer overflow, UAF, double-free) should be prominently highlighted.
+
+## Important Notes
+
+- **DO NOT** create pull requests or modify source files.
+- **DO NOT** attempt to fix the findings automatically.
+- **DO** close older Memory Safety discussions automatically (configured via `close-older-discussions: true`).
+- **DO** always report the commit SHA so findings can be correlated with specific code versions.
+- **DO** use cache memory to track trends over multiple runs.