feat(bench): add Criterion benchmarks and CSIC 2010 dataset converter

Add eight scanner benchmarks covering the allowlist fast path (7.6 ns),
the model path (172-445 ns), and feature extraction (248 ns). A Python
converter script transforms the raw CSIC 2010 HTTP dataset into Sunbeam
audit-log JSONL, adapting its fields to realistic scanner features.
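
For illustration only, the conversion step could be sketched as below. This
is not the script added by this commit: the record fields (`method`, `host`,
`path`, `query`, `user_agent`, `body_len`, `label`) and the helper names
`parse_request` and `to_jsonl` are hypothetical, and the real Sunbeam
audit-log schema may differ. The sketch assumes each request has already been
split out of the dataset file as one raw text block.

```python
import json
from urllib.parse import urlsplit


def parse_request(raw: str, label: str) -> dict:
    """Parse one raw HTTP request into a flat record (hypothetical schema)."""
    # Headers and body are separated by the first blank line.
    head, _, body = raw.replace("\r\n", "\n").partition("\n\n")
    lines = head.strip().split("\n")

    # Request line: METHOD TARGET VERSION
    method, target, _version = lines[0].split(" ", 2)

    # Header names are case-insensitive, so store them lowercased.
    headers = {}
    for line in lines[1:]:
        name, _, value = line.partition(":")
        headers[name.strip().lower()] = value.strip()

    url = urlsplit(target)
    return {
        "method": method,
        "host": headers.get("host", url.netloc),
        "path": url.path,
        "query": url.query,
        "user_agent": headers.get("user-agent", ""),
        "body_len": len(body.strip()),
        "label": label,  # e.g. "normal" or "attack" (assumed label values)
    }


def to_jsonl(records) -> str:
    """Serialize records as JSON Lines: one JSON object per line."""
    return "".join(json.dumps(r, sort_keys=True) + "\n" for r in records)
```

Calling `to_jsonl([parse_request(raw, "normal")])` yields one line per request,
which matches the `*.jsonl` files this commit adds to `.gitignore`.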

Signed-off-by: Sienna Meridian Satterwhite <sienna@sunbeam.pt>
2026-03-10 23:38:20 +00:00
parent 867b6b2489
commit 45f0751e1e
3 changed files with 540 additions and 0 deletions

.gitignore (5 lines changed)

@@ -4,3 +4,8 @@ certs/
*.pem
*.key
*.crt
# Training data and model binaries
*.bin
*.jsonl
heuristics.toml