DECNET/decnet/ttp/data/intel/abuseipdb.yaml
anti d25f69ba1b feat(ttp): extract intel_lifter provider mappings to YAML data + ATT&CK external_reference enrichment
The four provider→technique tables (AbuseIPDB cat→techniques,
GreyNoise tag→techniques, ThreatFox threat_type→techniques, plus
the Feodo binary-listed signal) used to live as Final[dict] constants
in intel_lifter.py. Two real problems with that:

1. Drift between rules/ttp/R0054.yaml..R0058.yaml (which declare
   the full slate per provider) and the Python dicts (which decide
   which slate-member fires per signal). The v2 audit comment in
   intel_lifter.py documented that they had silently drifted.
2. No ATT&CK provenance on emissions — the loaded STIX bundle has
   rich external_references (canonical attack.mitre.org URLs) that
   never surfaced because the lifter had no path back to them.

Mappings now live as YAML at decnet/ttp/data/intel/{provider}.yaml,
validated at load against the loaded ATT&CK bundle, with each entry
enriched by attack_stix._attack_pattern_by_id to attach the canonical
MITRE URL to every emission.
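
The validate-and-enrich step described above can be sketched roughly as follows. This is a minimal illustration, not the code in intel_loader.py: the `PATTERNS` dict is a hypothetical in-memory stand-in for whatever attack_stix._attack_pattern_by_id returns, `mitre_url_for` is an invented helper name, and `AttackBundleError` is redefined locally so the snippet runs on its own. The only thing taken from ATT&CK itself is that attack-pattern objects carry an external reference with source_name "mitre-attack" and a url.

```python
from typing import Mapping


class AttackBundleError(Exception):
    """Local stand-in for the error raised on an unknown technique_id."""


def mitre_url_for(technique_id: str, patterns: Mapping[str, dict]) -> str:
    """Return the attack.mitre.org URL for a technique, or raise if unresolved."""
    pattern = patterns.get(technique_id)
    if pattern is None:
        raise AttackBundleError(f"unknown technique_id: {technique_id}")
    for ref in pattern.get("external_references", []):
        if ref.get("source_name") == "mitre-attack" and "url" in ref:
            return ref["url"]
    raise AttackBundleError(f"no mitre-attack URL for {technique_id}")


# Hypothetical one-entry bundle index, keyed by ATT&CK technique ID.
PATTERNS = {
    "T1110": {
        "type": "attack-pattern",
        "name": "Brute Force",
        "external_references": [
            {
                "source_name": "mitre-attack",
                "external_id": "T1110",
                "url": "https://attack.mitre.org/techniques/T1110",
            }
        ],
    }
}

url = mitre_url_for("T1110", PATTERNS)
```

Failing at load time rather than at emit time means a typo in a mapping file is caught before any signal is processed.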

- decnet/ttp/data/intel_loader.py: pydantic-validated schema +
  ProviderMapping/Signal/TechniqueEmission frozen dataclasses +
  load_provider_mapping(provider) lru-cached.
- Per-technique high_score_threshold inlined into YAML
  (collapses the separate _ABUSEIPDB_HIGH_SCORE_GATED dict).
- external_reference field follows the STIX 2.1 external-reference
  shape (source_name + url + optional external_id) so the future
  STIX/MISP exporter is a direct translation.
- intel_lifter.py: dicts deleted, decision functions read from
  ProviderMapping accessors. Decision-flow constants (T1071/T1595
  bare-classification fallbacks in _greynoise_decisions) stay in
  code — they're not table rows.
- Each emit slot's evidence_extra now carries mitre_url for any
  technique resolved in the bundle (every one in practice).
- tests/ttp/test_intel_mappings.py: snapshot equivalence vs the
  legacy dicts, high-score gate behavior, every-signal-has-an-
  external-reference, every-emission-has-a-mitre-url, negative
  paths (unknown technique_id raises AttackBundleError, mismatched
  provider field rejected, dir listing matches expected providers).
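
For orientation, the frozen-dataclass and lru_cache arrangement listed above might look something like the sketch below. The class and function names come from this commit message; everything else is assumed for illustration: the field sets, the `signal()` accessor, and `_PARSED_DOCS`, which stands in for the parsed YAML so the snippet needs neither PyYAML nor pydantic.

```python
from dataclasses import dataclass
from functools import lru_cache
from typing import Optional, Tuple


@dataclass(frozen=True)
class TechniqueEmission:
    technique_id: str
    high_score_threshold: Optional[int] = None  # None: no score gate


@dataclass(frozen=True)
class Signal:
    id: str
    label: str
    techniques: Tuple[TechniqueEmission, ...]


@dataclass(frozen=True)
class ProviderMapping:
    provider: str
    mapping_version: str
    signals: Tuple[Signal, ...]

    def signal(self, signal_id: str) -> Optional[Signal]:
        """Hypothetical accessor: look up one signal by its id."""
        return next((s for s in self.signals if s.id == signal_id), None)


# Assumption: stands in for the parsed contents of {provider}.yaml.
_PARSED_DOCS = {
    "abuseipdb": {
        "provider": "abuseipdb",
        "mapping_version": "2",
        "signals": [
            {
                "id": "cat_11",
                "label": "Email Spam",
                "techniques": [
                    {"technique_id": "T1496"},
                    {"technique_id": "T1566", "high_score_threshold": 80},
                ],
            }
        ],
    }
}


@lru_cache(maxsize=None)
def load_provider_mapping(provider: str) -> ProviderMapping:
    """Build the immutable mapping once per provider; later calls hit the cache."""
    doc = _PARSED_DOCS[provider]
    if doc["provider"] != provider:
        raise ValueError(f"provider field {doc['provider']!r} does not match {provider!r}")
    signals = tuple(
        Signal(
            id=s["id"],
            label=s["label"],
            techniques=tuple(TechniqueEmission(**t) for t in s["techniques"]),
        )
        for s in doc["signals"]
    )
    return ProviderMapping(doc["provider"], doc["mapping_version"], signals)


mapping = load_provider_mapping("abuseipdb")
```

Using tuples and frozen dataclasses keeps the cached object safe to share: a caller cannot mutate the mapping that every later caller will receive from the cache.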

The YAML schema + mitre_url enrichment lays groundwork for the
future STIX exporter; this commit does NOT build that exporter.
2026-05-09 06:18:25 -04:00

126 lines
3.6 KiB
YAML

# AbuseIPDB category → ATT&CK technique mapping.
#
# Mirrors what _ABUSEIPDB_CATEGORY_TO_TECHNIQUES + _ABUSEIPDB_HIGH_SCORE_GATED
# used to encode in decnet/ttp/impl/intel_lifter.py before the data
# extraction. Source-of-truth column for which categories produce
# which ATT&CK tags, paired with rules/ttp/R0054.yaml which declares
# the full slate the predicate can emit.
#
# Cat 4 (DDoS), 10 (Web Spam), 12 (Blog Spam) are intentionally
# unmapped — design doc TTP_TAGGING.md §A.10: DDoS-without-protocol
# is too muddy for v0; CMS spam has no clean ATT&CK fit at the IP
# layer. Keep the explanatory comments here so the next quarterly
# drift check (development/DEBT.md DEBT-048) can diff cheaply.
provider: abuseipdb
mapping_version: "2"
attack_release: ">=15.1"
signals:
  - id: cat_5
    label: "FTP Brute-Force"
    external_reference:
      source_name: abuseipdb
      url: "https://www.abuseipdb.com/categories#5"
    techniques:
      - technique_id: T1110
  - id: cat_7
    label: "Phishing"
    external_reference:
      source_name: abuseipdb
      url: "https://www.abuseipdb.com/categories#7"
    techniques:
      - technique_id: T1566
  - id: cat_9
    label: "Open Proxy"
    external_reference:
      source_name: abuseipdb
      url: "https://www.abuseipdb.com/categories#9"
    techniques:
      - technique_id: T1090
  - id: cat_11
    label: "Email Spam"
    external_reference:
      source_name: abuseipdb
      url: "https://www.abuseipdb.com/categories#11"
    techniques:
      - technique_id: T1496
      - technique_id: T1566
        high_score_threshold: 80
  - id: cat_13
    label: "VPN IP"
    external_reference:
      source_name: abuseipdb
      url: "https://www.abuseipdb.com/categories#13"
    techniques:
      - technique_id: T1090
  - id: cat_14
    label: "Port Scan"
    external_reference:
      source_name: abuseipdb
      url: "https://www.abuseipdb.com/categories#14"
    techniques:
      - technique_id: T1046
      - technique_id: T1595
  - id: cat_15
    label: "Hacking"
    external_reference:
      source_name: abuseipdb
      url: "https://www.abuseipdb.com/categories#15"
    techniques:
      - technique_id: T1190
  - id: cat_16
    label: "SQL Injection"
    external_reference:
      source_name: abuseipdb
      url: "https://www.abuseipdb.com/categories#16"
    techniques:
      - technique_id: T1190
  - id: cat_17
    label: "Spoofing"
    external_reference:
      source_name: abuseipdb
      url: "https://www.abuseipdb.com/categories#17"
    techniques:
      - technique_id: T1566
  - id: cat_18
    label: "Brute-Force"
    external_reference:
      source_name: abuseipdb
      url: "https://www.abuseipdb.com/categories#18"
    techniques:
      - technique_id: T1110
  - id: cat_19
    label: "Bad Web Bot"
    external_reference:
      source_name: abuseipdb
      url: "https://www.abuseipdb.com/categories#19"
    techniques:
      - technique_id: T1595
  - id: cat_20
    label: "Exploited Host"
    external_reference:
      source_name: abuseipdb
      url: "https://www.abuseipdb.com/categories#20"
    techniques:
      - technique_id: T1078
  - id: cat_21
    label: "Web App Attack"
    external_reference:
      source_name: abuseipdb
      url: "https://www.abuseipdb.com/categories#21"
    techniques:
      - technique_id: T1190
  - id: cat_22
    label: "SSH"
    external_reference:
      source_name: abuseipdb
      url: "https://www.abuseipdb.com/categories#22"
    techniques:
      - technique_id: T1110
  - id: cat_23
    label: "IoT Targeted"
    external_reference:
      source_name: abuseipdb
      url: "https://www.abuseipdb.com/categories#23"
    techniques:
      - technique_id: T1190
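
The cat_11 entry is the only one in this file with a per-technique high_score_threshold. A rough sketch of how such a gate could be evaluated is below. It assumes the threshold is compared against the AbuseIPDB confidence score with an inclusive `>=`, and that ungated techniques always fire; neither detail is confirmed by the file itself, and `emitted_techniques` is a hypothetical helper name, not a function from intel_lifter.py.

```python
from typing import Dict, List, Union

TechniqueEntry = Dict[str, Union[str, int]]


def emitted_techniques(techniques: List[TechniqueEntry], score: int) -> List[str]:
    """Return the technique IDs that pass their (optional) score gate."""
    emitted = []
    for entry in techniques:
        threshold = entry.get("high_score_threshold")
        # Assumption: no threshold means "always emit"; the comparison is inclusive.
        if threshold is None or score >= threshold:
            emitted.append(entry["technique_id"])
    return emitted


# The cat_11 techniques list, as it would look after YAML parsing.
CAT_11 = [
    {"technique_id": "T1496"},
    {"technique_id": "T1566", "high_score_threshold": 80},
]

low = emitted_techniques(CAT_11, 40)   # only the ungated technique
high = emitted_techniques(CAT_11, 90)  # both techniques
```

Under these assumptions an Email Spam report with a low score tags only T1496, while a high-score report also tags T1566.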