Voice fraud governance
Ingest audio and get a forensic risk event with acoustic tags and explainable reason codes, not just a score.
Voice governance infrastructure
Built for compliance, fraud, and risk teams accountable to auditors and regulators.
Open demoServices
Supervisory documentation, audit trails, and defensible evidence — the artifacts regulators ask for.
Ingest audio and get a forensic risk event with acoustic tags and explainable reason codes, not just a score.
Forensic reports with measurements and rationale, designed for FINRA review, EU AI Act documentation, and FinCEN SAR filings.
Multi-codec spectral and temporal analysis so experts can validate detections and produce compliance-ready reports.
Technology
Detection is only the first layer. Every output is paired with explainable evidence and governance artifacts built for regulated environments.
Detection layer
Each sensor tracks a different physical property of speech. They run in parallel and converge into one documented decision path.
Respiratory physiology
Measures voicing duration against human respiratory limits. Synthetic pipelines can sustain phonation patterns that healthy biological speech cannot.
Environmental consistency
Tracks room response and ambient consistency across the call. Spliced or generated segments often break that acoustic continuity.
Temporal dynamics
Computes multi-scale permutation entropy of pitch and energy movement. Natural speech keeps characteristic complexity that synthetic output often over-regularizes.
Harmonic resonance
Evaluates harmonic coherence patterns in the source signal. Natural vocal fold behavior produces structures that synthetic engines struggle to replicate consistently.
Governance layer
The governance layer turns sensor output into traceable decisions, from first flag through compliance documentation.
Explainability
Every verdict links to specific measurements and documented thresholds so investigators can follow the logic step-by-step.
Reproducibility
Frozen baselines and versioned calibration show exactly what was in production when each decision was made.
Regulatory readiness
Flags are tested against alternative explanations before escalation, including codec degradation, bandwidth limits, and environmental noise.
Design principles
Product constraints are governance features. What we refuse to do is as important as what we automate.
Classification uses inspectable linear models on hand-crafted features, with thresholds that can be documented and reviewed.
Audio is processed in memory and discarded. We measure acoustic behavior, not identity signals.
Outputs focus on measured anomalies and supporting rationale, not standalone confidence percentages.
Physics-based features target properties of natural speech rather than signatures of one specific synthesizer family.
Validation
Performance is validated on ASVspoof benchmarks across wideband, G.711 μ-law, and AMR-NB conditions. New features must pass a Do-No-Harm gate before release.
Governance artifacts are generated from the same evidence path used for detection decisions.
Trust & Governance
We avoid legal overclaims by design. The platform is built to reduce exposure, improve audit readiness, and produce records your teams can review and defend.
We log run metadata, thresholds, sensor outputs, and decision rationale so any verdict can be reconstructed. That record supports supervision, internal escalation, and external review.
We do not retain voiceprints, biometric templates, or raw audio after analysis. Processing is in memory, and zero biometric storage is a non-negotiable product constraint.
Context
Most tools can flag suspicious audio. Fewer can explain a decision in a way an examiner, auditor, or legal team can reliably review.
In high-stakes channels, one false positive can freeze a transfer and trigger immediate scrutiny: what was measured, which thresholds were applied, and why that conclusion was reasonable.
We focus on organizations where voice decisions authorize money movement or account control: community and regional banks, credit unions, broker-dealers, RIAs, and family offices.
Sonotheia pairs detection with decision governance so every verdict can be reconstructed, challenged, and defended with evidence rather than confidence scores alone.
Founded by a compliance executive and an audio engineer. We build for audit committees and examiners — not just detection accuracy.
CO-FOUNDER & PRESIDENT
Audio engineer and educator with 20+ years in professional audio. Post-production across Sony Music Studios, Sync Sound, and Creative Group. 18+ years as course director at Full Sail University. MPSE and AES member.
Leads detection R&D and the firm's physics-based approach to voice integrity.
View full profile →CO-FOUNDER & CEO
Financial services and regulatory leader with 15+ years navigating SEC, FINRA, CFTC, and Federal Reserve expectations. Licensed attorney; former Managing Director at Charles Schwab. Senior roles at SVB, Citizens, and Morgan Stanley.
Leads operations and client relationships, ensuring programs remain defensible under scrutiny.
View full profile →FAQ
Practical questions from compliance, risk, and fraud teams evaluating deployment.
Next step
We will scope your voice channels, define governance artifacts, and map an evidence workflow your risk and compliance teams can review before rollout.
Back to top