Voice governance infrastructure

Defensible voice channel decisions. Glass box, not black box.

Built for compliance, fraud, and risk teams accountable to auditors and regulators.

Open demo

Services

What we deliver

Supervisory documentation, audit trails, and defensible evidence — the artifacts regulators ask for.

Voice fraud governance

Ingest audio and get a forensic risk event with acoustic tags and explainable reason codes, not just a score.

Supervisory documentation

Forensic reports with measurements and rationale, designed for FINRA review, EU AI Act documentation, and FinCEN SAR filings.

Forensic analysis dashboard

Multi-codec spectral and temporal analysis so experts can validate detections and produce compliance-ready reports.

Technology

Evidence-first voice integrity stack

Detection is only the first layer. Every output is paired with explainable evidence and governance artifacts built for regulated environments.

Detection layer

Four independent acoustic sensors

Each sensor tracks a different physical property of speech. They run in parallel and converge into one documented decision path.

Respiratory physiology

Biological continuity

Measures voicing duration against human respiratory limits. Synthetic pipelines can sustain phonation patterns that healthy biological speech cannot.
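As a rough illustration of this kind of check, the sketch below finds the longest unbroken voiced run in a sequence of per-frame voicing decisions and compares it with a limit. It is not Sonotheia's implementation: the function names, the 10 ms frame length, and the 20-second limit are all assumptions chosen for the example, and the voicing decisions are assumed to come from an upstream detector.

```python
from typing import Dict, Sequence, Union


def longest_voiced_run_seconds(voiced: Sequence[bool], frame_seconds: float) -> float:
    """Return the duration of the longest unbroken run of voiced frames."""
    longest = current = 0
    for is_voiced in voiced:
        # Extend the current run on a voiced frame, reset it on a pause.
        current = current + 1 if is_voiced else 0
        longest = max(longest, current)
    return longest * frame_seconds


def check_phonation_limit(
    voiced: Sequence[bool],
    frame_seconds: float = 0.01,          # assumed 10 ms analysis frames
    max_phonation_seconds: float = 20.0,  # hypothetical limit, not a published value
) -> Dict[str, Union[float, bool]]:
    """Report the measurement next to the threshold it was compared with."""
    measured = longest_voiced_run_seconds(voiced, frame_seconds)
    return {
        "measured_seconds": measured,
        "threshold_seconds": max_phonation_seconds,
        "flag": measured > max_phonation_seconds,
    }
```

Returning the measurement and the threshold together, rather than a bare flag, keeps the result reviewable by someone who did not run the analysis.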

Environmental consistency

Context continuity

Tracks room response and ambient consistency across the call. Spliced or generated segments often break that acoustic continuity.

Temporal dynamics

Temporal integrity

Computes multi-scale permutation entropy of pitch and energy movement. Natural speech keeps characteristic complexity that synthetic output often over-regularizes.
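For readers unfamiliar with the measure, here is a minimal sketch of normalized permutation entropy (the Bandt-Pompe ordinal-pattern method) with simple coarse-graining for multiple scales. The pattern order, delay, scales, and function names are assumptions for illustration; the source does not state which parameters the product uses.

```python
import math
from collections import Counter
from typing import Dict, Sequence


def permutation_entropy(series: Sequence[float], order: int = 3, delay: int = 1) -> float:
    """Normalized permutation entropy in [0, 1] from ordinal patterns."""
    n = len(series) - (order - 1) * delay
    if n <= 0:
        raise ValueError("series too short for the requested order and delay")
    # Each window is reduced to the index order that sorts its values.
    patterns = Counter(
        tuple(sorted(range(order), key=lambda k: series[i + k * delay]))
        for i in range(n)
    )
    entropy = -sum((c / n) * math.log(c / n) for c in patterns.values())
    # Divide by the maximum possible entropy, log(order!).
    return entropy / math.log(math.factorial(order))


def multiscale_permutation_entropy(
    series: Sequence[float], scales: Sequence[int] = (1, 2, 4), order: int = 3
) -> Dict[int, float]:
    """Average non-overlapping windows at each scale, then compute the entropy."""
    result = {}
    for scale in scales:
        coarse = [
            sum(series[i:i + scale]) / scale
            for i in range(0, len(series) - scale + 1, scale)
        ]
        result[scale] = permutation_entropy(coarse, order)
    return result
```

A perfectly regular contour, such as a steadily rising pitch track, scores 0, while an irregular one scores closer to 1. In this framing, over-regularized output would appear as unusually low values at one or more scales.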

Harmonic resonance

Harmonic coherence analysis

Evaluates harmonic coherence patterns in the source signal. Natural vocal fold behavior produces structures that synthetic engines struggle to replicate consistently.

Governance layer

Supervision built in, not bolted on

The governance layer turns sensor output into traceable decisions, from first flag through compliance documentation.

Explainability

Decision trace

Every verdict links to specific measurements and documented thresholds so investigators can follow the logic step-by-step.

Reproducibility

Calibration & baselines

Frozen baselines and versioned calibration show exactly what was in production when each decision was made.
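One way to picture "what was in production when each decision was made" is an append-only history of immutable baselines that can be queried by timestamp. The sketch below is an assumption about how such a lookup could work, not a description of the product's storage; the class and field names are hypothetical.

```python
from bisect import bisect_right
from dataclasses import dataclass
from datetime import datetime
from typing import List, Tuple


@dataclass(frozen=True)  # frozen: a baseline cannot be edited after creation
class Baseline:
    version: str
    activated_at: datetime
    thresholds: Tuple[Tuple[str, float], ...]  # immutable (name, value) pairs


class CalibrationHistory:
    """Append-only record of which baseline was active and when."""

    def __init__(self) -> None:
        self._entries: List[Baseline] = []

    def activate(self, baseline: Baseline) -> None:
        # Reject out-of-order entries so the history is never rewritten.
        if self._entries and baseline.activated_at <= self._entries[-1].activated_at:
            raise ValueError("baselines must be activated in chronological order")
        self._entries.append(baseline)

    def active_at(self, moment: datetime) -> Baseline:
        """Return the baseline that was in force at the given moment."""
        times = [b.activated_at for b in self._entries]
        index = bisect_right(times, moment)
        if index == 0:
            raise LookupError("no baseline was active at that time")
        return self._entries[index - 1]
```

Because entries are frozen and only ever appended, a decision made in March can be replayed against the thresholds that applied in March, even after later recalibration.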

Regulatory readiness

Counter-hypothesis evaluation

Flags are tested against alternative explanations before escalation, including codec degradation, bandwidth limits, and environmental noise.
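A simplified sketch of this step might look like the following: each alternative explanation is a named check against call measurements, and a flag escalates only if none of them plausibly accounts for it. The measurement keys and cut-off values are hypothetical placeholders, not documented product thresholds.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Union


@dataclass(frozen=True)
class CounterHypothesis:
    name: str
    explains: Callable[[Dict[str, float]], bool]


# Hypothetical checks; real thresholds would be calibrated and documented.
HYPOTHESES: List[CounterHypothesis] = [
    CounterHypothesis("codec_degradation", lambda m: m["codec_bitrate_kbps"] < 8.0),
    CounterHypothesis("bandwidth_limit", lambda m: m["bandwidth_hz"] < 3400.0),
    CounterHypothesis("environmental_noise", lambda m: m["snr_db"] < 10.0),
]


def review_flag(measurements: Dict[str, float]) -> Dict[str, Union[bool, List[str]]]:
    """Escalate only when no alternative explanation fits the measurements."""
    plausible = [h.name for h in HYPOTHESES if h.explains(measurements)]
    return {"escalate": not plausible, "plausible_alternatives": plausible}
```

Recording which alternatives were considered, including the ones ruled out, gives a reviewer the reasoning behind an escalation and not just the outcome.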

Design principles

Intentional constraints

Product constraints are governance features. What we refuse to do is as important as what we automate.

No neural networks in the decision path

Classification uses inspectable linear models on hand-crafted features, with thresholds that can be documented and reviewed.
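To show why a linear model is easy to inspect, the sketch below scores a set of features and returns each feature's contribution alongside the threshold. The function name and the structure of the result are illustrative assumptions, and the weights are supplied by the caller.

```python
from typing import Dict, Union


def score_with_trace(
    features: Dict[str, float],
    weights: Dict[str, float],
    bias: float,
    threshold: float,
) -> Dict[str, Union[float, bool, Dict[str, float]]]:
    """Linear score with a per-feature breakdown a reviewer can audit."""
    # Each contribution is simply weight * value, so it can be checked by hand.
    contributions = {name: weights[name] * value for name, value in features.items()}
    score = bias + sum(contributions.values())
    return {
        "score": score,
        "threshold": threshold,
        "flag": score >= threshold,
        "contributions": contributions,
    }
```

Every number in the output can be recomputed from the recorded inputs with basic arithmetic, which is the property that makes the threshold documentable.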

No voiceprint storage

Audio is processed in memory and discarded. We measure acoustic behavior, not identity signals.

No confidence scores without evidence

Outputs focus on measured anomalies and supporting rationale, not standalone confidence percentages.

No training on the attacks we detect

Physics-based features target properties of natural speech rather than signatures of one specific synthesizer family.

Validation

Calibrated against public benchmarks

Performance is validated on ASVspoof benchmarks across wideband, G.711 μ-law, and AMR-NB conditions. New features must pass a Do-No-Harm gate before release.
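The source does not define the Do-No-Harm gate, so the following is only one plausible reading: a candidate release passes if its error rate does not get worse on any benchmark condition by more than a tolerance. The condition names, the use of equal error rate, and the tolerance are assumptions for the example.

```python
from typing import Dict, Union


def do_no_harm_gate(
    baseline_eer: Dict[str, float],
    candidate_eer: Dict[str, float],
    tolerance: float = 0.0,  # hypothetical allowed increase in error rate
) -> Dict[str, Union[bool, Dict[str, float]]]:
    """Pass only if no condition regresses by more than the tolerance."""
    regressions = {}
    for condition, old in baseline_eer.items():
        # A condition missing from the candidate results counts as a failure.
        new = candidate_eer.get(condition, float("inf"))
        if new - old > tolerance:
            regressions[condition] = new - old
    return {"passed": not regressions, "regressions": regressions}
```

Checking each condition separately matters because an average can improve while one codec condition quietly degrades.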

Governance artifacts are generated from the same evidence path used for detection decisions.

Trust & Governance

Clear boundaries on data and decisions

We avoid legal overclaims by design. The platform is built to reduce exposure, improve audit readiness, and produce records your teams can review and defend.

What we log and why

We log run metadata, thresholds, sensor outputs, and decision rationale so any verdict can be reconstructed. That record supports supervision, internal escalation, and external review.
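A minimal sketch of such a record, assuming a JSON serialization and a SHA-256 digest for tamper evidence, could look like this. The field names are hypothetical and the digest step is an addition for illustration. Note that the record holds measurements and rationale only, with no audio field.

```python
import hashlib
import json
from dataclasses import asdict, dataclass
from typing import Dict


@dataclass(frozen=True)
class DecisionRecord:
    run_id: str
    baseline_version: str
    thresholds: Dict[str, float]
    sensor_outputs: Dict[str, float]
    rationale: str

    def to_json(self) -> str:
        # Sorted keys make the serialization deterministic for the same content.
        return json.dumps(asdict(self), sort_keys=True, separators=(",", ":"))

    def digest(self) -> str:
        """SHA-256 of the canonical JSON, usable as a tamper-evidence check."""
        return hashlib.sha256(self.to_json().encode("utf-8")).hexdigest()
```

Storing the digest next to the record lets a later reviewer confirm that the thresholds and sensor outputs on file are the ones the verdict was based on.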

What we do not store

We do not retain voiceprints, biometric templates, or raw audio after analysis. Processing is in memory, and zero biometric storage is a non-negotiable product constraint.

Governance primitives

Baseline locking. Freeze production baselines so historical decisions are reproducible.
Drift tracking. Versioned calibration history records what changed, when, and why.
Decision trace. Every verdict links to concrete measurements and thresholds.
Counter-hypothesis review. Alternative explanations (codec, environment, noise) are tested before escalation.

Context

Why voice fraud becomes a governance problem

Most tools can flag suspicious audio. Fewer can explain a decision in a way an examiner, auditor, or legal team can reliably review.

In high-stakes channels, one false positive can freeze a transfer and trigger immediate scrutiny: what was measured, which thresholds were applied, and why that conclusion was reasonable.

Built for regulated voice channels

We focus on organizations where voice decisions authorize money movement or account control: community and regional banks, credit unions, broker-dealers, RIAs, and family offices.

The 2026 oversight wave is operational, not theoretical

FINRA (2026 guidance): Supervisory systems and vendor oversight expectations now explicitly address generative AI use.
FinCEN (2024 onward): SAR narratives involving synthetic media are expected to reference FIN-2024-DEEPFAKEFRAUD guidance.
EU AI Act (effective August 2026): Article 50 transparency duties raise the bar for explainability in high-risk channels.
Colorado AI Act (effective June 30, 2026): Consequential AI decisions require documented rationale and governance controls.

The operating goal: defensible decisions under scrutiny

Sonotheia pairs detection with decision governance so every verdict can be reconstructed, challenged, and defended with evidence rather than confidence scores alone.

Founding Team

Founded by a compliance executive and an audio engineer. We build for audit committees and examiners — not just detection accuracy.

Doron Reizes

CO-FOUNDER & PRESIDENT

Audio engineer and educator with 20+ years in professional audio. Post-production across Sony Music Studios, Sync Sound, and Creative Group. 18+ years as course director at Full Sail University. MPSE and AES member.

Leads detection R&D and the firm's physics-based approach to voice integrity.

Alexander Forostenko

CO-FOUNDER & CEO

Financial services and regulatory leader with 15+ years navigating SEC, FINRA, CFTC, and Federal Reserve expectations. Licensed attorney; former Managing Director at Charles Schwab. Senior roles at SVB, Citizens, and Morgan Stanley.

Leads operations and client relationships, ensuring programs remain defensible under scrutiny.

FAQ

Frequently asked questions

Practical questions from compliance, risk, and fraud teams evaluating deployment.

How is this different from detection vendors?

Most vendors stop at scores and APIs. Sonotheia adds a governance layer: decision trace, calibration history, and review-ready artifacts.

Do you store voiceprints?

No. Audio is processed in memory and not retained after analysis. We do not create or store voiceprints or biometric templates.

How do evidence packages support SAR and regulatory filings?

Evidence packages give your compliance team full audit documentation, so SARs and regulatory responses can be filed on time and in compliant form.

What does explainability mean in practice?

Each verdict ties to concrete measurements and documented thresholds. You can see which sensors contributed, which rules were triggered, and why the outcome was reached.

How do you handle drift and updates?

Baseline locking and calibration history preserve reproducibility. Updates are versioned so teams can show exactly what was active at any point in time.

Are you replacing existing vendors?

Usually no. Sonotheia is designed to complement existing detection tools with governance artifacts and supervisory documentation.

Next step

Launch with a controlled pilot.

We will scope your voice channels, define governance artifacts, and map an evidence workflow your risk and compliance teams can review before rollout.

Request pilot briefing

inquiries@sonotheia.ai

We reply within two business days.