Spirit Voice Vocal Generator v1.0
Author: [Reserved for Peer Review]
Affiliation: [Reserved for Computational Psychoacoustics Lab]
Date: April 17, 2026
11.6 ms (512 samples at 44.1 kHz), suitable for live performance.

4. Perceptual Evaluation

A pilot listening test was conducted with 30 participants (20 audio professionals, 10 naive listeners).
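The 11.6 ms latency figure quoted for a 512-sample buffer at 44.1 kHz can be checked with a short calculation. The sketch below is illustrative only: the function name `buffer_latency_ms` is hypothetical, and it accounts for the duration of a single buffer, not for any additional driver or converter delay.

```python
def buffer_latency_ms(buffer_size: int, sample_rate_hz: float) -> float:
    """Return the duration of one audio buffer in milliseconds."""
    if buffer_size <= 0 or sample_rate_hz <= 0:
        raise ValueError("buffer_size and sample_rate_hz must be positive")
    # One buffer holds buffer_size samples, each lasting 1 / sample_rate_hz seconds.
    return 1000.0 * buffer_size / sample_rate_hz


if __name__ == "__main__":
    latency = buffer_latency_ms(512, 44_100)
    print(f"{latency:.1f} ms")  # prints "11.6 ms"
```

Halving the buffer to 256 samples would halve this figure, at the cost of a tighter processing deadline per buffer.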
| Parameter | Range | Default | Description |
|-----------|-------|---------|-------------|
| Spectral Dispersion | 0 – 1.0 | 0.65 | Degree of formant stretching/compression |
| Subharmonic Mix | -inf – +6 dB | -3 dB | Level of f0/2 and f0/3 components |
| Turbulence Density | 0 – 1.0 | 0.4 | Amplitude of stochastic noise layer |
| Temporal Smear | 0 – 50 ms | 15 ms | Phase randomization across frequency bins |
| Dry/Wet Mix | 0 – 1.0 | 0.7 | Balance of original vs. processed signal |
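As a minimal sketch of how the parameter table might be represented in code, the example below stores the listed defaults and rejects values outside the listed ranges. Everything beyond the ranges and defaults is an assumption for illustration: the class name `SVVGParams`, the field names, the helpers `db_to_gain` and `mix_dry_wet`, the amplitude convention for decibels (20·log10), and the reading of Dry/Wet Mix as 0 = fully original, 1 = fully processed. None of these are confirmed by the paper.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class SVVGParams:
    """Hypothetical container for the table's parameters (defaults from the table)."""
    spectral_dispersion: float = 0.65   # 0 to 1.0
    subharmonic_mix_db: float = -3.0    # -inf to +6 dB
    turbulence_density: float = 0.4     # 0 to 1.0
    temporal_smear_ms: float = 15.0     # 0 to 50 ms
    dry_wet_mix: float = 0.7            # 0 to 1.0

    def __post_init__(self) -> None:
        limits = {
            "spectral_dispersion": (0.0, 1.0),
            "subharmonic_mix_db": (-math.inf, 6.0),
            "turbulence_density": (0.0, 1.0),
            "temporal_smear_ms": (0.0, 50.0),
            "dry_wet_mix": (0.0, 1.0),
        }
        for name, (low, high) in limits.items():
            value = getattr(self, name)
            if math.isnan(value) or not (low <= value <= high):
                raise ValueError(f"{name}={value} is outside [{low}, {high}]")


def db_to_gain(level_db: float) -> float:
    """Convert a level in dB to linear amplitude gain (assumed 20*log10 convention)."""
    if level_db == -math.inf:
        return 0.0  # -inf dB means the component is silent
    return 10.0 ** (level_db / 20.0)


def mix_dry_wet(dry: float, wet: float, mix: float) -> float:
    """Linear crossfade of one sample; assumes mix=0 is fully dry, mix=1 fully wet."""
    return (1.0 - mix) * dry + mix * wet


if __name__ == "__main__":
    params = SVVGParams()
    print(round(db_to_gain(params.subharmonic_mix_db), 3))  # prints 0.708
```

Under these assumptions, the default Subharmonic Mix of -3 dB corresponds to a linear gain of about 0.708, and the default Dry/Wet Mix of 0.7 weights the processed signal at 70% of the output.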
This paper presents the Spirit Voice Vocal Generator v1.0 (SVVG v1.0), a novel digital audio processing system designed to synthesize hybrid vocal timbres that transcend traditional human or synthetic voice boundaries. Unlike conventional vocoders or text-to-speech (TTS) systems that aim for naturalistic reproduction, SVVG v1.0 introduces a spectral-parametric morphing engine that combines real-time formant filtering, subharmonic excitation, and stochastic noise modulation. The system generates what we term "ethereal vocal artifacts": voices that possess phonemic intelligibility but lack a definitive source identity. This paper details the architecture, signal processing pipeline, and preliminary perceptual evaluation of v1.0, demonstrating its applications in avant-garde music composition, voice therapy, and paranormal-ambient sound design.
Five spoken phrases (e.g., "The moon rises over the silent field") were processed through SVVG v1.0 at three EC settings (0.3, 0.6, 0.9).
