Voice fetish (koe)
✎ 本文編集 (admin) 🖼 画像編集 (admin)Eyes closed in a dark room. Headphones holding a recorded voice that whispers near one ear, then the other. The voice belongs to a performer the listener has chosen carefully, listened to for years, paid for through a substantial subscription-and-download history. Voice fetish — koe-fechi in the Japanese vocabulary — is the aesthetic and erotic engagement with a person’s voice as an independent object-of-attention, and the form has produced one of the most-developed adult-content production economies of any sensory-modality-coded category.
Overview
Voice fetish (Japanese: 声フェチ, koe fechi; alternative Japanese: ボイスフェチ boisu fechi, 音声フェチ onsei fechi; English working translations: voice fetish, voice kink, audio fetish; clinical and academic vocabulary: auditory fetishism, phonetic fetish) is the aesthetic-and-erotic engagement with voice — particular voice-qualities, particular speaking-styles, particular vocalisation-registers — as an independent object of attention. The form sits within the broader category of auditory fetishism and is one of the better-developed sub-categories within that broader field.
The Japanese-and-international adult-audio-content economy that has formed around this aesthetic-and-erotic engagement is substantial. The Japanese seiyuu (voice-actor) industry’s substantial fan-economy in non-adult contexts maps closely onto an adjacent adult-audio-content economy, and the cross-pollination between the two has produced a continuously-renewing pipeline of voice-trained performers, fan-recognition-of-particular-voice-styles, and content-and-format-experimentation. Major distribution platforms (DLsite and others) maintain dedicated audio-content categories with substantial volumes of available work, and the format-recognition is firm enough that the voice-fetish category operates as a genuinely-independent slot in the adult-content category-system.
Etymology and definition
The Japanese 声 (koe, “voice”) and the loanword fechi (フェチ, abbreviation of fetish) compound to koe-fechi — the standard Japanese term for the category. The variants ボイスフェチ (boisu-fechi, with the English voice loanword) and 音声フェチ (onsei-fechi, with the more-formal-Sino-Japanese onsei “audio / voice”) are also current. The English-language voice fetish, voice kink, and the older clinical-vocabulary phonetic fetish / auditory fetishism operate as approximate translations.
The category covers, in standard fan-vocabulary use:
- Voice quality / register: the broad acoustic-properties of the voice — low-and-husky, high-and-clear, breathy-and-soft, sharp-and-clear, and the many other recognisable register-options.
- Speaking style / accent: the speech-pattern of the voice — particular dialect (Kansai, Hakata, Tōhoku), particular sentence-final patterns, the polite-or-rough-register choice, the pace-and-rhythm of the speaking.
- Non-verbal vocalisation: breath, sigh, laugh, hum, whisper, the sounds of eating-or-drinking, and the broader range of voice-without-words sounds. The non-verbal sound register overlaps substantially with the scent-fetish and ear-fetish traditions.
- Specific vocalisation contexts: the voice in moments of arousal, in moments of emotional release, in moments of laughter or weeping. The arousal-vocalisation register is the principal one for adult-audio production and ranges from light-vocal-shadings through to fully-elaborated vocal performances.
The structure of the kink
The voice-fetish category operates on several structural elements that distinguish it within the broader sensory-modality-coded fetish landscape.
The first is information-density of voice. Human voices encode, simultaneously, age, gender, emotional state, cultural background, current physiological state, and personality. The receiving-listener processes all of these layers in parallel, and the resulting mental-image-formation — the listener’s automatic construction of the speaker’s whole presence from voice alone — is one of the form’s principal aesthetic-resources. The information-density-of-voice register is partly what drives the form’s ability to produce strong-imaginative-and-affective response from voice-alone-content.
The second is eyes-closed accessibility. Voice operates effectively in conditions where vision is unavailable: in darkness, with eyes closed, during sleep-onset, in moments of physical-contact with closed-eyes. The corresponding ear-and-brain-only access channel is the structural-channel that voice-fetish content uses, and the form’s most-effective implementations exploit the availability of the channel in conditions where visual-content cannot reach.
The third is imaginative augmentation. Where visual-content presents the body-in-its-fullness, voice-content presents only voice and requires the listener to construct the rest from imagination-and-association. The resulting active-construction-by-the-listener is part of what makes the form produce particularly-high engagement: the listener is not passively-receiving the body, the listener is actively-constructing it from the audio-cues. The construction-engagement is one of the form’s most-distinctive aesthetic-features.
The fourth is direct-physiological-affect. Voice — and particularly the soft-and-near-ear-binaural register — produces direct-physiological-affect (the ASMR response, the corresponding parasympathetic-relaxation register, the gooseflesh-and-tingle response) in many listeners. The physical-sensation-register that voice can produce in the body provides a direct-physical-channel that other sensory-modalities cannot match for many listeners, and the corresponding bodily-sensation is one of the form’s most-immediate appeals.
Sub-registers
The voice-fetish category sub-divides on several axes simultaneously.
Pitch and register axis. Low-and-husky voices, high-and-clear voices, breathy-and-soft voices, sharp-and-clear voices, and the many gradations between them. Different listeners maintain strong-and-stable preferences along this axis, and the production-side recognises and serves the resulting category-system.
Speech-style axis. Polite-and-formal speech, casual-and-rough speech, dialect-coded speech (Kansai, Hakata, Tōhoku, and others), the senpai-to-kōhai and kōhai-to-senpai directions of register, the seiyuu-character-voice and the real-voice registers, and many adjacent sub-registers. The speech-style axis is one of the more-elaborated dimensions of the voice-fetish vocabulary.
Verbal-versus-non-verbal axis. Articulated-speech-coded sub-registers, breathy-and-vocal-coded sub-registers, sound-only-coded sub-registers (ASMR-only, breath-only, ear-attention-only). Each sub-register operates with a distinctive content-and-listening register.
Audio-content production economy
The Japanese adult-audio-content economy is one of the most-developed audio-content economies in any genre or language. Built on the foundation of the seiyuu (voice-actor) industry’s general-content production tradition, the adult-audio-content sector emerged as a specialised production-form in the 2000s and consolidated through the 2010s.
The principal distribution-platform is DLsite, with substantial volumes of audio-content available across multiple sub-genres. Top-tier voice-performers in the adult-audio sector accumulate substantial individual sales-volumes, with single-work sales reaching tens-of-thousands of copies for the most-popular performers. The performer-and-listener relationship in the sector has substantial similarities to the more-general voice-actor-and-fan relationship in mainstream anime-and-game production, with listeners following particular performers across multiple-works and across multiple-platforms.
Production-format development in the sector has been substantial. The binaural recording technique — using two microphones positioned at the locations of human ears to capture stereo-perceived sound that recreates the listener’s experience of having the speaking-figure positioned at a specific physical location relative to the listener — has been widely-adopted as a standard production-format for adult-audio-content, with the near-ear whisper (mimi-zeme, ear-attention-seme), ear-licking (mimi-name), and similar sub-genres particularly suited to the binaural format.
Genre context: voice and broader sensory-content
Voice-fetish content sits within a broader sensory-content ecosystem.
ASMR (autonomous sensory meridian response) content operates in the same audio-and-binaural register as adult-audio-content, with the difference being principally the genre-context (general-relaxation versus adult-content) and the corresponding production-and-reception conventions. Substantial cross-pollination between the two categories has produced ASMR-coded adult-audio-content and adult-audio-coded ASMR content, with the boundary between the two categories increasingly fluid.
Voice-drama / voice-dialogue audio content with character-coded voice-acting and narrative-structured scenarios (shichu-voice, situation-voice) supplies the more-narratively-elaborated sub-category of audio-content. The combination of character-positioning, scenario-construction, and voice-performance produces a more-elaborated production-form than the more-direct ASMR-coded sub-categories.
Game audio and character-voice in eroge operates in a related-but-distinct genre. The character-voice in eroge productions delivers particular-character-voice-content within the broader game-narrative, with the corresponding voice-actor-and-character-recognition forming part of the game’s reception-and-fan-engagement.
The four content-categories — ASMR-coded audio, dramatic-shichu-voice audio, eroge-character-voice content, and the more-direct voice-fetish-coded audio content — operate as related-and-overlapping production-categories with substantial cross-listener migration and substantial cross-pollination of production-techniques.
International circulation
The Japanese-language voice-fetish-and-adult-audio-content sector has developed in close-and-distinctive relation to the broader Japanese voice-actor-fan-economy. The international Anglophone equivalent has been substantially less-developed historically, with the audio erotica category in English-language adult-content production substantially smaller in volume and substantially less-established in the broader adult-content category-system.
Through the 2010s and 2020s, the Anglophone audio-erotica sector has developed substantially, with platforms including Quinn, Dipsea, Bloom, and others establishing dedicated audio-content distribution. The corresponding Japanese-language sector remains the more-developed, with the longer-established voice-actor-and-fan-economy and the more-developed binaural-and-production-technique infrastructure giving the Japanese sector substantial advantages in production-and-listening sophistication.
Cross-cultural circulation between the two language-sectors has been moderate. English-translation of Japanese audio-content has been limited (audio-content translation requires re-recording rather than the easier subtitle-or-translation work of text-or-image content), but international fans of Japanese voice-actors have substantial communities accessing the Japanese-language content directly.
Related Terms
- Ear fetish (mimi fetish)
- Erotic ASMR
- Situation-voice (shichu-voice)
- Eroge
- Doujinshi
Updated
「Voice fetish (koe)」の動画作品
Powered by FANZA Webサービス
「Voice fetish (koe)」の同人作品
Powered by FANZA Webサービス
「Voice fetish (koe)」の同人作品(DLsiteランキング)
References
- 『Anime's Media Mix: Franchising Toys and Characters in Japan』 University of Minnesota Press (2012) — Reference for the voice-actor-and-character-merchandising ecosystem within which voice-fetish operates.
- 『The Sound of Anime: Voice Acting and Sound Design in Japanese Animation』 Japan Forum (2019)
- 『Sound Pleasure: Sex, Sound, and the Pornographic Imagination』 Duke University Press (2018)
- 『Otaku: Japan's Database Animals』 University of Minnesota Press (2009)
Also known as
- voice fetish
- voice kink
- audio fetish
- ja: 声フェチ
- ja: ボイスフェチ
- ja: 声萌え
- ja: 音声フェチ
Related
- Ero ASMR
- Dialect Fetish (Hougen)
- Gag play (sarugutsuwa)
- Hentai ASMR
- Jawline Fetish
- ASMR (Autonomous Sensory Meridian Response)
- Blazer School Uniform
- Bodikon (Body-Conscious Fashion)
- Sexual Dimorphism Fetish (Dansa Fechi)
- Reading Fetish (Dokusho Fechi)
- Gangimari (Drugged-Face Expression)
- Hard Pounding (Gan-tsuki / Geki-pisu)