The human voice consists of sound made by a human being using the vocal folds for talking, singing, laughing, crying, screaming, etc. Its frequency ranges from about 60 to 7000 Hz. The human voice is specifically that part of human sound production in which the vocal folds (vocal cords) are the primary sound source. Generally speaking, the mechanism for generating the human voice can be subdivided into three parts; the lungs, the vocal folds within the larynx, and the articulators. The lung (the pump) must produce adequate airflow and air pressure to vibrate vocal folds (this air pressure is the fuel of the voice). The vocal folds (vocal cords) are a vibrating valve that chops up the airflow from the lungs into audible pulses that form the laryngeal sound source. The muscles of the larynx adjust the length and tension of the vocal folds to fine tune pitch and tone. The articulators (the parts of the vocal tract above the larynx consisting of tongue, palate, cheek, lips, etc.) articulate and filter the sound emanating from the larynx and to some degree can interact with the laryngeal airflow to strengthen it or weaken it as a sound source.
The vocal folds, in combination with the articulators, are capable of producing highly intricate arrays of sound. The tone of voice may be modulated to suggest emotions such as anger, surprise, or happiness. Singers use the human voice as an instrument for creating music.
Voice types and the folds (cords) themselves
Adult men and women have different sizes of vocal fold; reflecting the male-female differences in larynx size. Adult male voices are usually lower-pitched and have larger folds. The male vocal folds (which would be measured vertically in the opposite diagram), are between 17 mm and 25 mm in length. The female vocal folds are between 12.5 mm and 17.5 mm in length.
A labeled anatomical diagram of the vocal folds
As seen in the illustration, the folds are located just above the vertebrate trachea (the windpipe, which travels from the lungs). Food and drink do not pass through the cords but instead pass through the esophagus, an unlinked tube. Both tubes are separated by the epiglottis, a "flap" that covers the opening of the trachea while swallowing.
The folds in both sexes are within the larynx. They are attached at the back (side nearest the spinal cord) to the arytenoids cartilages, and at the front (side under the chin) to the thyroid cartilage. They have no outer edge as they blend into the side of the breathing tube (the illustration is out of date and does not show this well) while their inner edges or "margins" are free to vibrate (the hole). They have a three layer construction of an epithelium, vocal ligament, then muscle (vocalis muscle), which can shorten and bulge the folds. They are flat triangular bands and are pearly white in color. Above both sides of the vocal cord is the vestibular fold or false vocal cord, which has a small sac between its two folds (not illustrated).
The difference in vocal folds size between men and women means that they have differently pitched voices. Additionally, genetics also causes variances amongst the same sex, with men and women's singing voices being categorized into types. For example, among men, there are bass, baritone, tenor and countertenor (ranging from E2 to even F6), and among women, contralto, mezzo-soprano and soprano (ranging from F3 to C6). There are additional categories for operatic voices, see voice type. This is not the only source of difference between male and female voice. Men, generally speaking, have a larger vocal tract, which essentially gives the resultant voice a lower-sounding timbre. This is mostly independent of the vocal folds themselves.
Voice modulation in spoken language
Human spoken language makes use of the ability of almost all persons in a given society to dynamically modulate certain parameters of the laryngeal voice source in a consistent manner. The most important communicative, or phonetic, parameters are the voice pitch (determined by the vibratory frequency of the vocal folds) and the degree of separation of the vocal folds, referred to as vocal fold adduction (coming together) or abduction (separating).
The ability to vary the ab/adduction of the vocal folds quickly has a strong genetic component, since vocal fold adduction has a life-preserving function in keeping food from passing into the lungs, in addition to the covering action of the epiglottis. Consequently, the muscles that control this action are among the fastest in the body. Children can learn to use this action consistently during speech at an early age, as they learn to speak the difference between utterances such as "apa" (having an abductory-adductory gesture for the p) as "aba" (having no abductory-adductory gesture). Surprisingly enough, they can learn to do this well before the age of two by listening only to the voices of adults around them who have voices much different from their own, and even though the laryngeal movements causing these phonetic differentiations are deep in the throat and not visible to them.
If an abductory movement or adductory movement is strong enough, the vibrations of the vocal folds will stop (or not start). If the gesture is abductory and is part of a speech sound, the sound will be called Voiceless. However, voiceless speech sounds are sometimes better identified as containing an abductory gesture, even if the gesture was not strong enough to stop the vocal folds from vibrating. This anomalous feature of voiceless speech sounds is better understood if it is realized that it is the change in the spectral qualities of the voice as abduction proceeds that is the primary acoustic attribute that the listener attends to when identifying a voiceless speech sound, and not simply the presence or absence of voice (periodic energy).
An adductory gesture is also identified by the change in voice spectral energy it produces. Thus, a speech sound having an adductory gesture may be referred to as a "glottal stop" even if the vocal fold vibrations do not entirely stop. for an example illustrating this, obtained by using the inverse filtering of oral airflow.]
Other aspects of the voice, such as variations in the regularity of vibration, are also used for communication, and are important for the trained voice user to master, but are more rarely used in the formal phonetic code of a spoken language.
Physiology and vocal timbre
The sound of each individual's voice is entirely unique not only because of the actual shape and size of an individual's vocal cords but also due to the size and shape of the rest of that person's body, especially the vocal tract, and the manner in which the speech sounds are habitually formed and articulated. (It is this latter aspect of the sound of the voice that can be mimicked by skilled performers.) Humans have vocal folds that can loosen, tighten, or change their thickness, and over which breath can be transferred at varying pressures. The shape of chest and neck, the position of the tongue, and the tightness of otherwise unrelated muscles can be altered. Any one of these actions results in a change in pitch, volume, timbre, or tone of the sound produced. Sound also resonates within different parts of the body, and an individual's size and bone structure can affect somewhat the sound produced by an individual.
Singers can also learn to project sound in certain ways so that it resonates better within their vocal tract. This is known as vocal resonation. Another major influence on vocal sound and production is the function of the larynx, which people can manipulate in different ways to produce different sounds. These different kinds of laryngeal function are described as different kinds of vocal registers. The primary method for singers to accomplish this is through the use of the Singer's Formant, which has been shown to be a resonance added to the normal resonances of the vocal tract above the frequency range of most instruments and so enables the singer's voice to carry better over musical accompaniment.
Vocal registration refers to the system of vocal registers within the human voice. A register in the human voice is a particular series of tones, produced in the same vibratory pattern of the vocal folds, and possessing the same quality. Registers originate in laryngeal functioning. They occur because the vocal folds are capable of producing several different vibratory patterns. Each of these vibratory patterns appears within a particular Vocal range range of pitches and produces certain characteristic sounds. the term register can be somewhat confusing as it encompasses several aspects of the human voice. The term register can be used to refer to any of the following:
- A particular part of the vocal range such as the upper, middle, or lower registers.
- A resonance area such as chest voice or head voice.
- A phonatory process.
- A certain vocal timbre.
- A region of the voice that is defined or delimited by vocal breaks.
- A subset of a language used for a particular purpose or in a particular social setting.
In linguistics, a register language is a language that combines tone and vowel phonation into a single phonological system.
Within speech pathology the term vocal register has three constituent elements: a certain vibratory pattern of the vocal folds, a certain series of pitches, and a certain type of sound. Speech pathologists identify four vocal registers based on the physiology of laryngeal function: the vocal fry register, the modal register, and the falsetto register, and the whistle register. This view is also adopted by many vocal pedagogists.
Vocal resonation is the process by which the basic product of phonation is enhanced in timbre and/or intensity by the air-filled cavities through which it passes on its way to the outside air. Various terms related to the resonation process include amplification, enrichment, enlargement, improvement, intensification, and prolongation; although in strictly scientific usage acoustic authorities would question most of them. The main point to be drawn from these terms by a singer or speaker is that the end result of resonation is, or should be, to make a better sound. There are seven areas that may be listed as possible vocal resonators. In sequence from the lowest within the body to the highest, these areas are the chest, the tracheal tree, the larynx itself, the pharynx, the oral cavity, the nasal cavity, and the sinuses.
Influences of the human voice
The twelve-tone musical scale, upon which some of the music in the world is based, may have its roots in the sound of the human voice during the course of evolution, according to a study published by the New Scientist. Analysis of recorded speech samples found peaks in acoustic energy that mirrored the distances between notes in the twelve-tone scale.
There are many disorders that affect the human voice; these include speech impediments, and growths and lesions on the vocal folds. Talking improperly for long periods of time causes vocal loading, which is stress inflicted on the speech organs. When vocal injury is done, often an ENT specialist may be able to help, but the best treatment is the prevention of injuries through good vocal production. Voice therapy is generally delivered by a speech-language pathologist.
Vocal Cord Nodules and Polyps
Vocal nodules are caused over time by repeated abuse of the vocal cords which results in soft, swollen spots on each vocal cord. These spots develop into harder, callous-like growths called nodules. The longer the abuse occurs the larger and stiffer the nodules will become. Most polyps are larger than nodules and may be called by other names, such as polypoid degeneration or Reinke's edema. Polyps are caused by a single occurrence and may require surgical removal. Irritation after the removal may then lead to nodules if additional irritation persists. Speech-language therapy teaches the patient how to eliminate the irritations permanently through habit changes and vocal hygiene. Hoarseness or breathiness that lasts for more than two weeks is a common symptom of an underlying voice disorder such as nodes or polyps and should be investigated medically.
- Howard, D.M., and Murphy, D.T.M. (2009). http://www.pluralpublishing.com/publication_vsaar.htm Voice science acoustics and recording, San Diego: Plural Press.
- Titze, I. R. (2008). The human instrument. Sci.Am. 298 (1):94 101. http://www.scientificamerican.com/article.cfm?id=the-human-instrument
- Thurman, Leon & Welch, ed., Graham (2000), Bodymind & voice: Foundations of voice education (revised ed.), Collegeville, Minnesota: The VoiceCare Network et al., ISBN 0-87414-123-0
af:Stem be: ca:Veu cs:Hlas da:Vokal (musik) de:Menschliche Stimme et:H l el: es:Voz (fonolog a) eu:Ahots fr:Voix (phoniatrie) gl:Voz (fonolox a) ko: hi: ( ) it:Voce kk: ht:Vwa la:Vox lt: mogaus balsas ln:Lol k nah:Tozquitl nl:Menselijke stem ja: no:Stemme nn:R yst pl:G os ludzki pt:Voz humana ro:Voce uman qu:Kunka (rimay) ru: sq:Z ri njer zor simple:Voice sk:Hlas (zvukov prejav) sv:R sten tl:Tinig tg: uk: ur: vec:V xe fiu-vro:Inemise hel yi: zh: