Lexical set explained

pronounced as /notice/A lexical set is a group of words that share a particular phonological feature.

A phoneme is a basic unit of sound in a language that can distinguish one word from another. Most commonly, following the work of phonetician John C. Wells, a lexical set is a class of words in a language that share a certain vowel phoneme. As Wells himself says, lexical sets "enable one to refer concisely to large groups of words which tend to share the same vowel, and to the vowel which they share". For instance, the pronunciation of the vowel in cup, luck, sun, blood, glove, and tough may vary in different English dialects but is usually consistent within each dialect and so the category of words forms a lexical set,^[1] which Wells, for ease, calls the set. Meanwhile, words like bid, cliff, limb, miss, etc. form a separate lexical set: Wells's set. Originally, Wells developed 24 such labels - keywords - for the vowel lexical sets of English, which have been sometimes modified and expanded by himself or other scholars for various reasons. Lexical sets have also been used to describe the vowels of other languages, such as French,^[2] Irish^[3] and Scots.^[4]

There are several reasons why lexical sets are useful. Scholars of phonetics often use abstract symbols (most universally today, those of the International Phonetic Alphabet) to transcribe phonemes, but they may follow different transcribing conventions or rely on implicit assumptions in their exact choice of symbols. One convenience of lexical sets is their tendency to avoid these conventions or assumptions. Instead, Wells explains, they "make use of keywords intended to be unmistakable no matter what accent one says them in". That makes them useful for examining phonemes within an accent, comparing and contrasting different accents, and capturing how phonemes may be differently distributed based on accent. A further benefit is that people with no background in phonetics can identify a phoneme not by learned symbols or technical jargon but by its simple keyword (like or in the above examples).^[1]

Standard lexical sets for English

The standard lexical sets for English introduced by John C. Wells in his 1982 Accents of English are in wide usage. Wells defined each lexical set on the basis of the pronunciation of words in two reference accents, which he calls RP and GenAm.

"RP" refers to Received Pronunciation, the traditionally prestigious accent in England.
"GenAm" refers to an accent of the General American type, which is associated with a geographically "neutral" or widespread sound system throughout the US.

Wells classifies English words into 24 lexical sets on the basis of the pronunciation of the vowel of their stressed syllable in the two reference accents. Typed in small caps, each lexical set is named after a representative keyword. Wells also describes three sets of words based on word-final unstressed vowels, which, though not included in the standard 24 lexical sets (the final three sets listed in the chart below) "have indexical and diagnostic value in distinguishing accents".

Lexical sets, as defined in
Keyword	RP	GA	Example words
	pronounced as /ɪ/	pronounced as /ɪ/	ship, sick, bridge, milk, myth, busy
	pronounced as /e/	pronounced as /ɛ/	step, neck, edge, shelf, friend, ready
	pronounced as /æ/	pronounced as /æ/	tap, back, badge, scalp, hand, cancel
	pronounced as /ɒ/	pronounced as /ɑ/	stop, sock, dodge, romp, possible, quality
	pronounced as /ʌ/	pronounced as /ʌ/	cup, suck, budge, pulse, trunk, blood
	pronounced as /ʊ/	pronounced as /ʊ/	put, bush, full, good, look, wolf
	pronounced as /ɑː/	pronounced as /æ/	staff, brass, ask, dance, sample, calf
	pronounced as /ɒ/	pronounced as /ɔ/	cough, broth, cross, long, Boston
	pronounced as /ɜː/	pronounced as /ɜr/	hurt, lurk, urge, burst, jerk, term
	pronounced as /iː/	pronounced as /i/	creep, speak, leave, feel, key, people
	pronounced as /eɪ/	pronounced as /eɪ/	tape, cake, raid, veil, steak, day
	pronounced as /ɑː/	pronounced as /ɑ/	psalm, father, bra, spa, lager
	pronounced as /ɔː/	pronounced as /ɔ/	taught, sauce, hawk, jaw, broad
	pronounced as /əʊ/	pronounced as /oʊ/	soap, joke, home, know, so, roll
	pronounced as /uː/	pronounced as /u/	loop, shoot, tomb, mute, huge, view
	pronounced as /aɪ/	pronounced as /aɪ/	ripe, write, arrive, high, try, buy
	pronounced as /ɔɪ/	pronounced as /ɔɪ/	adroit, noise, join, toy, royal
	pronounced as /aʊ/	pronounced as /aʊ/	out, house, loud, count, crowd, cow
	pronounced as /ɪə/	pronounced as /ɪr/	beer, sincere, fear, beard, serum
	pronounced as /ɛə/	pronounced as /ɛr/	care, fair, pear, where, scarce, vary
	pronounced as /ɑː/	pronounced as /ɑr/	far, sharp, bark, carve, farm, heart
	pronounced as /ɔː/	pronounced as /ɔr/	for, war, short, scorch, born, warm
	pronounced as /ɔː/	pronounced as /or/	four, wore, sport, porch, borne, story
	pronounced as /ʊə/	pronounced as /ʊr/	poor, tourist, pure, plural, jury
happ	pronounced as /ɪ/	pronounced as /ɪ/	copy, scampi, taxi, sortie, committee, hockey, Chelsea
lett	pronounced as /ə/	pronounced as /ər/	paper, metre, calendar, stupor, succo(u)r, martyr
comm	pronounced as /ə/	pronounced as /ə/	about, gallop, oblige, quota, vodka

For example, the word rod is pronounced pronounced as //ˈrɒd// in RP and pronounced as //ˈrɑd// in GenAm. It therefore belongs in the lexical set. Weary is pronounced pronounced as //ˈwɪərɪ// in RP and pronounced as //ˈwɪrɪ// in GenAm and thus belongs in the lexical set.

Some English words do not belong to any lexical set. For example, the a in the stressed syllable of tomato is pronounced pronounced as //ɑː// in RP, and pronounced as //eɪ// in GenAm, a combination that is very unusual and is not covered by any of the 27 lexical sets above. Some words pronounced with pronounced as //ɒ// before a velar consonant in RP, such as mock and fog, belong to no particular lexical set because the GenAm pronunciation varies between pronounced as //ɔ// and pronounced as //ɑ//.

The GenAm,,, and range between monophthongal pronounced as /[i, e, u, o]/ and diphthongal pronounced as /[ɪi, eɪ, ʊu, oʊ]/, and Wells chose to phonemicize three of them as monophthongs for the sake of simplicity and as pronounced as //eɪ// to avoid confusion with RP, pronounced as //e//.

The happ set was identified phonemically as the same as for both RP and GenAm, reflecting the then-traditional analysis, although realizations similar to (happy tensing) were already taking hold in both varieties. The notation (IPA|i) for happ has since emerged and been taken up by major pronouncing dictionaries, including Wells's, to take note of this shift. Wells's model of General American is also conservative in that it lacks the cot–caught (–) and horse–hoarse (–) mergers.

Choice of the keywords

Wells explains his choice of keywords ("kit", "fleece", etc.) as follows:

The keywords have been chosen in such a way that clarity is maximized: whatever accent of English they are spoken in, they can hardly be mistaken for other words. Although fleece is not the commonest of words, it cannot be mistaken for a word with some other vowel; whereas beat, say, if we had chosen it instead, would have been subject to the drawback that one man's pronunciation of beat may sound like another's pronunciation of bait or bit.

Wherever possible, the keywords end in a voiceless alveolar or dental consonant.

Usage

The standard lexical sets of Wells are widely used to discuss the phonological and phonetic systems of different accents of English in a clear and concise manner. Although based solely on RP and GenAm, the standard lexical sets have proven useful in describing many other accents of English. This is true because, in many dialects, the words in all or most of the sets are pronounced with similar or identical stressed vowels. Wells himself uses the Lexical Sets most prominently to give "tables of lexical incidence" for all the various accents he discusses in his work. For example, here is the table of lexical incidence he gives for Newfoundland English:

pronounced as /ɪ/

pronounced as /ɛ/

pronounced as /æ/

pronounced as /ɑ/

pronounced as /ɔ̈/

pronounced as /ʊ/

pronounced as /æː/

pronounced as /ɑː/

pronounced as /ɜr [ɝ:]/

pronounced as /iː/

pronounced as /ɛː, ɛɪ/

pronounced as /æ, ɑː/

pronounced as /ɑː/

pronounced as /ʌʊ/

pronounced as /uː/

pronounced as /əi/

pronounced as /əu/

pronounced as /ɛr/

pronounced as /ær/

pronounced as /ɔ̈r/

happ: pronounced as /[i]/
lett: pronounced as /ər [ɚ]/
comm: pronounced as /ə/

The table indicates that, for example, Newfoundland English uses the pronounced as //ɪ// phoneme for words in the lexical set, and that the, and sets are all pronounced with the same vowel pronounced as //ɔ̈r//. Note that some lexical sets, such as, are given with more than one pronunciation, which indicates that not all words in the lexical set are pronounced similarly (in this case, Newfoundland English has not fully undergone the pane–pain merger). pronounced as //ɔ̈// is a back vowel pronounced as /link/; Wells uses the symbol (IPA|ɔ̈) so that the reader does not confuse it with the vowel (which, in the case of many other accents, he writes with (IPA|ɔ) or (IPA|ɔː)).

Wells also uses the standard lexical sets to refer to "the vowel sound used for the standard lexical set in question in the accent under discussion": Thus, for example, in describing the Newfoundland accent, Wells writes that " and are reportedly often merged as pronounced as /[ɪ]/", meaning that the stressed syllables of words in the lexical set and words in the lexical set are reportedly often pronounced identically with the vowel pronounced as /[ɪ]/.

Lexical sets may also be used to describe splits and mergers. For example, RP, along with most other non-rhotic accents, pronounces words such as "father" and "farther" identically. This can be described more economically as the merger of the and lexical sets. Most North American accents make "father" rhyme with "bother". This can be described as the merger of the and lexical sets.

Origin

In a 2010 blog post, Wells wrote:

He also wrote that he claimed no copyright in the standard lexical sets, and that everyone was "free to make whatever use of them they wish".

Extensions

Some varieties of English make distinctions in stressed vowels that are not captured by the 24 lexical sets. For example, some Irish and Scottish accents that have not undergone the fern–fir–fur merger split the lexical set into multiple subsets. For such accents, the 24 Wells lexical sets may be inadequate. Because of this, a work devoted to Irish English may split the Wells set into two subsets, a new, smaller set and a set.^[5]

Some writers on English accents have introduced a set to refer to a set of words that have the vowel in standard accents but may have a different vowel in Sheffield^[6] or in south-east London.^[7] Wells has stated that he didn't include a set because this should be interpreted as an allophone of that is sensitive to the morpheme boundary, which he illustrates by comparing the London pronunciations of goalie and slowly.^[8]

, which documents the phonologies of varieties of English around the world like, employs Wells's standard lexical sets as well as the following supplementary lexical sets, as needed to illustrate finer details of the variety under discussion:

, discussed above
hors, offics, paintd and villge, all referring to the unstressed allophone of that is subject to the weak vowel merger
, and, for the allophones of (in non-rhotic dialects), and before intervocalic pronounced as //r//, commonly subject to Mary–marry–merry merger in North American English
and, for the allophones of and before intervocalic pronounced as //r//, commonly subject to mirror–nearer merger in North American English
treac and unc, both referring to the vocalized pronounced as //əl//
Other supplementary lexical sets include:

,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,, met,,,,,,,, carr, cord, crious,,, bout,,,,,,,,,,,,,

There is also the set, which is the same as Wells's .

Adaptation for Anglo-Welsh dialects

In his work for the Survey of Anglo-Welsh Dialects, David Parry adapted Wells's lexical sets for Anglo-Welsh dialects.

Lexical sets, as defined in .
Keyword	Example words
	bitch, bridge, finger, shilling, squirrel, thimble, whip, with
	buried, deaf, kettle, second, twelve, yellow
	apples, hand, ladder, lamb, man, rabbits, rat, saddle, that, thatch
	butter, furrow, jump, none, nothing, one, onions, suck, uncle
	cross, dog, fox, holly, off, porridge, quarry, trough, wash, wasps, wrong
	bull, butcher, foot, put, sugar, woman, wool
	cheese, geese, grease, key, pea, sheaf, sheep, weasel, weeds, wheel, yeast
	bacon, break, clay, drain, gate, lay (verb), potatoes, spade, tail, take, waistcoat, weigh
	first, heard, third, work (noun)
	chair, hare, mare, pears
	arm, branch, calf, chaff, draught, farmer, farthing, grass
	forks, morning, saw-dust, slaughter-house, straw, walk
	coal, cold, colt, comb, foal, oak, old, road, sholder, snow, spokes, toad, yolk
	dew, ewe, goose, hoof, root, stool, tooth, Tuesday, two
	eye, fight, flies (noun, plural), hive, ivy, mice, white
	boiling, oil, voice
	cow, plough, snout, sow (noun), thousand
	ears, hear, year
	boar, door, four
	fire, iron
	flour, hour

Bibliography

Book: Cruttenden , Alan . 2014. Gimson's Pronunciation of English. 8th. Routledge. 978-0-415-72174-5.
Book: Parry, David. A Grammar and Glossary of the Conservative Anglo-Welsh Dialects of Rural Wales. The National Centre for English Cultural Tradition. 1999. Sheffield.
Book: Schneider. Edgar W.. Burridge. Kate. Kate Burridge. Kortmann. Bernd. Mesthrie. Rajend. Upton. Clive. Clive Upton. 2004. A Handbook of Varieties of English. 1: Phonology. Mouton de Gruyter. 978-3-11-017532-5.

External links

Nicole Taylor (with the collaboration of Norma Mendoza-Denton http://www.u.arizona.edu/~nmd/cv.html), The University of Arizona, Anthropology 383, Standard Lexical Sets, 2002 (in Archive.is)
University of Pennsylvania, Linguistics 001, Lecture 9: Pronunciation of English

Notes and References

Mesthrie, Rajend (2000). "Regional Dialectology". Introducing Sociolinguistics. Edinburgh University Press, p. 50.
Book: Armstrong , Nigel . Social and stylistic variation in spoken French: a comparative approach. John Benjamins. 2001. 90-272-1839-0. Amsterdam. 100ff.
Book: Raymond Hickey. The Dialects of Irish: Study of a Changing Landscape. 29 August 2011. Walter de Gruyter. 978-3-11-023830-3.
Book: Robert McColl Millar. Northern and insular Scots. 2007. Edinburgh University Press. 978-0-7486-2316-7.
Book: Hickey , Raymond . A sound atlas of Irish English. Mouton de Gruyter. 2004. 3-11-018298-X. 54–55.
Stoddart, Upton and Widowson in Urban Voices, Arnold, London, 1999, page 76
Tollfree in Urban Voices, Arnold, London, 1999, page 165
Web site: John Wells's phonetic blog: the evidence of the vows. 2011-05-03. 2014-02-17.