Morphological typology is a way of classifying the languages of the world (see linguistic typology) that groups languages according to their common morphological structures. The field organizes languages on the basis of how those languages form words by combining morphemes. Analytic languages contain very little inflection, instead relying on features like word order and auxiliary words to convey meaning. Synthetic languages, ones that are not analytic, are divided into two categories: agglutinative and fusional languages. Agglutinative languages rely primarily on discrete particles (prefixes, suffixes, and infixes) for inflection, while fusional languages "fuse" inflectional categories together, often allowing one word ending to contain several categories, such that the original root can be difficult to extract. A further subcategory of agglutinative languages are polysynthetic languages, which take agglutination to a higher level by constructing entire sentences, including nouns, as one word.
Analytic, fusional, and agglutinative languages can all be found in many regions of the world. However, each category is dominant in some families and regions and essentially nonexistent in others. Analytic languages encompass the Sino-Tibetan family, including Chinese, many languages in Southeast Asia, the Pacific, and West Africa, and a few of the Germanic languages. Fusional languages encompass most of the Indo-European family—for example, French, Russian, and Hindi—as well as the Semitic family and a few members of the Uralic family. Most of the world's languages, however, are agglutinative, including the Turkic, Japonic, and Bantu languages and most families in the Americas, Australia, the Caucasus, and non-Slavic Russia. Constructed languages take a variety of morphological alignments.
The concept of discrete morphological categories has not been without criticism. Some linguists argue that most, if not all, languages are in a permanent state of transition, normally from fusional to analytic to agglutinative to fusional again. Others take issue with the definitions of the categories, arguing that they conflate several distinct, if related, variables.
Analytic languages show a low ratio of morphemes to words; in fact, the correspondence is nearly one-to-one. Sentences in analytic languages are composed of independent root morphemes. Grammatical relations between words are expressed by separate words where they might otherwise be expressed by affixes, which are present to a minimal degree in such languages. There is little to no morphological change in words: they tend to be uninflected. Grammatical categories are indicated by word order (for example, inversion of verb and subject for interrogative sentences) or by bringing in additional words (for example, a word for "some" or "many" instead of a plural inflection like English -s). Individual words carry a general meaning (root concept); nuances are expressed by other words. Finally, in analytic languages context and syntax are more important than morphology.
Analytic languages include some of the major East Asian languages, such as Chinese, and Vietnamese. Note that the ideographic writing systems of these languages play a strong role in regimenting linguistic continuity according to an analytic, or isolating, morphology (cf. orthography).
Additionally, English is moderately analytic, and it and Afrikaans can be considered as some of the most analytic of all Indo-European languages. However, they are traditionally analyzed as fusional languages.
A related concept is the isolating language, one in which there is only one, or on average close to one, morpheme per word. Not all analytic languages are isolating; for example, Chinese and English possess many compound words, but contain few inflections for them.
Synthetic languages form words by affixing a given number of dependent morphemes to a root morpheme. The morphemes may be distinguishable from the root, or they may not. They may be fused with it or among themselves (in that multiple pieces of grammatical information may potentially be packed into one morpheme). Word order is less important for these languages than it is for analytic languages, since individual words express the grammatical relations that would otherwise be indicated by syntax. In addition, there tends to be a high degree of concordance (agreement, or cross-reference between different parts of the sentence). Therefore, morphology in synthetic languages is more important than syntax. Most Indo-European languages are moderately synthetic.
There are two subtypes of synthesis, according to whether morphemes are clearly differentiable or not. These subtypes are agglutinative and fusional (or inflectional or flectional in older terminology).
Morphemes in fusional languages are not readily distinguishable from the root or among themselves. Several grammatical bits of meaning may be fused into one affix. Morphemes may also be expressed by internal phonological changes in the root (i.e. morphophonology), such as consonant gradation and vowel gradation, or by suprasegmental features such as stress or tone, which are of course inseparable from the root.
The Indo-European and Semitic languages are the most typically cited examples of fusional languages. However, others have been described. For example, Navajo is sometimes categorized as a fusional language because its complex system of verbal affixes has become condensed and irregular enough that discerning individual morphemes is rarely possible. Some Uralic languages are described as fusional, particularly the Sami languages and Estonian. On the other hand, not all Indo-European languages are fusional; for example, Armenian and Persian are agglutinative, while English and Afrikaans lean more analytic.
Agglutinative languages have words containing several morphemes that are always clearly differentiable from one another in that each morpheme represents only one grammatical meaning and the boundaries between those morphemes are easily demarcated; that is, the bound morphemes are affixes, and they may be individually identified. Agglutinative languages tend to have a high number of morphemes per word, and their morphology is usually highly regular, with a notable exception being Georgian, among others.
In 1836, Wilhelm von Humboldt proposed a third category for classifying languages, a category that he labeled polysynthetic. (The term polysynthesis was first used in linguistics by Peter Stephen DuPonceau who borrowed it from chemistry.) These languages have a high morpheme-to-word ratio, a highly regular morphology, and a tendency for verb forms to include morphemes that refer to several arguments besides the subject (polypersonalism). Another feature of polysynthetic languages is commonly expressed as "the ability to form words that are equivalent to whole sentences in other languages". The distinction between synthetic languages and polysynthetic languages is therefore relative: the place of one language largely depends on its relation to other languages displaying similar characteristics on the same scale.
Many Amerindian languages are polysynthetic; indeed, most of the world's polysynthetic languages are native to North America. Inuktitut is one example, for instance the word-phrase: tavvakiqutiqarpiit roughly translates to "Do you have any tobacco for sale?". However, it is a common misconception that polysynthetic morphology is universal among Amerindian languages. Chinook and Shoshone, for instance, are simply agglutinative, as their nouns stand mostly separate from their verbs.
Oligosynthetic languages are ones in which very few morphemes, perhaps only a few hundred, combine as in polysynthetic languages. Benjamin Whorf categorized Nahuatl and Blackfoot as oligosynthetic, but most linguists disagree with this classification and instead label them polysynthetic or simply agglutinative. No known languages are widely accepted as oligosynthetic.
In constructed languages
Constructed languages (conlangs) take a variety of morphological alignments. Most universal auxiliary languages based on the Indo-European family take on the family's typical fusional alignment, such as Ido, Interlingua, and Volapük. Esperanto, however, is a notable exception: it is agglutinative. Fictional languages vary: among J. R. R. Tolkien's languages for the Middle-earth universe, for example, Sindarin is fusional while Quenya is agglutinative. The language Newspeak from 1984 is a variant of English derived from the very concept of agglutination. Among engineered languages, Toki Pona is completely analytic, as it contains only a limited set of words with no inflections or compounds. Lojban is analytic to the extent that every gismu (basic word, not counting particles) involves pre-determined syntactical roles for every gismu coming after it in a clause, though it does involve agglutination of roots when forming calques. Ithkuil, on the other hand, contains both agglutination in its addition of affixes and extreme fusion in that these affixes often result from the fusion of numerous morphemes via ablaut.
While the above scheme of analytic, fusional, and agglutinative languages dominated linguistics for many years—at least since the 1920s—it has fallen out of favor more recently. A common objection has been that most languages display features of all three types, if not in equal measure, some of them contending that a fully fusional language would be completely suppletive. Jennifer Garland of the University of California, Santa Barbara gives Sinhala as an example of a language that demonstrates the flaws in the traditional scheme: she argues that while its affixes, clitics, and postpositions would normally be considered markers of agglutination, they are too closely intertwined to the root, yet classifying the language as primarily fusional, as it usually is, is also unsatisfying.
R. M. W. Dixon (1998) theorizes that languages normally evolve in a cycle from fusional to analytic to agglutinative to fusional again. He analogizes this cycle to a clock, placing fusional languages at 12:00, analytic languages at 4:00, and agglutinative languages at 8:00. Dixon suggests that, for example, Old Chinese was at about 3:00 (mostly analytic with some fusional elements), while modern varieties are around 5:00 (leaning instead toward agglutination), and also guesses that Proto-Tai-Kadai may have been fusional. On the other hand, he argues that modern Finno-Ugric and Dravidian languages are on the transition from agglutinative to fusional, with the Finno-Ugric family being further along. Dixon cites the Egyptian language as one that has undergone the entire cycle in only about three thousand years.
The World Atlas of Language Structures (WALS) sees the categorization of languages as strictly analytic, agglutinative, or fusional as misleading, arguing that these categories conflate multiple variables. WALS lists these variables as:
- Phonological fusion – how intrinsically connected grammatical markers are phonologically to their host words
- Formative exponence – the number of categories expressed in a single marker (e.g. tense + number + gender for verbs in some languages)
- Flexivity – how possession is marked
These categories give rise to highly non-traditional distributions of typological traits. For example, high exponence for nouns (e.g. case + number) is typically thought of as a trait of fusional languages. However, it is absent in many traditionally fusional languages like French, Spanish, and Arabic but present in many traditionally agglutinative languages like Finnish, Yaqui, and Cree.
- Boas, Franz (2010). Handbook of American Indian Languages. 1. Nabu Press. pp. 74–76. ISBN 978-1-177-52533-6.
- Mithun, Marianne (2001). The Languages of Native North America. Cambridge University Press. p. 323. ISBN 978-0-521-29875-9.
- Sloane, Thomas O. (2001). Encyclopedia of Rhetoric. Oxford University Press. p. 442. ISBN 978-0-19-512595-5.
- Bybee, Joan. "Semantic Aspects of Morphological Typology" (PDF). University of New Mexico. Retrieved November 14, 2014.
- Blank, Detlev (1985). "Internationale Plansprachen. Eine Einführung" [International Planned Languages. An Introduction]. Sammlung Akademie-Verlag. Akademie-Verlag. ISSN 0138-550X.
- Tikka, Petri (2007). "The Finnicization of Quenya". Arda Philology: Proceedings of the First International Conference on J. R. R. Tolkien's Invented Languages, Omientielva Minya, Stockholm 2005. Arda Philology. 1. Arda Society. pp. 1–20.
- "Chapter 4: The Shape Of Words To Come: Lojban Morphology". Lojban.org. Retrieved November 19, 2014.
- "Chapter 2: Morpho-Phonology". Ithkuil.net. Retrieved November 19, 2014.
- Garland, Jennifer (2006). "Morphological Typology and the Complexity of Nominal Morphology in Sinhala" (PDF). University of California, Santa Barbara. Retrieved December 8, 2014.
- Dixon, R. M. W. (1998). The Rise and Fall of Languages. Cambridge University Press. pp. 42–43. ISBN 978-0-521-62654-5.
- "Chapter Fusion of Selected Inflectional Formatives". WALS. Retrieved August 5, 2014.
- "Chapter Exponence of Selected Inflectional Formativess". WALS. Retrieved August 5, 2014.
- "Chapter Possessive Classification". WALS. Retrieved August 5, 2014.
- "Linguistic typology" (PDF). (275 KiB), chapter 9 of Halvor Eifring & Rolf Theil: Linguistics for Students of Asian and African Languages
- The book Language: An Introduction to the Study of Speech by Edward Sapir (1921) contains a classic introduction to the subject.
- Japanese Morphological Analysis API Japanese Morphological Analysis API by NTT Resonant