An article is any member of a class of dedicated words that are used with noun phrases to mark the identifiability of the referents of the noun phrases. The category of articles constitutes a part of speech.
In English, both "the" and "a(n)" are articles, which combine with nouns to form noun phrases. Articles typically specify the grammatical definiteness of the noun phrase, but in many languages, they carry additional grammatical information such as gender, number, and case. Articles are part of a broader category called determiners, which also include demonstratives, possessive determiners, and quantifiers. In linguistic interlinear glossing, articles are abbreviated as ART.
A definite article is an article that marks a definite noun phrase. Definite articles such as English the are used to refer to a particular member of a group. It may be something that the speaker has already mentioned or it may be otherwise something uniquely specified.
For example, Sentence 1 uses the definite article and thus expresses a request for a particular book. In contrast, Sentence 2 uses an indefinite article and thus conveys that the speaker would be satisfied with any book.
The definite article can also be used in English to indicate a specific class among other classes:
However, recent developments show that definite articles are morphological elements linked to certain noun types due to lexicalization. Under this point of view, definiteness does not play a role in the selection of a definite article more than the lexical entry attached to the article.[clarification needed]
An indefinite article is an article that marks an indefinite noun phrase. Indefinite articles are those such as English "some" or "a", which do not refer to a specific identifiable entity. Indefinites are commonly used to introduce a new discourse referent which can be referred back to in subsequent discussion:
Indefinites can also be used to generalize over entities who have some property in common:
Indefinites can also be used to refer to specific entities whose precise identity is unknown or unimportant.
Indefinites also have predicative uses:
Indefinite noun phrases are widely studied within linguistics, in particular because of their ability to take exceptional scope.
A proper article indicates that its noun is proper, and refers to a unique entity. It may be the name of a person, the name of a place, the name of a planet, etc. The Māori language has the proper article a, which is used for personal nouns; so, "a Pita" means "Peter". In Māori, when the personal nouns have the definite or indefinite article as an important part of it, both articles are present; for example, the phrase "a Te Rauparaha", which contains both the proper article a and the definite article Te refers to the person name Te Rauparaha.
The definite article is sometimes also used with proper names, which are already specified by definition (there is just one of them). For example: the Amazon, the Hebrides. In these cases, the definite article may be considered superfluous. Its presence can be accounted for by the assumption that they are shorthand for a longer phrase in which the name is a specifier, i.e. the Amazon River, the Hebridean Islands. Where the nouns in such longer phrases cannot be omitted, the definite article is universally kept: the United States, the People's Republic of China. This distinction can sometimes become a political matter: the former usage the Ukraine stressed the word's Russian meaning of "borderlands"; as Ukraine became a fully independent state following the collapse of the Soviet Union, it requested that formal mentions of its name omit the article. Similar shifts in usage have occurred in the names of Sudan and both Congo (Brazzaville) and Congo (Kinshasa); a move in the other direction occurred with The Gambia. In certain languages, such as French and Italian, definite articles are used with all or most names of countries: la France/le Canada/l'Allemagne, l'Italia/la Spagna/il Brasile.
Some languages also use definite articles with personal names. For example, such use is standard in Portuguese (a Maria, literally: "the Maria"), in Greek (η Μαρία, ο Γιώργος, ο Δούναβης, η Παρασκευή) and in Catalan (la Núria, el/en Oriol). It can also occur colloquially or dialectally in Spanish, German, French, Italian and other languages. In Hungarian, the use of definite articles with personal names is quite widespread, mainly colloquially, although it is considered to be a Germanism.
This usage can appear in American English for particular nicknames. One prominent example occurs in the case of former United States President Donald Trump, who is also sometimes informally called "The Donald" in speech and in print media. Another is President Ronald Reagan's most common nickname, "The Gipper", which is still used today in reference to him.
A partitive article is a type of article, sometimes viewed as a type of indefinite article, used with a mass noun such as water, to indicate a non-specific quantity of it. Partitive articles are a class of determiner; they are used in French and Italian in addition to definite and indefinite articles. (In Finnish and Estonian, the partitive is indicated by inflection.) The nearest equivalent in English is some, although it is classified as a determiner, and English uses it less than French uses de.
Haida has a partitive article (suffixed -gyaa) referring to "part of something or... to one or more objects of a given group or category," e.g., tluugyaa uu hal tlaahlaang "he is making a boat (a member of the category of boats)."
A negative article specifies none of its noun, and can thus be regarded as neither definite nor indefinite. On the other hand, some consider such a word to be a simple determiner rather than an article. In English, this function is fulfilled by no, which can appear before a singular or plural noun:
In German, the negative article is, among other variations, kein, in opposition to the indefinite article ein.
The equivalent in Dutch is geen:
The zero article is the absence of an article. In languages having a definite article, the lack of an article specifically indicates that the noun is indefinite. Linguists interested in X-bar theory causally link zero articles to nouns lacking a determiner. In English, the zero article rather than the indefinite is used with plurals and mass nouns, although the word "some" can be used as an indefinite plural article.
Articles are found in many Indo-European languages, Semitic languages (only the definite article), and Polynesian languages; however, they are formally absent from many of the world's major languages including Chinese, Japanese, Korean, Mongolian, many Turkic languages (including Tatar, Bashkir, Tuvan and Chuvash), many Uralic languages (incl. Finnic[a] and Saami languages), Indonesian, Hindi-Urdu, Punjabi, Tamil, the Baltic languages, the majority of Slavic languages, the Bantu languages (incl. Swahili). In some languages that do have articles, such as some North Caucasian languages, the use of articles is optional; however, in others like English and German it is mandatory in all cases.
Linguists believe the common ancestor of the Indo-European languages, Proto-Indo-European, did not have articles. Most of the languages in this family do not have definite or indefinite articles: there is no article in Latin or Sanskrit, nor in some modern Indo-European languages, such as the families of Slavic languages (except for Bulgarian and Macedonian, which are rather distinctive among the Slavic languages in their grammar, and some Northern Russian dialects), Baltic languages and many Indo-Aryan languages. Although Classical Greek had a definite article (which has survived into Modern Greek and which bears strong functional resemblance to the German definite article, which it is related to), the earlier Homeric Greek used this article largely as a pronoun or demonstrative, whereas the earliest known form of Greek known as Mycenaean Greek did not have any articles. Articles developed independently in several language families.
Not all languages have both definite and indefinite articles, and some languages have different types of definite and indefinite articles to distinguish finer shades of meaning: for example, French and Italian have a partitive article used for indefinite mass nouns, whereas Colognian has two distinct sets of definite articles indicating focus and uniqueness, and Macedonian uses definite articles in a demonstrative sense, with a tripartite distinction (proximal, medial, distal) based on distance from the speaker or interlocutor. The words this and that (and their plurals, these and those) can be understood in English as, ultimately, forms of the definite article the (whose declension in Old English included thaes, an ancestral form of this/that and these/those).
In many languages, the form of the article may vary according to the gender, number, or case of its noun. In some languages the article may be the only indication of the case. Many languages do not use articles at all, and may use other ways of indicating old versus new information, such as topic–comment constructions.
|Albanian||Yes, as suffixes||Yes||Yes||No||Yes|
|Arabic||Yes, as prefixes||Yes, as suffixes||No||No||No|
|Armenian||Yes, as suffixes||No||No||No||No|
|Basque||Yes, as suffixes||Yes||No||Yes||Yes|
|Bengali||Yes, as suffixes||Yes||No||Yes||No|
|Bulgarian||Yes, as suffixes||No||Yes||Yes||Only masculine singular|
|Danish||Yes, before adjectives and/or as suffixes||Yes||Yes||Yes (if definite)||No|
|Dutch||Yes||Yes||Yes (if definite)||Yes (if definite)||No, except for the genitive case|
|French||Yes||Yes||Yes (if singular)||Yes||No|
|German||Yes||Yes||Yes||Yes (if definite)||Yes|
|Greek||Yes||Yes||Yes||Yes (if definite)||Yes|
|Hawaiian||Yes||Yes||No||Yes (if definite)||No|
|Hebrew||Yes, as prefixes||No||No||No||No|
|Icelandic||Yes, as suffixes||No||Yes||Yes||Yes|
|Macedonian||Yes, as suffixes||No||Yes||Yes||No|
|Norwegian||Yes, before adjectives and/or as suffixes||Yes||Yes||Yes (if definite)||No|
|Persian||Might be used optionally||Yes||No||No||No|
|Romanian||Yes, as suffixes||Yes||Yes||Yes||Yes|
|Somali||Yes, as suffixes||No||Yes||No||Yes|
|Swedish||Yes, before adjectives and/or as suffixes||Yes||Yes||Yes (if definite)||No|
|Turkish||No||Might be used optionally||No||No||No|
|Welsh||Yes||No||Causes initial consonant mutation to singular feminine nouns||No||No|
|Yiddish||Yes||Yes||Yes||Yes (if definite)||Yes|
|Language||definite article||partitive article||indefinite article|
|Albanian||-a, -ja, -i, -ri, -ni, -u, -t, -in, -un, -n, -rin, -nin, -në, -ën, -s, -së, -ës, -të, -it, -ët (all suffixes)||disa||një|
|Arabic||al- or el ال (prefix)||-n|
|Assamese||-tû, -ta, -ti, -khôn, -khini, -zôn, -zôni, -dal, -zûpa etc.||êta, êkhôn, êzôn, êzôni, êdal, êzûpa etc.|
|Bangla||-টা, -টি, -গুলো, -রা, -খানা||একটি, একটা, কোন|
|Breton||an, al, ar||un, ul, ur|
|Bulgarian||-та, -то, -ът, -ят, -те (all suffixes)||един/някакъв, |
|Catalan||el, la, l', els, les
ses, lo, los, es, sa
|Danish||Singular: -en, -n -et, -t (all suffixes)
Plural: -ene, -ne (all suffixes)
|Dutch||de, het ('t); archaic since 1945/46 but still used in names and idioms: des, der, den||een ('n)|
|French||le, la, l', les
||de, d', du, de la, des,
|un, une, des |
|German||der, die, das
des, dem, den
|ein, eine, einer, eines|
|Greek||ο, η, το
οι, οι, τα
|ένας, μια, ένα|
|Hebrew||ha- ה (prefix)|
|Icelandic||-(i)nn, -(i)n, -(i)ð, -(i)na, -num, -(i)nni, -nu, -(i)ns, -(i)nnar, -nir, -nar, -(u)num, -nna (all suffixes)|
|Irish||an, na, a' (used colloquially)|
|Italian||il, lo, la, l'
i, gli, le
|del, dello, della, dell'
dei, degli, degl', delle
|un', uno, una, un|
|Khasi||u, ka, i
|Luxembourgish||den, déi (d'), dat (d')
|däers/es, däer/er||en, eng |
|Macedonian||-от -ов -он -та -ва -на -то -во -но
-те -ве -не -та -ва -на (all suffixes)
|неколку||еден една едно |
|Manx||y, yn, 'n, ny|
|Malay (Both Malaysia and Indonesian)||Si (usually informal), Sang (more formal)|
|Māori||te (singular), ngā (plural)||he (also for "some")|
|Maltese||(i)l-, (i)ċ-, (i)d-, (i)n-, (i)r-, (i)s-, (i)t-, (i)x-, (i)z-, (i)ż- (all prefixes)|
|Nepali||euta, euti, ek, anek, kunai
एउटा, एउटी, एक, अनेक, कुनै
|Norwegian (Bokmål)||Singular: -en, -et, -a (all suffixes)
Plural: -ene, -a (all suffixes)
|en, et, ei|
|Norwegian (Nynorsk)||Singular: -en, -et, -a (all suffixes)
Plural: -ane, -ene, -a (all suffixes)
|ein, eit, ei|
|Pashto||yaow, yaowə, yaowa, yaowey|
يو, يوهٔ, يوه, يوې
|Persian||in, ān||yek, ye|
|Quenya||i, in, 'n|
|Romanian||-(u)l, -le, -(u)a
-(u)lui, -i, -lor (all suffixes)
|Scottish Gaelic||an, am, a', na, nam, nan|
|Sindarin||i, in, -in, -n, en|
|Spanish||el, la, lo,
|Swedish||Singular: -en, -n, -et, -t (all suffixes)
Plural: -na, -a, -en (all suffixes)
|Welsh||y, yr, -'r|
|Yiddish||דער (der), די (di), דאָס (dos), דעם (dem)||אַ (a), אַן (an)|
The following examples show articles which are always suffixed to the noun:
Examples of prefixed definite articles:
A different way, limited to the definite article, is used by Latvian and Lithuanian. The noun does not change but the adjective can be defined or undefined. In Latvian: galds, a table / the table; balts galds, a white table; baltais galds, the white table. In Lithuanian: stalas, a table / the table; baltas stalas, a white table; baltasis stalas, the white table.
Languages in the above table written in italics are constructed languages and are not natural, that is to say that they have been purposefully invented by an individual (or group of individuals) with some purpose in mind. They do, however, all belong to language families themselves. Esperanto is derived from European languages and therefore all of its roots are found in Proto-Indo-European and cognates can be found in real-world languages like French, German, Italian and English. Interlingua is also based on European languages but with its main source being that of Italic descendant languages: English, French, Spanish, Italian and Portuguese, with German and Russian being secondary sources, with words from further afield (but internationally known and often borrowed) contributing to the language's vocabulary (such as words taken from Japanese, Arabic and Finnish). The result is a supposedly easy-to-learn language for the world. As well as these "auxiliary" languages the list contains two more: Quenya and Sindarin; these two languages were created by Professor Tolkien and used in his fictional works. They are not based on any real-world language family (as are Esperanto and Interlingua), but do share a common history with roots in Common Eldarin.
When using a definite article in Tokelauan language, unlike in some languages like English, if the speaker is speaking of an item, they need not have referred to it previously as long as the item is specific. This is also true when it comes to the reference of a specific person. So, although the definite article used to describe a noun in the Tokelauan language is te, it can also translate to the indefinite article in languages that requires the item being spoken of to have been referenced prior. When translating to English, te could translate to the English definite article the, or it could also translate to the English indefinite article a. An example of how the definite article te can be used as an interchangeable definite or indefinite article in the Tokelauan language would be the sentence “Kua hau te tino”. In the English language, this could be translated as “A man has arrived” or “The man has arrived” where using te as the article in this sentence can represent any man or a particular man. The word he, which is the indefinite article in Tokelauan, is used to describe ‘any such item’. The word he is used in negative statements because that is where it is most often found, alongside its great use in interrogative statements. Though this is something to make note of, he is not used in just in negative statements and questions alone. Although these two types of statements are where he occurs the most, it is also used in other statements as well. An example of the use of he as an indefinite article is “Vili ake oi k'aumai he toki ”, where ‘he toki ’ mean ‘an axe’. The use of he and te in Tokelauan are reserved for when describing a singular noun. However, when describing a plural noun, different articles are used. For plural definite nouns, rather than te, the article nā is used. ‘Vili ake oi k'aumai nā nofoa’ in Tokelauan would translate to “Do run and bring me the chairs” in English. There are some special cases in which instead of using nā, plural definite nouns have no article before them. The absence of an article is represented by 0. One way that it is usually used is if a large amount or a specific class of things are being described. Occasionally, such as if one was describing an entire class of things in a nonspecific fashion, the singular definite noun te would is used. In English, ‘Ko te povi e kai mutia’ means “Cows eat grass”. Because this is a general statement about cows, te is used instead of nā. The ko serves as a preposition to the “te” The article ni is used for describing a plural indefinite noun. ‘E i ei ni tuhi?’ translates to “Are there any books?”
Articles often develop by specialization of adjectives or determiners. Their development is often a sign of languages becoming more analytic instead of synthetic, perhaps combined with the loss of inflection as in English, Romance languages, Bulgarian, Macedonian and Torlakian.
Joseph Greenberg in Universals of Human Language describes "the cycle of the definite article": Definite articles (Stage I) evolve from demonstratives, and in turn can become generic articles (Stage II) that may be used in both definite and indefinite contexts, and later merely noun markers (Stage III) that are part of nouns other than proper names and more recent borrowings. Eventually articles may evolve anew from demonstratives.
Definite articles typically arise from demonstratives meaning that. For example, the definite articles in most Romance languages—e.g., el, il, le, la, lo — derive from the Latin demonstratives ille (masculine), illa (feminine) and illud (neuter).
The English definite article the, written þe in Middle English, derives from an Old English demonstrative, which, according to gender, was written se (masculine), seo (feminine) (þe and þeo in the Northumbrian dialect), or þæt (neuter). The neuter form þæt also gave rise to the modern demonstrative that. The ye occasionally seen in pseudo-archaic usage such as "Ye Olde Englishe Tea Shoppe" is actually a form of þe, where the letter thorn (þ) came to be written as a y.
Multiple demonstratives can give rise to multiple definite articles. Macedonian, for example, in which the articles are suffixed, has столот (stolot), the chair; столов (stolov), this chair; and столон (stolon), that chair. These derive from the Proto-Slavic demonstratives *tъ "this, that", *ovъ "this here" and *onъ "that over there, yonder" respectively. Colognian prepositions articles such as in dat Auto, or et Auto, the car; the first being specifically selected, focused, newly introduced, while the latter is not selected, unfocused, already known, general, or generic.
Standard Basque distinguishes between proximal and distal definite articles in the plural (dialectally, a proximal singular and an additional medial grade may also be present). The Basque distal form (with infix -a-, etymologically a suffixed and phonetically reduced form of the distal demonstrative har-/hai-) functions as the default definite article, whereas the proximal form (with infix -o-, derived from the proximal demonstrative hau-/hon-) is marked and indicates some kind of (spatial or otherwise) close relationship between the speaker and the referent (e.g., it may imply that the speaker is included in the referent): etxeak ("the houses") vs. etxeok ("these houses [of ours]"), euskaldunak ("the Basque speakers") vs. euskaldunok ("we, the Basque speakers").
Speakers of Assyrian Neo-Aramaic, a modern Aramaic language that lacks a definite article, may at times use demonstratives aha and aya (feminine) or awa (masculine) – which translate to "this" and "that", respectively – to give the sense of "the".
Indefinite articles typically arise from adjectives meaning one. For example, the indefinite articles in the Romance languages—e.g., un, una, une—derive from the Latin adjective unus. Partitive articles, however, derive from Vulgar Latin de illo, meaning (some) of the.
The English indefinite article an is derived from the same root as one. The -n came to be dropped before consonants, giving rise to the shortened form a. The existence of both forms has led to many cases of juncture loss, for example transforming the original a napron into the modern an apron.
The Persian indefinite article is yek, meaning one.