TECHNOLOGY REPORT article

Front. Digit. Humanit., 31 March 2017

Sec. Digital Literary Studies

Volume 4 - 2017 | https://doi.org/10.3389/fdigh.2017.00006

A Simple Set of Rules for Characters and Place Recognition in French Novels

  • CB

    Cyril Bornet *

  • FK

    Frédéric Kaplan

  • Digital Humanities Laboratory, DHI, CDH, EPFL, Lausanne, Switzerland

Article metrics

View details

13

Citations

68,8k

Views

51,1k

Downloads

Abstract

This article describes a simple unsupervised system for automatic extraction and classification of named entities in French novels. The solution presented combines a set of different standalone classifiers within a meta-recognition system. The system is tested on 35 classic French novels, representing 5 million words and 3,700 names of people and places. The results demonstrate that although none of the standalone methods clearly outperforms the others, their combined classification offers a robust solution in this context.

1 Introduction

The systematic extraction of characters and places from novels could potentially contribute to new analyses and ultimately better understanding of fictional stories and the way they were written. The ability to view the authors’ narrative strategies by plotting the interleaved presence of recurring characters and places opens new forms of distant visualization and facilitates a distant reading comparison of novels (Moretti, 2013). Similarly, connecting distinct novels by the places and characters they share may offer interesting and innovative navigation and exploration means, as already suggested by some real-world applications (Albanese, 2011).

This paper presents a simple system for automatically extracting named entities within French novels. Named Entity Recognition (NER) has become reasonably well developed, driven notably by the needs for machine translation and automatic information extraction out of flows of news articles (Hirschberg and Manning, 2015). However, only a small number of systems perform well on languages other than English, and in particular, as in our case, on French contents (Azpeitia et al., 2014). Previous initiatives targeted specifically old French newspapers corpora (Mosallam et al., 2014); however, fictional novels are challenging in a different way since their referents are typically not listed in databases commonly used by many NER systems, like DBPedia,1 Wikidata,2 or YAGO.3 One additional difficulty with non-English languages is the limited amount of readily available sets to train machine learning recognizers. Therefore, solutions that can achieve good results without the need of training beforehand are highly relevant.

Our solution4 combines a set of different standalone unsupervised recognition strategies feeding a meta-recognition system. This approach, whose concept was first described aiming for improved accuracy needs in biometrics, has proven to be effective in various other fields (Scheirer et al., 2011), and among them notably NER (Si et al., 2005). The rationale behind meta-recognition is to reach out for a set of independent classifiers, each looking at different features and possibly using different strategies. One would then expect that although none of the standalone method clearly outperforms the others, their combined judgment offers a robust solution to the tackled recognition problem, as their individual errors will tend to cancel out.

The next sections describe the performances obtained with state of the art tools compared to each of our classifiers on a 5M tokens test corpus, and the overall gain brought by averaging their results using a fixed weight and a meta-recognition algorithm. The conclusion outlines some applications for analyses of novels that can be performed on the outputted categorization.

1.1 Test Procedure

We built a test corpus made out of 35 classic works (5M tokens) in French, stemming from digitized books part of Project Gutenberg.5 The works were handpicked in a way to get a mix of different genres and narration types. They include 3 sagas, 6 classical books that were translated from other languages and 25 French classical novels. For each piece of work, we produced a corresponding reference file by extracting the proper nouns occurring more than five times and manually labeling each of them with the category they fall in (character, place, or other). The resulting database references about 3,700 categorized named entities, along with the work they come from and their number of occurrences.

Given our further needs, we considered names referring to pets, companies, and groups of persons as regular characters whenever they serve the same narrative purpose (people mostly interacting with corporations or groups of people). Conversely, they were considered places in the case of names used to qualify where actions take place (as with corporations considered as locations), or ethnical groups used to name their originating place (as in “le pays des Oreillons”). This differs a lot from most knowledge databases, as the very bounds between characters and places might be fuzzy and context dependent. For instance, if the word “France” is part of the expression “Roi de France,” we do not want it to be labeled as a place, but the whole entity as a character instead.

For the actual test procedure, we ran each classifier on our corpus and computed the averagely yield precision (positive predictive value), recall (true positive rate) and F1 score of the classification task. Section 2 shortly discusses state of the art of existing tools under the same constraints, and sections 3.2 to 3.6 give the details of the implementations of each classifier, and the obtained performances.

2 NER in Novels

The field of

NER

is evolving quickly, as big data approaches and access to increasingly large corpora brought up several projects promising interesting results. However, these tools remain mainly purposed to summarize, categorize, and extract meaning out of short texts that relate to the outside world (e.g., news articles, encyclopedias, or otherwise informative texts). Analyses on big portions of self-contained texts like entire chapters or full fiction books, whose named entities are difficult to look up and interpret out of their context, remain a challenging endeavor. Novels in particular are different from traditional applications for

NER

techniques for at least two reasons:

  • Novels create a world of their own, with own recurring characters and places that can be hard to understand out of their context. Many traditional NER methods do not exploit this characteristic as they on the contrary rely on external databases to identify and classify named entities (Jovanovic et al., 2014).

  • Each novel tends to be characterized by specific stylistic features whose purpose precisely is to give the narration a unique taste. On the one hand, a novelist’s writing diverges in nature from journalistic, academic, or other forms of non-fiction writings, and on the other hand, it is usually purposely meant to introduce noticeable differences compared to the style of other authors. This phenomenon is obvious enough that it was shown to allow to confidently attribute texts to their authors in many contexts (Stamatatos, 2009). This intrinsic diversity makes a one-size-fits-all approach to NER difficult and is problematic for pattern-inducing machine learning algorithms.

In order to establish a baseline for our study, we conducted an initial evaluation with one of the current widely used NER systems. OpeNER6 is a multilingual and powerful NER pipeline, promising state of the art results on French corpora (Azpeitia et al., 2014). Even if it was not designed to handle big amounts of text,7 its processing routine is efficient and produces XML result files that can easily be processed and compared to our reference data. Since OpeNER is extracting a wider range of entities than the ones we considered (such as date and time information), we only considered the relevantly tagged words and manually resolved naming differences in order to keep the comparison fair and accurate.

3 A Set of Simple Rules

3.1 Preprocessing

In general, spotting proper nouns in French texts is a rather easy task due to capitalization rules, that are quite similar to English if not simpler due to less false positives (e.g., words related to languages, or days and months are written in lower case) (Geno,

1992

). Still, the task is not entirely trivial, because we need to filter out capitalization due to sentences starts and miscellaneous stylistic effects (such as subtitles, quotes, and verses). Additionally, we want full names extracted as single entities and remove false positives, like honorific titles or named time periods (Grevisse and Lenoble-Pinson,

2009

). In order to do that, we designed a two-pass method:

  • First, our scripts extract a list of all proper nouns that are not leading sentences. To do this, we compute all 1, 2, and 3 g of each sentence while letting out the first word. We then keep all isolated proper nouns (i.e., capitalized words surrounded by lower cased ones), pairs of juxtaposed proper nouns, and triplets of proper nouns joined by an hyphen or a nobiliary particle,8 each of them as if they were single entities.

  • Then, we run a second pass on the full text, where we look for all sentences containing each noun we identified in the first run to include legitimate nouns leading sentences we had left out. For each of them, we store an index of the sentences they appear in for further processing.

This approach allows to recover all occurrences of each capitalized word, as long as they are not systematically at the start of sentences. In practice, the resulting list needs to be refined thereafter, as some capitalized common nouns still happen to end up in it. The reasons for this may be multiple, ranging from sentence tokenization errors, typos in the source text or other stylistic effects that may influcence punctuation and case. This refining can be done accurately and efficiently by combining three strategies:

  • by computing and comparing the mean positions of each word in the sentences they were found in. As we may expect, the distribution of words that are usually capitalized only when leading the sentences will concentrate toward 0, whereas the real proper nouns’ will tend to even out. Figure 1 illustrates that the typical distributions often allow for easy separation, with very few outliers.

  • The said outliers being usually connection words (e.g., pronouns and prepositions) that are often capitalized in different contexts, they can be filtered out using a simple French stopwords list.

  • Third, words related to nationalities or ethnic groups can be easily filtered out as they are usually present in both their singular and plural forms, whereas genuine proper nouns will usually not follow this characteristic. Hence, we remove the words ending in “s” that can also be found without the “s” throughout the same text.

Figure 1

Once identified, proper nouns usually fall in three main categories that serve different purposes to narration: they can namely be characters, places, or others (brands, abstract concepts, acronyms…). We designed and evaluated six independent classifiers. Each classifier gets one word at a time as an input as well as the context that is necessary and relevant for its way of processing data, and returns the predicted category (namely character, place, or other). We first present the implementation characteristics of each component before looking in details at the resulting scores.

3.2 Classifier #1: Obvious Context, Titles, and Predicates

When one encounters a proper noun in a sentence, a good guess on its nature can sometimes easily be taken due to the immediate context. The simplest case, which we will refer here as obvious context, would be if the noun is immediately preceded by a title or a predicate that hints at what it refers to. For instance, the name Vilquin could probably refer to anything without clear preference when out of any context, but it would be very easy to classify, respectively, as a person or as a place, if at least one sentence was mentioning “[…] monsieur Vilquin […]” or, conversely, “[…] rue Vilquin […].”

For this classifier, we compiled a simple list of obvious context classifiers that allow to make good guesses about the nature of the immediately or next to following proper noun:

  • In the case of characters, the list is basically the usual name titles of the French language, such as madame or docteur.9

  • For places, we considered the most common predicates used to qualify toponyms, as for instance ville, avenue, or rue.

  • For other words we wanted specifically to filter out, we added a short list of terms referring to deities (dieu, jésus, marie, vierge, saint).

3.3 Classifier #2: Naive Position

In French, like in many other languages, the grammatical structure makes it more likely for sentences to follow a pattern that puts the subject of the action at the beginning, and the location toward the end. This characteristic can be used when one looks at enough examples to make a simple, yet quite powerful guess about the global roles of the proper nouns.

The accuracy of this classifier is indeed strongly dependent on the writing style of the author, as the frequent use of specific figures of speech may break its work hypothesis, and longer sentences may narrow the gap between the categories or blurry the bounds. This can be seen clearly in Figure 2, where we show the relative positions of identified classes of names for three different stories. In those examples, we can see the effectiveness of a separation guess at around 45% of sentences’ length, which is expected to yield quite good results for the first two books, yet a bit more disappointing ones for the third one.

Figure 2

3.4 Classifier #3: Semantics out of Neighboring Words

Inspired by Latent Semantic Analysis (Dumais, 2005), in which it is hypothesized the meaning of words are the result of their neighbors’, a slightly more sophisticated approach consists of a broader look at neighboring words semantics. This approach is very different from section 3.2 in that we are not expecting the surrounding words to qualify the noun per se but to hint at its nature due to the actual meaning of the near context. For this implementation, we compiled lists of words that are more likely (but not exclusively) to appear, respectively, nearby characters, places, or abstract concepts. For instance, we expect names of characters to be more often surrounded by words related to emotions, body functions, speech, or professions, whereas names of places would be more closely related to motion verbs, place features, and prepositions.

Starting from common nouns that are unambiguously related to one of the categories we are interested in, we used a French synonyms dictionary service10 to put up a list aiming to be as extensive as possible. The final files resulted in 4,500 words for characters, 670 for places and 50 for concepts (see Appendix B for the complete list). The script then looks for these words in the neighborhood of the nouns to be disambiguated and returns the most probable category.

3.5 Classifier #4: Grammatical Structure

As characters and places serve different narrative purposes, one may expect the grammatical constructs surrounding them to differ in a significant way. For instance, place names are often preceded with prepositions or determiners, whereas it is expected for character names to be more often directly followed by verbs. We thus introduced a script classifying names based on its knowledge of the full text, grammatically tagged using TreeTagger,11 and tokenized in sentences. To guess the nature of the names, it then matches all sentences containing them against a set of rules that are typical constructions one uses when writing about a person or a place.

We tried out a set of seven manually established rules covering the most straightforward grammatical constructs (described in details in Table 1), plus two that help filter out tokenization errors at a sentence level by flagging words that are preceded by a punctuation mark or that are alone in their sentence. Formally, rules are patterns possibly matching the set {ti−2, ti−1, ti+1, ti+2} of grammatical tags attributed to up to four neighboring tokens {wi−2, wi−1, wi+1, wi+2} for each proper noun wi. When matched, they increase or decrease the probability score for one or several classifications, and the category yielding the highest score is chosen and returned in the end.

Table 1

ti−2ti−1ti+1ti+2Result
(At least one is a verb)Character
VERCharacter
NAMCharacter
PRPPRPPlace
VERPlace
DETPlace
PRP(Empty)(Empty)Concept
PUNUnknowna
(Empty)(Empty)Unknowna

Rules for grammatical structure classifier.

aProbable tokenization error.

VER, verb; NAM, proper noun, PRP, preposition; DET, determiner; PUN, punctuation.

3.6 Classifier #5: Online-Querying

A lot of proper nouns can be non-ambiguously or with a high probability related to one or several categories based on general knowledge. For instance, a human could make a guess that “Elisabeth” will most likely refer to a person, whereas “Manhattan” is likely to be a place and “Amour,” a concept. But the same knowledge may equivocally tell that those same words could also potentially be related to the ship (RMS Queen Elisabeth), an abstract concept (project Manhattan), or a place (Amur river), probably with a lower likelihood if no other context is available.

For many nouns, the knowledge we are looking for is well captured in the categorization of their related Wikipedia pages. Using categories instead of the text of the articles also presents the advantages of being very straightforward and reduces a lot noisy signals related to text processing techniques. To test this idea, we implemented a simple algorithm that gathers the categories of the page whose name is closest to the noun we are looking for and looks for ones tagging people, places, or abstract concepts. In the case no category gives a hint (which tends to happen both with very complex or very precise pages), it tries to recursively walk up the hierarchy until the necessary clues are found.

3.7 Classifier #6: Quotes-Based

Several works already showed the relevance of locating direct and indirect speech parts to identify characters in novels (Glass and Bangay, 2007; Goh et al., 2012; Karsdorp et al., 2012). Most of these approaches rely heavily on the lexical database WordNet12 to find out speech-related verbs and refine their accuracy, but for performance reasons and since we wanted the classifiers to remain efficient even on very long texts we implemented a simpler version that simply checks the proximity of detected proper nouns to quotation marks. For each proper noun w appearing mw times, the system would essentially count the number qw of mentions that appear near quotations. It then computes: the quotation ratios for each noun, and: the median value of all ratios, to be considered the differentiation threshold for this book. Each noun w is then assigned the character class if rw is higher than or place otherwise.

4 Meta-Classification

Once all classifiers returned their answer for a given word, the last step is to compare these results and to decide on a final answer. This meta-classification step can be done by voting systems, choosing the final result according to the majority of predictions using various strategies, or by a meta-recognition system, aiming to discard classifiers that seem to have encountered a problem on the considered text file. We implemented and discussed the performance of four distinct meta-classification methods.

4.1 Simple Vote

The easiest and most obvious solution to average the different classifications is a simple voting system (i.e., the classification who gets the majority of results wins). However, since there is an even number of classifiers, ties are to be expected. This situation is quite unlikely since it would require exactly three classifiers deciding correctly, and the three others agreeing incorrectly on a wrong categorization. Still, in case, this situation occurred the final choice would be non-deterministic by lack of model to support one option or the other. For this reason, we introduced a second meta-classification, which involves for each classifier to compute a confidence self-assessment score.

4.2 Self-Assessed Confidence

For most classifiers, their internal mechanics allow themselves to evaluate to which extent the strategy they are using seems likely to return reliable results, given the current work context. Hence, a simple strategy to help the voting process in the case of ties is for each classifier to return a confidence index, between 0 and 1.

The used self-assessing strategies are as follows:

  • For Obvious Context, Semantic, and Wikipedia classifiers (sections 3.2, 3.4, and 3.6), we use where Cmax is the count of most and Cmin of least represented categories for which we found clues. For instance, if the Semantic classifier finds 6 neighboring words related to places, 3 to people and 1 to abstract concepts, the confidence index will be . This index is thus expected to equal 1 if the decision was made with no ambiguity and 0 if the clues were equally distributed.

  • For Naive Position, Grammatical, and Quotes (sections 3.3, 3.5, and 3.7) classifiers, we use the difference between the splitting threshold and observed decision value, normalized between 0 and 1. For instance, considering the Quotes classifier computes a ratio of 0.4 for some noun with a threshold of 0.2, the result will be classified as a person with a confidence index of . Again, this index is expected to see its value tend toward 0 for ambiguous cases and toward 1 for the more definite ones.

On top of that, some classifiers are given the possibility to return 0 to mark their results as known to be invalid, and thus irrelevant at voting time. This can happen for instance when we do not find any known title preceding a word throughout the text, if no grammatical rule could be matched, or if Wikipedia does not have any result for the searched word.

The improved voting algorithm then first discards all classifications that have a confidence mark of 0 and proceeds to a simple vote between the remaining ones for each noun. In case of a tie, the results rated with the highest confidence will be privileged.

4.3 Fixed Weighting

Not all classifiers exhibit the same behavior regarding precision and recall. It thus can be justified to put more confidence on some of them in cases when we know they are more likely to succeed. For this test, we used manually set weights putting more importance to the obvious context classifier (section 3.2), due to its high precision rate, all others being treated equally. With the help of confidence rating (section 4.2), we know the low recall rate will not impact negatively on the other classifiers because it will return a confidence score of 0 if it could not find any classification clue. Hence, those cases will be discarded regardless of the coefficient. A good compromise can be reached by giving 3 times more weight to the obvious context classifier, allowing the others to still easily overpower it in the unlikely case a majority of them reach a contradictory agreement.

4.4 Meta-Recognition Approach to Optimized Weighting

A meta-recognition algorithm follows the idea of improving its accuracy by entirely removing one classifier if it detects it is consistently failing, typically due to stylistic biases or other broken assumptions on the considered book. Given the global classification results, one can easily compute an agreement score between the different classifiers, for instance using the Fleiss’ Kappa method (Fleiss and Cohen, 1973). In case this indicator hints at discrepancy, we can simply recompute all averages by systematically letting out one of the classifiers, until the new Fleiss’ Kappa value increased. Our hypothesis here is that since the remaining classifiers reached a higher agreement, the discarded one must have globally failed in some way and needs to be put aside.

5 Results

Let us consider in Figure 3 the precisions vs. recall results that each of the six classifiers achieved on our test corpus. One can immediately see a typical pattern in any information retrieval system: one parameter is detrimental to the other, and no two classifiers behave in a similar way. We can also see that for each of them, some books get incredibly good results, and few others turn out very bad. Interestingly and as backed up by the full numerical values shown in Table 2, those are almost never the same, confirming our hypothesis that some methods may work way better (or worse) on some texts, giving a strong justification for the multi-classifier Mcapproach. The averaged results seem to confirm this intuition. In Figures 4 and 5, we can see that all meta-classification schemes overall pushed the results toward the top, and at the same time made the clustering denser, hence reducing the differences between the books and output more consistant results by removing the worse outliers.

Figure 3

Table 2

1. Obvious context
2. Naive position
3. Semantics
4. Grammar
5. Wikipedia
6. Quotes
PRF1PRF1PRF1PRF1PRF1PRF1
20000 Lieues sous les mers0.6670.2690.38280.8060.7630.78390.7360.6090.66660.8000.8570.82750.6270.6520.63940.5940.5570.6567
Au bonheur des dames0.9470.3540.51530.7370.8640.79560.8390.6950.76070.7780.8400.80770.5920.7010.64220.6580.6110.6033
Belle rose0.9800.4190.58680.7730.8590.80870.8350.8120.80590.7570.8310.78760.3470.3630.34980.6930.6370.6404
Bijou0.9050.2660.41070.4110.5780.45490.7920.7660.77840.5650.8590.67100.2620.5160.33780.5080.7190.5897
Blanche et Bleue1.0000.3120.47620.7610.8230.79060.6590.5420.59460.8020.8850.84180.6340.7080.66890.6020.4060.3379
Boule de Suif1.0000.1540.26670.5430.8080.64930.8570.4420.58350.6210.8850.72990.4070.4420.42400.5950.7310.6561
Candide0.9170.2080.33950.9060.8750.89040.9440.4870.64280.9060.8750.89040.7830.5830.66840.9170.5510.6557
Colomba1.0000.1670.28570.6370.7500.68920.6620.5560.60400.8020.9170.85560.5770.6390.60630.7710.8890.7615
La Comedie Humaine0.8820.4510.59590.6350.8110.70980.7030.6410.67800.6560.8570.73790.5520.6330.58060.6260.6980.6478
Consuelo0.9400.2390.38130.6570.7280.69050.8020.6960.74510.6820.7870.73110.5760.6500.61080.7590.7150.7299
David Copperfield1.0000.3330.50000.5930.6880.63690.7870.7850.78590.7630.9420.84320.6350.8120.71260.6240.6050.6059
Don Quichotte0.9740.3130.47330.5790.7920.66690.7620.5960.72580.5640.8110.66210.4770.5930.53420.6130.6590.6284
Germaine0.8890.3370.48880.5360.5760.55500.6080.5490.57730.6850.7420.71260.6040.6550.62860.7350.5270.6243
Germinal1.0000.1280.22660.7310.7890.75870.7720.6170.71200.8210.8890.85380.6370.6940.66460.4690.4110.4256
Germinie Lacerteux0.8330.2000.32260.5621.0000.60870.8570.5000.63160.5000.8000.59610.2080.6000.26670.4170.6000.4918
Jane Eyre1.0000.2050.33960.7050.8640.77600.8960.9200.90800.8460.9660.90210.6130.7060.65590.5980.5060.4943
La Bête Humaine1.0000.3210.48580.8330.9170.87300.8750.7960.80500.8570.9380.89550.5590.6000.57890.6610.5880.6146
La Femme du mort0.9640.3970.56240.5080.7880.61220.6950.6440.70460.7410.9050.80250.3320.6520.41110.5440.6730.5525
L’Assommoir0.9710.2420.38800.7430.8640.79890.8150.5940.68710.7940.8790.83430.6560.7420.69640.6170.6030.6146
Le Mort vivant1.0000.2070.34290.6670.7070.68620.8100.5670.66690.7660.8580.80930.4960.5020.49890.6820.6230.6404
Le Parfum de la dame en noir1.0000.2740.43040.6890.6090.64670.7030.6390.66930.9030.9190.91120.6550.6830.66860.7670.5830.6494
Les Malheurs de Sophie0.8330.1250.30880.6170.8120.70120.6880.6880.43750.7500.9380.90910.4330.5000.58040.8330.9380.8537
Les Miserables0.9330.3500.50910.7390.8000.76550.8200.7750.82410.7840.8330.80240.4240.4420.42080.7110.6670.6624
Les Tribulations d’un Chinois1.0000.1320.23400.7100.8080.75560.7920.7390.76460.7570.8570.80360.5060.6030.55000.7080.7180.6344
Les trois mousquetaires0.9230.2750.42380.6570.7620.70610.8520.7520.79880.7510.8750.80850.4490.5300.48590.6960.6820.6653
Les Vacances0.8890.1750.29240.5620.6830.61710.8070.7830.79500.6670.8750.75680.5310.5830.55620.6670.6830.6308
Le Tour du Monde en 80 jours1.0000.1900.32000.7240.7010.70860.7380.6100.69250.8940.9390.91030.6520.6910.66540.7710.6850.7389
Le voluptueux voyage1.0000.0420.08000.6710.7010.67440.8750.4660.60810.9230.9580.92440.4200.4360.37560.6790.6170.6256
L’Iliade0.9500.1170.20820.6200.7470.67280.7160.5540.62020.6510.7970.71210.5260.6260.56080.5710.3420.4083
L’Odyssée0.9290.1110.19850.6270.8070.70560.5320.4300.51090.6420.7700.70040.5220.7040.59950.5930.3280.4318
Madame Bovary1.0000.2410.38810.7580.8160.78630.7070.6620.70470.8930.9630.92660.6110.6600.63430.5050.4390.6315
Les Mystères de Paris0.9610.3450.50810.5650.6710.61330.8770.8350.85550.6600.8410.73970.4720.5370.50240.6460.6580.6287
Nana1.0000.2120.32820.5740.8360.69410.6380.6440.69360.6390.9060.76820.4200.5130.47650.5490.6700.5344
Notre-Dame de Paris0.8570.2660.40630.7040.7200.71180.7680.6310.68800.7760.8370.80550.7420.8330.78480.7480.7110.7614
Récits d’un soldat1.0000.0000.00000.6670.7050.68510.6250.4770.54120.9581.0000.97870.4580.5230.48840.6290.7730.6674
Tarass Boulba1.0000.1670.28570.6260.8440.71920.8210.7000.75590.7031.0000.82580.5860.8440.69180.6500.6220.6358
Therèse Raquin0.8330.2780.41670.8380.8330.83540.8750.7780.82350.8750.8330.85370.8890.8890.88890.7330.5560.5636

Average0.9490.2390.3690.6560.7740.7060.7710.6430.6950.7510.8810.8080.5250.6170.5620.6520.5980.615
SD0.0700.1010.1330.1100.0910.0880.1050.1240.1020.1110.0650.0850.1430.1190.1300.0990.1290.103

Precision and recall scores, per classifier and per class.

Lowest and highest scores are higlighted in bold.

Figure 4

Figure 5

By looking at the numerical results (Tables

2

and

3

), several interesting facts can be stated about each classifier:

  • Classifier #1: as one may expect with this kind of simple implementation, the resulting predictions usually translate to a very high precision (100% in nearly 2/3 of the cases, and on average 0.949) but a poor recall rate (only 0.239), as the presence of enough context words leaves little doubt on the categorization, though unlikely to happen for most of the nouns we would want to disambiguate. It is also interesting to see that its highest score was achieved on La Comédie Humaine, one of our longest samples, with an F1 score of 0.596. Actually, all three sagas ranked quite high compared to the shorter books (Les Misérables and Les Mystères de Paris both got an F1 score higher than 0.5).

  • Classifier #2: unsurprisingly, this classifier is one of those that have the overall worst performance when compared to the others. However, one has to note that with a precision and recall of respectively 0.654 and 0.773, this very simple method still outperforms possible more complex ones.

  • Classifier #3: the semantics approach surprisingly did not perform as well as one may have expected. Its precision of 0.769 is the second highest, but it comes with a cost of a recall of 0.651, which brings its F1 score down even below the one of #2.

  • Classifier #4: the grammar classifier resulted in the overall highest F1 score (0.807), with a precision of 0.750 and a recall of 0.879. One can notice it performed best, with a very satisfying 0.909 F1 score, on Les Malheurs de Sophie, which could be explained by since the book was written for children, its syntax might be more regular than other novels.

  • Classifier #5: the quotes classifier did not perform very well. Its F1 score (0.568) is the one to lowest and it has the lowest precision rate of all (0.528). Yet it still achieved a very good score on the novel Thérèse Raquin, where it actually got the highest precision, recall, and F1 scores amongst all classifiers.

  • Classifier #6: this method resulted in a rather disappointing overall precision and recall of 0.644 and 0.597, respectively, which can be explained by the expected lack of fictional entities represented on Wikipedia, and among those who seem to be there, many would actually relate to real-world counterparts that may be very different from the fictional use of the same word.

Table 3

1. Obvious context
2. Naive position
3. Semantics
4. Grammar
5. Wikipedia
6. Quotes
PcharPplacPcharRplacPcharPplacPcharRplacPcharPplacPcharRplacPcharPplacPcharRplacPcharPplacPcharRplacPcharPplacPcharRplac
20000 Lieues sous les mers0.5000.8330.0370.5000.8620.7500.9260.6000.9330.5380.5190.7000.9570.6430.8150.9000.8260.4290.7040.6000.8330.5450.7410.600
Au bonheur des dames1.0000.8950.3120.3950.5560.9190.9380.7910.8750.8040.4380.9530.6670.8890.7500.9300.3850.8000.9380.4650.5240.7930.6880.535
Belle rose1.0000.9600.2220.6150.5810.9661.0000.7180.7780.8920.7780.8460.5480.9660.9440.7180.2160.4780.4440.2820.5520.8330.8890.385
Bijou1.0000.8100.0000.5310.1000.7220.7500.4060.7500.8330.7500.7810.3080.8211.0000.7190.1150.4090.7500.2810.2310.7860.7500.688
Blanche et Bleue1.0001.0000.3750.2500.5770.9440.9380.7080.5450.7730.3750.7080.6520.9520.9380.8330.5710.6960.7500.6670.4550.7500.3120.500
Boule de Suif1.0001.0000.0000.3080.2860.8001.0000.6151.0000.7140.5000.3850.3330.9091.0000.7690.1000.7140.5000.3850.3330.8571.0000.462
Candide1.0000.8330.0000.4170.8121.0001.0000.7501.0000.8890.3080.6670.8121.0001.0000.7500.5651.0001.0000.1670.8331.0000.7690.333
Colomba1.0001.0000.0000.3330.3750.9001.0000.5000.5000.8240.3330.7780.6670.9381.0000.8330.3080.8460.6670.6110.6670.8751.0000.778
La Comedie Humaine0.8570.9080.2310.6710.2711.0001.0000.6210.5330.8720.3080.9750.3290.9831.0000.7140.1830.9200.7690.4970.2880.9640.7310.665
Consuelo1.0000.8800.0000.4780.4000.9131.0000.4570.7500.8540.5000.8910.4860.8790.9440.6300.3780.7740.7780.5220.6360.8820.7780.652
David Copperfield1.0001.0000.0000.6670.1851.0001.0000.3770.6360.9380.7000.8700.5261.0001.0000.8840.2701.0001.0000.6230.3570.8910.5000.710
Don Quichotte1.0000.9470.2860.3400.2190.9391.0000.5850.6670.8570.2860.9060.2590.8681.0000.6230.1470.8060.7140.4720.3120.9140.7140.604
Germaine1.0000.7780.0910.5830.5000.5710.8180.3330.6670.5500.1820.9170.6430.7270.8180.6670.5710.6360.7270.5830.6360.8330.6360.417
Germinal1.0001.0000.0560.2000.5830.8780.7780.8000.7000.8440.3890.8440.6431.0001.0000.7780.4050.8700.9440.4440.2500.6880.3330.489
Germinie Lacerteux1.0000.6670.0000.4000.5000.6251.0001.0001.0000.7140.0001.0000.2500.7501.0000.6000.1670.2501.0000.2000.3330.5001.0000.200
Jane Eyre1.0001.0000.0000.4090.4091.0001.0000.7270.8180.9741.0000.8410.6921.0001.0000.9320.2670.9580.8890.5230.3330.8620.4440.568
La Bête Humaine1.0001.0000.1000.5420.6671.0001.0000.8330.8000.9500.8000.7920.7141.0001.0000.8750.3680.7500.7000.5000.4210.9000.8000.375
La Femme du mort1.0000.9290.4000.3940.2860.7311.0000.5760.6250.7650.5000.7880.6920.7890.9000.9090.2940.3701.0000.3030.4210.6670.8000.545
L’Assommoir1.0000.9410.0000.4850.5260.9601.0000.7270.6670.9630.4000.7880.5881.0001.0000.7580.3700.9411.0000.4850.4000.8330.6000.606
Le Mort vivant1.0001.0000.0000.4140.3331.0001.0000.4140.6670.9520.4440.6900.5710.9600.8890.8280.2270.7650.5560.4480.4550.9090.5560.690
Le Parfum de la dame en noir1.0001.0000.0000.5480.4900.8890.9600.2580.6820.7240.6000.6770.8061.0001.0000.8390.6210.6900.7200.6450.8670.6670.5200.645
Les Malheurs de Sophie1.0000.6670.0000.2500.4000.8331.0000.6250.5000.8750.5000.8750.5001.0001.0000.8750.2000.6670.5000.5000.6671.0001.0000.875
Les Miserables1.0000.8670.2670.4330.5270.9500.9670.6330.8100.8310.5670.9830.6970.8710.7670.9000.2730.5750.5000.3830.5640.8570.7330.600
Les Tribulations d’un Chinois1.0001.0000.1110.1540.6920.7271.0000.6150.8330.7500.5560.9230.6800.8330.9440.7690.6000.4120.6670.5380.7500.6670.6670.769
Les trois mousquetaires1.0000.8460.0000.5500.4750.8401.0000.5250.9170.7870.5790.9250.5940.9091.0000.7500.3420.5560.6840.3750.6250.7670.7890.575
Les Vacances1.0000.7780.0000.3500.2500.8750.6670.7000.6670.9470.6670.9000.3331.0001.0000.7500.1540.9090.6670.5000.3331.0000.6670.650
Le Tour du Monde en 80 jours1.0001.0000.0000.3810.7550.6920.9740.4290.8400.6360.5530.6670.9250.8640.9740.9050.7840.5200.7630.6190.7730.7690.8950.476
Le voluptueux voyage1.0001.0000.0000.0830.6430.7000.8180.5831.0000.7500.1820.7500.8461.0001.0000.9170.3850.4550.4550.4170.6430.7140.8180.417
L’Iliade1.0000.9000.0000.2340.3330.9070.8570.6360.6250.8070.2380.8700.4040.8980.9050.6880.2620.7910.8100.4420.4380.7370.3330.364
L’Odyssée1.0000.8570.0000.2220.3510.9020.9290.6850.3080.7560.2860.5740.3420.9430.9290.6110.2860.7591.0000.4070.4440.7690.2860.370
Madame Bovary1.0001.0000.0000.4810.6000.9170.8180.8150.5000.9130.5450.7780.7861.0001.0000.9260.4210.8000.7270.5930.2610.7500.5450.333
Les Mystères de Paris1.0000.9220.1180.5730.2270.9031.0000.3410.8570.8980.7060.9630.3860.9331.0000.6830.1830.7610.6470.4270.4000.8930.7060.610
Nana1.0001.0000.1430.2810.2330.9151.0000.6720.3750.9020.4290.8590.3330.9451.0000.8120.1000.7410.7140.3120.2170.8700.7140.625
Notre-Dame de Paris1.0000.7140.1670.3660.5080.9001.0000.4390.8120.7230.4330.8290.6170.9350.9670.7070.6510.8330.9330.7320.7410.7560.6670.756
Récits d’un soldat1.0001.0000.0000.0000.8330.5000.9090.5001.0000.2500.4550.5000.9171.0001.0001.0000.7500.1670.5450.5000.8570.4000.5451.000
Tarass Boulba1.0001.0000.0000.3330.3640.8890.8000.8891.0000.6430.4001.0000.7140.6921.0001.0000.4440.7270.8000.8890.8000.5000.8000.444
Therèse Raquin1.0000.6670.3330.2220.8750.8000.7780.8891.0000.7500.5561.0001.0000.7500.6671.0000.8890.8890.8890.8890.8000.6670.4440.667

Average0.9830.9080.0880.3890.4750.8580.9360.6150.7470.7960.4800.8160.6000.9090.9500.8050.3800.6940.7530.4930.5260.7860.6800.567

Overall precision, recall, and F1 scores.

Regarding the meta-classification schemes (Table 4), we first wanted to compare them to our the OpeNER baseline. On our test corpus and using similar evaluation OpeNER averaged a precision and recall of 0.609 and 0.754, respectively. This may seem surprisingly low compared to the standards usually set by this tool, but actually is a good illustration of how difficult it may get to find the correct tagging in fictional texts. The two worst cases (Les Malheurs de Sophie and Germinie Larcerteux) are shown in details in Tables A1 and A2 (Appendix A), and show that those problems are about as much related to bad classifications as missing entities. Our meta-classification methods perform on average better than that, and we can see an encouraging trend that the various strategies we tried out tended to get increasingly better results and to close the standard deviation gap. That being said, as far as SD is concerned and taken into account, this improvement was not statistically significative. Yet, the meta-recognition method managed a F1 score no worse than 0.674 and as high as 0.99 on one of the books, on which it basically found all entities and only misclassified one of them.

Table 4

OpeNER
Simple vote
Self-assessment
Fixed weights
Meta-recognition
PRF1PRF1PRF1PRF1PRF1
20000 Lieues sous les mers0.6960.8580.7690.7600.7760.7680.7320.7570.7440.7320.7570.7440.7320.7570.744
Au bonheur des dames0.5720.7950.6650.8620.9450.9020.8620.9450.9020.8620.9450.9020.8620.9450.902
Belle rose0.6450.6380.6410.8320.9360.8810.8320.9360.8810.8260.9210.8710.8920.9720.931
Bijou0.5690.7880.6610.4540.7660.5700.5080.8120.6250.5320.8280.6480.8720.8750.874
Blanche et Bleue0.6160.7140.6610.8330.9170.8730.7950.8850.8380.7950.8850.8380.8140.9060.858
Boule de Suif0.5240.8570.6500.5430.8080.6490.6730.9230.7790.5760.8460.6850.5760.8460.685
Candide0.8950.7780.8320.9330.9170.9250.9330.9170.9250.9330.9170.9250.9200.9200.920
Colomba0.6320.8640.7300.7390.8890.8070.7690.9170.8360.7690.9170.8360.7690.9170.836
La Comedie Humaine0.5450.7210.6210.7370.8890.8060.7450.9080.8190.7450.9080.8190.7460.8440.792
Consuelo0.6790.7870.7290.7910.8960.8400.7910.8960.8400.7910.8960.8400.8200.9010.859
David Copperfield0.6380.8770.7390.7940.9570.8680.8330.9710.8970.8330.9710.8970.8850.9860.932
Don Quichotte0.4770.7820.5930.5660.7440.6430.5660.7440.6430.5660.7440.6430.5980.7720.674
Germaine0.6100.6940.6490.7210.7840.7510.7210.7840.7510.7210.7840.7510.7880.8220.805
Germinal0.6830.6920.6870.7940.8500.8210.8120.8780.8440.8120.8780.8440.8120.8780.844
Germinie Lacerteux0.3210.6000.4180.4381.0000.6090.5281.0000.6910.5241.0000.6880.8571.0000.923
Jane Eyre0.6190.8910.7310.8750.9770.9230.9090.9890.9470.9090.9890.9470.9501.0000.974
La Bête Humaine0.6830.7110.6970.9340.9790.9560.9340.9790.9560.9340.9790.9560.9801.0000.990
La Femme du mort0.7130.8220.7640.5870.9390.7230.6700.9700.7930.6820.9700.8010.6690.9200.775
L’Assommoir0.4620.6230.5310.8160.9240.8670.8160.9240.8670.8160.9240.8670.8140.8890.850
Le Mort vivant0.7180.8720.7880.7660.8580.8090.8160.8540.8350.8160.8540.8350.8560.8720.864
Le Parfum de la dame en noir0.8110.8780.8430.8850.9190.9020.8640.8990.8820.8640.8990.8820.8640.8990.882
Les Malheurs de Sophie0.2670.5710.3640.7710.9380.8460.9441.0000.9710.9441.0000.9710.9441.0000.971
Les Miserables0.6930.8000.7430.8210.9000.8590.8310.9080.8680.8660.9500.9060.8560.9330.893
Les Tribulations d’un Chinois0.5690.6540.6090.7210.8460.7790.7760.9230.8430.7760.9230.8430.7760.9230.843
Les trois mousquetaires0.5100.7580.6100.7450.8750.8050.7730.9000.8320.7730.9000.8320.8280.9240.873
Les Vacances0.4300.7110.5360.7740.9500.8530.7740.9500.8530.7740.9500.8530.8500.9750.908
Le Tour du Monde en 80 jours0.8020.9030.8500.8000.8200.8100.8450.8920.8680.8450.8920.8680.8450.8920.868
Le voluptueux voyage0.6460.6250.6350.8010.8710.8350.8170.8750.8450.8170.8750.8450.8670.8750.871
L’Iliade0.4940.5830.5350.6160.7620.6810.6530.8010.7200.6530.8010.7200.6940.8330.757
L’Odyssée0.6040.6310.6170.6120.8070.6960.5970.7620.6690.5970.7620.6690.6220.7900.696
Madame Bovary0.4710.6670.5520.8380.8720.8550.8980.9360.9170.8980.9360.9170.9360.9360.936
Les Mystères de Paris0.5420.8000.6460.7030.8790.7810.7300.8910.8030.7820.9160.8440.8850.9460.915
Nana0.5240.7230.6080.6760.9290.7820.6650.9210.7720.6930.8900.7790.6670.8270.738
Notre-Dame de Paris0.7210.7720.7460.7980.8900.8420.7980.8900.8420.7980.8900.8420.7960.8930.842
Récits d’un soldat0.9231.0000.9600.9581.0000.9790.9581.0000.9790.9581.0000.9790.9581.0000.979
Tarass Boulba0.5710.7780.6590.6871.0000.8140.7381.0000.8490.7381.0000.8490.7381.0000.849
Therèse Raquin0.6750.6670.6710.9500.9440.9470.9500.9440.9470.9500.9440.9470.9500.9440.947

Average0.6080.7560.6690.7490.8890.8090.7750.9040.8310.7760.9030.8310.8150.9070.857
SD0.1370.1030.1160.1270.0710.0960.1150.0690.0880.1170.0700.0900.1030.0670.082

Averaged precision, recall, and F1 scores, for each classifier.

Lowest and highest scores are higlighted in bold.

6 Conclusion and Future Works

The contribution of this paper is to establish a set of efficient and autonomous tools that can be run in a limited environment (such as a web server), on any French novel without the need of training or manual user input, yet keeping reliable results. We showed that combining different classifiers, especially using a meta-recognition technique, allowed to attain an overall better score than each of them would separately, and to outperform some state-of-the-art tools in the very narrow considered use case. Yet, we are aware this process implies several hard assumptions that may break under certain circumstances and cause the system to fail, like the systematic extraction limited to capitalized proper nouns.

Future works may focus on coreference resolution by merging entities that refer to the same character, or conversely, disambiguation of homonyms, even if this does not happen very often in closed environments like fictional works. In parallel to a reliable extraction of the characters networks, further textual analytics may try to uncover the nature of the relations between them.

Statements

Author contributions

CB initiated this research, wrote the software components, and elaborated the test sets and methods described in this report. FK supervised this research and advised on the necessary tests and developments needed to improve its relevance.

Acknowledgments

This research was made possible by a very precious and close collaboration with Daniel de Roulet, Swiss architect, computer scientist, and author. It was supported by the Fondation Jan Michalski pour l’Ecriture et la Littérature.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Footnotes

1.^http://wiki.dbpedia.org.

2.^http://www.wikidata.org.

3.^http://www.mpi-inf.mpg.de/departments/databases-and-information-systems/research/yago-naga/yago.

4.^Made available at https://github.com/dhlab-epfl/3n-tools/.

5.^Project Gutenberg | Browse By Language: French—http://www.gutenberg.org/browse/languages/fr.

6.^The OpeNER project—http://www.opener-project.eu.

7.^The JVM heap size limit set by JRuby needed to be increased to over 8 GB in order to process our bigger samples (yet only about 6 MB in size).

8.^Since nouns can be in any language despite the text being in French, we considered the list of nobiliary particles found on the related English Wikipedia page: http://en.wikipedia.org/wiki/Nobiliary_particle.

9.^Wikipédia: Titres et prédicats—http://fr.wikipedia.org/wiki/Titres_et_prédicats.

10.^http://www.synonymo.fr.

11.^http://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger.

12.^WordNet | A lexical database for English—https://wordnet.princeton.edu/.

References

  • 1

    AlbaneseA. (2011). Small demons makes big splash at Frankfurt book fair. Publishers Weekly258.43: 68.

  • 2

    AzpeitiaA.CuadrosM.GainesS.RigauG. (2014). NERC-fr: supervised named entity recognition for French. In Text, Speech and Dialogue, Vol. 8655, Edited by SojkaP.HorákA.KopečekI.PalaK., 158165. Cham: Springer International Publishing.

  • 3

    DumaisS.T. (2005). Latent semantic analysis. Annual Review of Information Science and Technology38: 188230.10.1002/aris.1440380105

  • 4

    FleissJ.L.CohenJ. (1973). The equivalence of weighted kappa and the intraclass correlation coefficient as measures of reliability. Educational and Psychological Measurement33: 6139.10.1177/001316447303300309

  • 5

    GenoM.G. (1992). A la française: Correct French for English Speakers. London, New York: University Press of America.

  • 6

    GlassK.BangayS. (2007). A Naîve, Salience-Based Method for Speaker Identification in Fiction Books. Durban, South Africa: PRASA. 16.

  • 7

    GohH.-N.SoonL.-K.HawS.-C. (2012). Automatic identification of protagonist in fairy tales using verb. In Advances in Knowledge Discovery and Data Mining, Vol. 7302, Edited by HutchisonD.KanadeT.KittlerJ.KleinbergJ.M.MatternF.MitchellJ.C.NaorM.NierstraszO.Pandu RanganC.SteffenB.SudanM.TerzopoulosD.TygarD.VardiM.Y.WeikumG.TanP.-N.ChawlaS.HoC.K.BaileyJ., 395406. Berlin, Heidelberg: Springer.

  • 8

    GrevisseM.Lenoble-PinsonM. (2009). Majuscules. In Le français Correct: Guide pratique des difficultés, 6th ed, 8892. Bruxelles: Duculot.

  • 9

    HirschbergJ.ManningC.D. (2015). Advances in natural language processing. Science349: 2616.10.1126/science.aaa8685

  • 10

    JovanovicJ.BagheriE.CuzzolaJ.GasevicD.JeremicZ.BashashR. (2014). Automated semantic tagging of textual content. IT Professional16: 3846.10.1109/MITP.2014.85

  • 11

    KarsdorpF.van KranenburgP.MederT.van den BoschA. (2012). Casting a Spell: Identification and Ranking of Actors in Folktales. Lisbon, Portugal: Edições Colibri. 3950.

  • 12

    MorettiF. (2013). Distant Reading. London, New York: Verso.

  • 13

    MosallamY.Abi-HaidarA.GanasciaJ.-G. (2014). Unsupervised named entity recognition and disambiguation: an application to old French journals. In Industrial Conference on Data Mining, 1223. St. Petersburg: Springer.

  • 14

    ScheirerW.J.RochaA.MichealsR.J.BoultT.E. (2011). Meta-recognition: the theory and practice of recognition score analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence33: 168995.10.1109/TPAMI.2011.54

  • 15

    SiL.KanungoT.HuangX. (2005). Boosting performance of bio-entity recognition by combining results from multiple systems. In Proceedings of the 5th international workshop on Bioinformatics, BIOKDD ‘057683. Chicago: ACM.

  • 16

    StamatatosE. (2009). A survey of modern authorship attribution methods. Journal of the American Society for Information Science and Technology60: 53856.10.1002/asi.21001

Appendix

Appendix A

Table A1

EntityOpeNER resultReference
AmériquePlacePlace
AUBERTCharacterN/A
AubertPlaceN/A
Beau-MinonPlaceCharacter
CamillePlaceCharacter
DieuN/AOther
ÉlisabethPlaceCharacter
LAMBERTCharacterCharacter
LambertPlaceCharacter
LouiseN/ACharacter
M. de RéanN/ACharacter
MadeleinePlaceCharacter
Mme d’AubertN/ACharacter
Mme de RéanN/ACharacter
ParisPlacePlace
PaulCharacterCharacter
RéanPlaceN/A
SophiePlaceCharacter

Benchmarked OpeNER results on “Les Malheurs de Sophie.”

Table A2

EntityOpeNERReference
AdèleN/ACharacter
AfriqueN/APlace
DieuCharacterOther
GautruchePlaceCharacter
GerminiePlaceCharacter
JosephN/ACharacter
JupillonPlaceCharacter
M. de VarandeuilN/ACharacter
MllePlaceN/A
Mlle de VarandeuilN/ACharacter
Mme JupillonCharacterCharacter
MontmartreN/APlace
ParisPlacePlace
RévolutionN/AOther
SemproniePlaceCharacter
VarandeuilPlaceN/A

Benchmarked OpeNER results on “Germinie Lacerteux.”

Appendix B

B.1. Characters

titres monsieur, homme, mâle, mec, quidam, seigneur, sieur, madame, dame, mademoiselle, aimée, amie, amoureuse, bien-aimée, commandante, compagne, concubine, copine, dirigeante, dulcinée, employeuse, favorite, femme, fiancée, initiatrice, négrière, pédagogue, primordiale, reine, supérieure, prince, altesse, émir, archiduc, évêque, célébrité, cardinal, dauphin, diadoque, empereur, gloire, grand-duc, hospodar, infant, kronprinz, landgrave, maharadjah, maharajah, maharaja, margrave, monarque, monseigneur, principicule, rajah, rhingrave, roi, sire, souverain, autocrate, césar, chah, despote, dynaste, mogol, kaiser, khan, magnat, majesté, mikado, phénix, pharaon, ponte, potentat, raja, roitelet, sultan, trésor, abbé, capelan, corbeau, curé, curaillon, cureton, dignitaire, ecclésiastique, inquisiteur, inquisitrice, père, pontife, prédicant, prélat, prestolet, prieur, ratichon, révérend, religieux, pape, agréé, collecteur, décimateur, exacteur, fisc, impôt, baderne, belliqueux, caporal, colonel, conscrit, lieutenant, maréchal, martial, polémologique, rengagé, sergent, soldatesque, sous-officier, tactique, troupe, appelé, bachi-bouzouk, bidasse, biffin, bleusaille, bras, cavalier, champion, cipaye, combattant, conquérant, deuxième, pompe, drille, engagé, estafette, evzone, factionnaire, fantassin, franc-tireur, fusilier, garde-voie, goumier, grenadier, griveton, grognard, guérillero, guerrier, jalonneur, janissaire, légionnaire, méhariste, maquisard, mercenaire, morte-paye, palikare, pandour, papalin, partisan, patrouilleur, pertuisanier, pierrot, pionnier, pioupiou, piquier, planton, poilu, pourvoyeur, réquisitionnaire, résistant, reître, recrue, sentinelle, serviteur, soudard, soudrille, spahi, territorial, tirailleur, tommy, tourlourou, tringlot, troufion, troupier, vétéran, fils, aîné, élève, citoyen, descendant, disciple, fieu, fiston, fruit, géniture, garçon, garçonnet, gars, gosse, grand, jeune, jouvenceau, postérité, progéniture, race, rejeton, sang, fille, adolescente, bachelette, bambine, belle-fille, blondinette, bouchon, brebis, brunette, célibataire, catherinette, colombe, coureuse, courtisane, demoiselle, descendante, donzelle, fifille, fillasse, fillette, frangine, gamine, garçonne, gazille, gonzesse, héritière, hétaïre, jeunesse, jouvencelle, louloute, midinette, mignonne, nana, nénette, nonne, nymphette, petite, poulette, prostituée, quille, religieuse, rosière, soeur, soeurette, souris, suivante, tendron, trottin, typesse, bébé, baby, bambin, chiard, enfançon, enfant, gamin, môme, marmaille, marmot, mioche, mouflette, moutard, moutatchou, nourrisson, nouveau-né, petit, petit-salé, poupée, poupard, poupon, têtard, mari, époux, conjoint, épouse, bourgeoise, conjointe, déesse, doudou, ménagère, mariée, mousmé, m, mr, mme, mrs, miss parole accent, accoucher, écorcher, émouvoir, énoncer, apprendre, argot, articulation, articuler, avouer, babiller, bafouiller, bégayer, balbutier, baragouiner, bavarder, blaguer, bredouiller, calomnier, causer, chevroter, chuchoter, citer, communiquer, confabuler, conférer, consulter, converser, débattre, débit, débiter, déblatérer, déclamer, dégoiser, dénoncer, deviser, dialecte, dialoguer, diction, dire, discourir, discuter, disserter, enregistrer, entretenir, exprimer, fasciner, franc-parler, frapper, giberner, gueuler, haranguer, idiome, intervenir, jaboter, jacter, jargon, jargonner, jaser, jaspiner, jobelin, langage, langue, médire, marmonner, marmotter, mentionner, murmurer, nasiller, nommer, palabrer, parlementer, parloter, parole, pérorer, parler, patois, phonation, plaire, pratiquer, prononcer, prononciation, rabâcher, raconter, radoter, révéler, relater, retracer, rognonner, abandonner, adresser, épancher, confier, expliquer, style, usage, usance, verbe, vociférer, aboyer, acclamer, accuser, éclater, égosiller, annoncer, apostropher, appeler, attraper, avertir, bêler, beugler, brailler, braire, bramer, bruire, chahuter, chanter, clabauder, clamer, conspuer, couiner, craquer, criailler, croasser, feuler, gémir, glapir, grincer, grogner, gronder, héler, houper, hucher, hululer, hurler, implorer, interpeller, invectiver, invoquer, jurer, manifester, meugler, mugir, pépier, piailler, plaindre, pleurer, pousser, prévenir, prier, proclamer, proférer, protester, publier, réclamer, répandre, réprimander, retentir, rouspéter, rugir, écrier, époumoner, fâcher, récrier, sermonner, exclamer, signaler, indigner, supplier, tancer, tempêter, tonitruer, tonner, vagir pensée faire, convaincu, avis, persuadé, admettre, élaborer, étudier, aviser, évoquer, cogiter, combiner, concevoir, conjecturer, considérer, contempler, délibérer, devoir, envisager, espérer, estimer, examiner, faillir, attention, gamberger, idée, imaginer, juger, méditer, manquer, mûrir, ordonnancer, pensée, peser, présumer, prévoir, prendre, projeter, rêvasser, rêver, réfléchir, raisonner, rappeler, recueillir, regarder, repenser, rouler, saisir, scruter, concentrer, douter, figurer, sentir, préoccuper, proposer, représenter, souvenir, targuer, intéresser, occuper, songer, souhaiter, soupçonner, spéculer, supposer, trouver nutrition absorber, alimentation, attaquer, avaler, bâfrer, becqueter, bouffe, bouffer, boulotter, boustifailler, brichetonner, briffer, brouter, chipoter, claquer, collationner, consommer, consumer, corroder, croûte, croûter, croquer, croustiller, cuisine, débrider, décolorer, déguster, déjeuner, dîner, dépenser, déteindre, dévorer, dilapider, dissiper, engloutir, engouffrer, entamer, disparaître, faner, festoyer, fricasser, fricoter, friper, gaspiller, goûter, gober, gobichonner, godailler, goinfrer, grailler, grappiller, grignoter, gruger, gueuletonner, ingérer, ingurgiter, mâcher, mangeotter, mastiquer, mets, mettre, mordre, nourrir, nourriture, oublier, paître, pâture, picorer, pignocher, pitance, prodiguer, repas, ripailler, ronger, ruiner, alimenter, sauter, savourer, bourrer, décarêmer, gaver, goberger, gorger, lester, empiffrer, emplir, piffrer, régaler, rassasier, remplir, repaître, restaurer, sustenter, souper, sucer, tâter, tordre, tortiller, tortorer, toucher, transgresser émotions amour, ébullition, agité, agitation, agressif, agressivité, aigreur, algarade, animosité, éréthisme, atrabilaire, atrabile, bile, bilieux, bouffée, bourrasque, chagrin, colérique, courroucé, courroux, crise, déchaînement, effervescence, emportement, exaspéré, exaspération, excitation, explosion, fâcherie, foudre, fulminant, fureur, furie, furieux, grogne, haine, hargne, hargneux, humeur, impatience, impatient, indignation, irascibilité, ire, irrité, irritabilité, irritable, mécontentement, monté, péché, querelle, rage, rageur, rancoeur, représailles, rogne, scène, surexcitation, susceptibilité, tempête, vengeance, violence, accouplement, adoration, adultère, affect, affection, altruisme, amativité, amitié, amourette, ange, aphrodite, archerot, éros, association, attachement, attraction, aventure, babiole, badinage, bagatelle, béguin, baibatifolage, biquet, biquette, bluette, bouillonnement, bricole, caprice, chaleur, charité, coït, coeur, concubinage, concupiscence, conquête, copulation, coquetterie, cupidon, débauche, délicatesse, désir, dévotion, dévouement, dilection, engouement, entente, enthousiasme, estime, faible, fanatisme, fantaisie, ferveur, fièvre, flamme, fleurette, flirt, folie, fréquentation, fraternité, galanterie, goût, grâce, hyménée, hymen, idolâtrie, inceste, inclination, intérêt, intrigue, ivresse, lascivité, libertinage, luxure, maladie, mariage, marivaudage, mouvement, mysticisme, passade, passion, passionnette, penchant, philanthropie, piété, plaisir, pulsion, relation, rut, sens, sensibilité, sentiment, tendance, tendresse, toquade, touche, union, vénération, vénus, agrément, allégresse, amusement, épanouissement, ardeur, avantage, béatitude, bien-être, bienfait, bonheur, consolation, contentement, délice, douceur, enchantement, enjouement, entrain, euphorie, exaltation, extase, exultation, félicité, fierté, folichonnerie, gaieté, gaité, griserie, hilarité, jouissance, liesplaisir, régal, réjouissance, ravissement, rayonnement, rigolade, sourire, abandon, abattement, accablement, affliction, amertume, éplorement, épreuve, assombrissement, asthénie, bourdon, cafard, calamité, découragement, dépression, désabusement, désenchantement, désespérance, désespérer, désespoir, désolation, deuil, douleur, grisaille, laideur, lassitude, lypémanie, mal, malaise, maussaderie, mocheté, monotonie, navrement, neurasthénie, noir, noirceur, nostalgie, nuage, pauvreté, peine, platitude, sévérité, sombreur, souffrance, spleen, suspens, uniformité, affres, alarme, attente, crainte, désarroi, incertitude, inquiétude, peur, souci, transe, abattu, adjudicataire, égrotant, aliéné, alité, altéré, anéanti, anormal, atteint, avarié, cacochyme, chétif, cinglé, client, démoli, déprimé, dérangé, désaxé, détraqué, dingue, dolent, fada, fanatique, fatigué, fiévreux, fou, galeux, gâté, gâteux, grabataire, incommodé, indisposé, infirme, ladre, lépreuse, lépreux, maboul, maladif, malsain, mauvais, morbide, pâle, patient, patraque, perturbé, piqué, rachitique, raide, révolutionné, scrofuleux, secoué, sonné, souffrant, souffreteux, tifosi, timbré, toqué, traumatisé, valétudinaire, abhorrer, abominer, éructer, évacuer, chasser, cracher, débecter, dégobiller, dégorger, dégueuler, détester, exécrer, expectorer, expulser, fulminer, gerber, haïr, honnir, huer, lancer, mépriser, maudire, régurgiter, répudier, rejeter, rendre, gorge, restituer, siffler, souffler, apeurer, cajoler, déconcerter, désemparer, effaroucher, effrayer, embarrasser, gêner, geler, glacer, impressionner, inhiber, inquiéter, menacer, paralyser, terroriser, ébauché, angoissé, approximatif, capon, complexé, confus, craintif, démerdard, discret, douteux, effarouché, effarouchable, effrayé, embarrassé, farouche, flottant, flou, frileux, froussard, fumeux, gêné, gauche, hésitant, honteux, humble, imperceptible, incertain, indécis, indéfini, indéterminé, indistinct, intimidé, lâche, modeste, nébuleux, nuageux, obscur, peureux, poltron, pudibond, pusillanime, réservé, rougissant, sauvage, subtil, timoré, transi, vague, émoi, appréhensif, appréhension, circonspection, confusion, effacement, effarouchement, gêne, gaucherie, hésitation, honte, humilité, indécision, modestie, prémonition, prudence, pudeur, réserve, trac, ébranlé, affecté, affolé, alarmé, éméché, émotionné, éperdu, apitoyé, éploré, attendri, attristé, blessé, bondissant, bouleversé, bouleversant, déchiré, déchirant, empoigné, enflammé, excité, frappé, impressionné, inquiété, intéressé, ivre, pantelant, pathétique, remué, retourné, saisi, suffoqué, surexcité, touché, touchant, tremblant, écoeurement, affadissement, allergie, éloignement, accident, affligé, aigre, élégiaque, assombri, bidoche, bougon, boulet, bourru, cafardeux, chagriné, colère, coléreux, consterné, consternation, content, contrarié, contrariété, contrit, couenne, cuir, déboire, déception, déchirement, dégoût, dépit, dépité, déplaisir, désagrément, désappointement, désolé, détresse, douloureux, ennui, ennuyé, gémissant, grimaud, grincheux, grognon, hypocondriaque, hypocondrie, inconsolable, inquiet, larme, larmoyant, lugubre, mélancolie, malheur, maussade, misanthrope, misère, morne, morose, morosité, mortifié, navré, neurasthénique, pataquès, peau, peiné, pelage, pisse-vinaigre, plaintif, pleurard, pleureur, rabat-joie, rechigné, regret, revêche, sinistre, sombre, tégument, tourment, tracasserie, tribulation, triste, tristesse, anorexie, antipathie, aversion, satiété, blasement, démoralisation, déplaisance, exécration, haut-le-coeur, horreur, humiliation, inappétence, indigestion, mépris, mortification, nausée, répugnance, répulsion, ressentiment, affolement, affres, alerte, émotion, angoisse, anxiété, cauchemar, couardise, dédain, effroi, foire, frayeur, frisson, frousse, intimidation, lâcheté, malepeur, panique, pétoche, phobie, pleutrerie, poltronnerie, pusillanimité, saisissement, souleur, spectre, stupéfaction, terreur, timidité, trouille, venette, vertige, vigilance, préoccupation, ébahissement, abasourdissement, admiration, ahurissement, émerveillement, épatement, épouvante, étonnure, effarement, impression, pétrification, scandale, stupeur, surprise foi assurance, autorité, évangile, catéchisme, certitude, confession, confiance, conviction, créance, credo, croyance, déloyauté, dogme, droiture, espérance, exactitude, fausseté, fidélité, fourberie, franchise, hommage, honnêteté, honneur, loyauté, mystique, opinion, perfidie, promesse, religion, serment, sincérité, spiritualité, témoignage, vérité tromper éblouir, abuser, échauder, égarer, amuser, apaiser, arnaquer, étriller, éviter, bafouer, baiser, balancer, bercer, berner, blouser, bluffer, capter, carotter, circonvenir, cocufier, coiffer, confondre, consoler, couillonner, croire, décevoir, déguiser, déjouer, dépiter, désappointer, dissimuler, duper, embabouiner, emberlificoter, embobeliner, embobiner, empaumer, empiler, encorner, endormir, enfiler, engeigner, enjôler, enquinauder, entôler, entortiller, entourlouper, entuber, envelopper, errer, escroquer, estamper, exploiter, cocu, cornard, marcher, miroiter, feindre, feinter, ficher, flatter, flouer, foutre, frauder, frustrer, gonfler, gourer, imposer, induire, jobarder, jouer, leurrer, méprendre, mentir, moquer, mystifier, pigeonner, piper, promener, refaire, renarder, repasser, resquiller, ruser, séduire, surprendre, trahir, tricher, truquer mort tuer, abattre, achever, échiner, écraser, égorger, allonger, anéantir, épuiser, éreinter, assassiner, assommer, étendre, étouffer, étrangler, étriper, bousiller, buter, casser, chagriner, chouriner, crever, décapiter, décimer, dégeler, dégringoler, démolir, dépêcher, détruire, descendre, empoisonner, emporter, estourbir, exécuter, excéder, exténuer, exterminer, cesser, mourir, fatiguer, faucher, flinguer, foudroyer, fusiller, gâcher, guillotiner, harasser, immoler, juguler, lapider, lasser, liquider, lyncher, massacrer, meurtrir, miner, moissonner, navrer, nettoyer, noyer, occire, peiner, pendre, percer, poignarder, pourfendre, ratiboiser, refroidir, sacrifier, saigner, défaire, servir, stériliser, suicider, supplicier, supprimer, suriner, trucider, user, vanner, mort, écroulement, agonie, anéantissement, ankylosé, apathique, épuisé, éreinté, assassinat, éteint, évanoui, brisé, cadavre, camarde, chute, claqué, condamné, corps, crève, crevé, croupissant, décédé, décès, décomposition, défunt, délavé, dépouille, désert, détruit, cujus, destruction, disparition, disparu, dormant, effondrement, enterrement, enveloppe, esquinté, euthanasie, exécution, excédé, exténué, extinction, fade, fantôme, feu, fichu, figé, fin, fini, flapi, foutu, voyage, harassé, immobile, inanimé, inerte, inhabité, insensible, intimider, irrécupérable, languide, lessivé, macchab, macchabée, malemort, mânes, meurtre, mourant, néant, nécrosé, nuit, éternelle, ombre, parque, passé, perdu, perte, plat, plongeon, rétamé, recru, rendu, repos, éternel, restes, rompu, ruine, silencieux, sommeil, somnolent, stagnant, supplice, tombe, tombeau, torture, tranquille, trépas, trépassé, tué, usé, victime, vidé guérir échapper, adoucir, mieux, améliorer, calmer, châtier, cicatriser, corriger, couper, débarrasser, délivrer, désintoxiquer, estomper, tomber, pallier, réchapper, rambiner, relever, remédier, remettre, renaître, ressusciter, revivre, sauver, fermer, rétablir, soigner, soulager, traiter dormir coucher, dormir, dormailler, lambiner, lanterner, négliger, pioncer, ronfler, roupiller, sommeiller, somnoler salutations bonjour, adieu, bienvenue, bonsoir, salamalec, salut, ciao, bye, salutations vouloir accepter, accorder, acquiescer, affirmer, aimer, alléguer, ambitionner, appéter, aspirer, attendre, envie, briguer, commander, concéder, consentir, convoiter, craindre, décider, défendre, daigner, désirer, demander, disposition, entendre, envier, escompter, essayer, exiger, forme, gouverner, guigner, intention, interdire, lorgner, loucher, manière, nécessiter, objectif, ordonner, ordre, oser, permettre, pouvoir, prétendre, prescrire, résolution, recevoir, requérir, revendiquer, s’efforcer, s’essayer, soif, solliciter, soupirer, subir, tenir, tolérer, viser, vouloir soupir, sanglot soupir, plainte, sanglot, silence, remord, cri, gémissement, geignement, hoquet, jérémiade, larmoiement, pleur, pleurnichement, spasme raison allégation, équité, argument, argumentation, base, but, cause, cerveau, cervelle, compréhension, conception, connaissance, conscience, considérant, considération, couleur, dédommagement, démonstration, discernement, droit, entendement, esprit, excuse, explication, facultés, fondement, indice, intellect, intelligence, judiciaire, jugeote, justesse, justice, justification, lieu, logique, lucidité, lumière, méthode, mobile, modération, motif, occasion, office, origine, philosophie, pondération, pourquoi, prétexte, preuve, principe, principes, probabilité, probité, quoi, réfutation, raison, raisonnement, rapport, rationnel, rectitude, sagesse, satisfaction, tête, tact métiers abatteur, aboyeur, accessoiriste, accisien, acconier, accordéoniste, accordeur, accordeuse, accoucheur, accouveur, accouveuse, accréditeur, accréditeuse, accréditrice, accrocheur, acériculteur, acéricultrice, acheteur, acheteuse, aciériste, aconier, acousticienne, acousticien, acrobate, acteur, actrice, actuaire, acuponcteur, acuponctrice, acupuncteur, acupunctrice, adaptateur, adaptatrice, adjudicateur, adjudicatrice, administrateur, administratrice, aède, aérodynamicien, aéronaute, aérostier, afficheur, affichiste, affileur, affineur, affineuse, affranchisseur, affréteur, affuteur, affuteuse, agencier, agencière, agioteur, agréeur, agréeuse, agriculteur, agricultrice, agrobiologiste, agronome, aiguilleur, aiguillier, aiguiseur, aiguiseuse, ajusteur, ajusteuse, albatrier, alcade, alchimiste, alcoolier, alcoologue, alderman, alem, aleseur, aléseuse, alfatier, algébriste, algologue, alguazil, aliéniste, alisier, allergologiste, allergologue, allopathe, allumettier, allumettière, alphabétiseur, alphabétiseuse, alpiniste, altiste, amareyeur, amareyeuse, ambassadeur, ambassadrice, ambulancier, ambulancière, aménagiste, amidonnier, amidonnière, amodiateur, amphibie, amuseur, amuseuse, analyste, anatomiste, andrologue, anesthésiste, ânier, ânière, animalier, animalière, animateur, animatrice, annaliste, annonceur, annonceuse, annoncière, annoncier, annotateur, annotatrice, antenniste, anthropologue, antiquaire, aouteron, apiculteur, apicultrice, apiéceur, apiéceuse, aplatisseur, apothicaire, appareilleur, appareilleuse, appariteur, apponteur, apprentie, apprenti, apprêteur, apprêteuse, aquaculteur, aquacultrice, aquafortiste, aquarelliste, aquatintiste, aquiculteur, aquicultrice, arbalétrier, arbitragiste, arbitre, arboriculteur, arboricultrice, archéologue, archer, archetière, archetier, archiâtre, architecte, archiviste, ardoisier, ardoisière, argenteur, argenteuse, argentier, argotiste, arithméticien, armaillis, armateur, armoriste, armurier, aromaticienne, aromaticien, arpenteur, arpenteuse, arpète, arpette, arquebusier, arracheur, arrangeur, arrangeuse, arrêtiste, arrimeur, arrondisseur, arroseur, arroseuse, artificier, artilleur, assainisseur, assemblagiste, assesseure, assesseur, assistant, assistante, assureur, assyriologue, astaciculteur, astacicultrice, astrologue, astronaute, astronauticien, astronome, astrophysicien, atomiste, attorney, aubergiste, audiencier, audit, auditeur, auditrice, aumônier, aurige, auriste, auteur, auteure, autocariste, automaticienne, automaticien, autoursier, autrice, avaleur, avaleuse, aventurier, aventurière, aviateur, aviatrice, aviculteur, avicultrice, avionneur, avitailleur, avocat, avocate, avoué, avouée, bachoteur, badigeonneur, baes, baesine, bagagiste, baigneur, baigneuse, baillie, bailli, baladin, balancier, balayeur, balayeuse, baleinier, baliseur, ballerine, bandagiste, banquier, banquière, barbier, barbière, barde, barmaid, barman, barmen, barragiste, barreur, barreuse, baryton, basculeur, basketteur, basketteuse, bassiste, bateleur, bateleuse, batelier, batelière, bâtisseur, bâtisseuse, bâtonnier, bâtonnière, batteur, batteuse, baudrier, bêcheur, bedeau, berger, bergère, bibliographe, bibliothécaire, bijoutier, bijoutière, billettiste, bimbelotière, bimbelotier, bineur, bioethicienne, bioethicien, biogéographe, biographe, biologiste, biométricienne, biométricien, biophysicienne, biophysicien, biscuitier, biscuitière, blanchisseur, blanchisseuse, bobineur, bobineuse, bobinière, bobinier, boiseur, boisselier, boitier, bombagiste, bombeur, bonne, bonnetier, bonnetière, bordier, bordière, botaniste, botteleur, botteleuse, bottier, bottière, boucanier, boucher, bouchère, boucheur, boucholeur, bouchonnier, bouchoteur, bouclier, boueur, bougnat, bouilleur, boulanger, boulangère, boulier, boulinier, bouquetier, bouquiniste, bourgmestre, bourreau, bourrelière, bourrelier, bourrier, boursier, boursière, bousilleur, bouteiller, bouteur, boutillier, boutonnier, boutonnière, bouvier, bouvière, boxeur, boxeuse, boyaudier, boyaudière, braconnier, brancardière, brancardier, brandevinière, brandevinier, brasseur, brasseuse, brillanteur, briqueteur, briquetier, brocanteur, brocanteuse, brocheur, brodeur, brodeuse, broker, bronzeur, bronzeuse, bronzière, bronzier, brossier, brossière, brouetteur, brouettier, broyeur, bruiteur, bruiteuse, brûleur, brunisseur, brunisseuse, buandier, buandière, bucheron, bucheronne, bucheur, buffetier, buffetière, bulbiculteur, bulbicultrice, buraliste, bureauticienne, bureauticien, burineur, buriniste, bustier, bustière, buvetier, cabaretière, cabaretier, cabinier, cabinotier, cableur, cableuse, cabliste, caboteur, cadreur, cadreuse, cafetier, cafetière, caid, caissier, caissière, calandreur, calandreuse, calfat, calicot, calier, calleur, calleuse, calligraphe, cambiste, cambreur, cambusier, camelot, cameraman, camerier, camerière, cameriste, camionneur, camionneuse, campaniste, canalisateur, canalisatrice, cancerologue, canetière, canissier, cannetière, canneur, canneuse, cannier, cannière, cannissier, canonnier, canotier, canotière, cantatrice, cantinier, cantinière, cantonnier, cantonnière, cantor, canuse, canut, capilliculteur, capitaine, carabinier, cardeur, cardeuse, cardiologue, caricaturiste, carillonneur, carillonneuse, cariste, carpettier, carreleur, carreleuse, carrier, carrossier, cartier, cartographe, cartonnier, cartonnière, cartons, cartooniste, cascadeur, cascadeuse, casernier, casquettier, casseur, casseuse, catcheur, catcheuse, catéchiste, caviste, cellerière, celliste, censeur, censeure, censière, censier, ceramiste, cerealier, chainetière, chainetier, chaineur, chaineuse, chainier, chainiste, chaisier, chaisière, chalcographe, chaloupier, chalutier, chambellan, chambrière, chambrier, chamelier, chamoiseur, chancelière, chandelier, changeur, changeuse, chansonnière, chansonnier, chanteur, chanteuse, chantre, chanvrière, chanvrier, chapelier, chapelière, charbonnier, charcutier, charcutière, charpentier, charretier, charron, chartrier, chasseur, chasublier, chasublière, chaudronnier, chaudronnière, chauffagiste, chauffeur, chaufournier, chaumier, chausseur, chaussonnier, chef, cheminot, cheminote, chemisier, chemisière, chercheur, chercheuse, chevillard, chevilleur, chevrière, chevrier, chiffonnière, chiffonnier, chiffreur, chiffreuse, chimiste, chiropracteur, chiropraticien, chirurgien, chirurgienne, chocolatier, chocolatière, chorégraphe, choriste, chromeur, chromiste, chroniqueur, cicérone, cigarettière, cigarettier, cigarier, cigarière, cimentier, cineaste, cireur, cireuse, cirier, cirière, ciseleur, ciseleuse, cithariste, clapman, clarinettiste, claveciniste, clavieriste, claviste, clerc, clicheur, clicheuse, climatologue, clinicienne, clinicien, cloutier, cloutière, clown, coache, coach, cocher, codeur, codeuse, codirecteur, codirectrice, coéditeur, coffreur, cogérante, cogérant, cogniticienne, coiffeur, coiffeuse, colleur, colleuse, coloriste, colporteur, coltineur, colzatier, comédien, comédienne, comique, commandant, commerçante, commerçant, commis, commise, commissaire, communarde, communard, communicateur, communicatrice, compagnon, compositeur, compositrice, comptable, compteuse, concepteur, conceptrice, concierge, conducteur, conductrice, confectionneur, conférencière, conférencier, confiseur, confiseuse, confiturière, confiturier, connecticienne, connecticien, conseiller, conseillère, conservateur, conservatrice, conserveur, conserveuse, consignataire, constable, consul, consule, consultant, consultante, contactologue, conteur, conteuse, contrebassiste, contremaître, contrôleur, contrôleuse, convoyeur, convoyeuse, coolie, coordinateur, coordinatrice, coordonnateur, coordonnatrice, copilote, copiste, coprésidente, coprésident, coqueleuse, coquetière, coquetier, corailleur, corailleuse, cordeur, cordeuse, cordier, cordière, cordonnier, cordonnière, cornac, cornemuseur, cornettiste, corneur, corneuse, corniste, coroner, correcteur, correctrice, correspondante, correspondant, corroyeur, corsaire, corsetier, corsetière, coryphée, cosmétologue, cosmologiste, cosmologue, cosmonaute, costumier, costumière, coteur, cotonnier, cotonnière, coupeur, coupeuse, courriériste, courrier, coursier, coursière, courtier, courtière, couseur, couseuse, coutelier, coutelière, couturier, couturière, couvreur, crassier, créancière, créancier, créateur, créatif, créatrice, crémier, crémière, créoliste, crêpier, crêpière, crieur, crieuse, criminologue, crinier, cristallière, crocheteur, croupier, croupière, cryptographe, cueilleur, cueilleuse, cuiseur, cuisinier, cuisinière, cuisiniste, culottière, culottier, cultivateur, cultivatrice, curateur, curatrice, cytologiste, dactylographe, dalleur, damasquineur, dameur, danseur, danseuse, débardeur, débardeuse, débatteur, débiteur, décapeur, décatisseur, décatisseuse, déchiqueteur, déchiqueteuse, décideur, décideuse, déclamateur, déclamatrice, décolleteur, décolleteuse, décorateur, décoratrice, découpeur, découpeuse, découvreur, découvreuse, décrotteur, défenseur, défibreur, défibreuse, défricheur, défricheuse, dégraisseur, dégraisseuse, dégustateur, dégustatrice, démarcheur, démarcheuse, déménageur, demandeuse, démineur, démographe, démolisseur, démonstrateur, démonstratrice, dendrologue, dentellière, dentellier, dentiste, déontologue, dépanneur, dépeceur, dépeceuse, dépollueur, dépoussiéreur, dermatologue, dermato, designer, dessinateur, dessinatrice, déstockeur, détacheur, detacheuse, détaillant, détaillante, detective, développeur, développeuse, devideur, diablotin, diagnosticien, diagnostiqueur, dialectologue, dialoguiste, diamantaire, dictionnariste, diéséliste, diététicienne, diététicien, dinandier, dinandière, dindonnière, dindonnier, diplomate, directeur, directrice, discothécaire, discounter, discounteur, dispatcher, disquaire, distillateur, distillatrice, divette, docimologue, docker, docteur, doctoresse, documentaliste, documentariste, domestique, dominotière, dominotier, dompteur, dompteuse, donneur, doreur, doreuse, douanier, douanière, doubleuse, doublure, doucheur, doucheuse, dramaturge, drapier, drapière, draveur, draveuse, dresseur, dresseuse, droguiste, duègne, duumvir, dynamiteur, dynamiteuse, ébarbeur, ébaucheur, ébéniste, éboueur, éboueuse, écailleur, écailleuse, écangueur, écangueuse, échanson, échenilleur, échevin, échotier, échotière, éclairagiste, éclaireur, éclaireuse, éclateur, éclusier, éclusière, écogarde, écologue, économètre, économétricien, économiste, écorceur, écorcheur, écorcheuse, écrivain, écrivaine, écureur, écuyer, écuyère, édile, éditeur, éditorialiste, éditrice, éducateur, éducatrice, effileur, effilocheur, effilocheuse, égoutier, égratigneur, élagueur, élagueuse, électricien, électricienne, électronicien, éleveur, éleveuse, émailleur, émailleuse, emballeur, embaumeur, embaumeuse, embouteilleur, embouteilleuse, embryologiste, embryologue, émondeur, émondeuse, émotteur, émotteuse, émouleur, émouleuse, empailleur, empailleuse, empaqueteur, empaqueteuse, empileur, empileuse, employé, employée, encadreur, encadreuse, encaisseur, encaveur, encaveuse, encodeur, encodeuse, encolleur, encolleuse, endocrinologue, endosseur, enfileur, enfileuse, enfourneur, engraisseur, enlumineur, enlumineuse, enquêteur, enquêtrice, enrouleur, ensacheur, ensacheuse, enseignant, enseignante, ensemblier, entraineur, entraineuse, entrepreneur, envoyeur, épaviste, épicier, épicière, épigraphiste, épileur, épileuse, épinceteur, épinceteuse, épinceur, épinceuse, épinglière, épinglier, épistolier, éplucheur, éplucheuse, équilibriste, équipier, équipière, ergonome, ergotherapeute, espion, espionne, essayeur, essayeuse, essayiste, estafier, estampeur, estampeuse, esthéticienne, esthéticien, estimateur, étalagiste, étalière, étalier, étalonneur, étalonneuse, étameur, étameuse, étampeur, étampeuse, ethnobotaniste, ethnographe, ethnologue, ethologue, étiqueteur, étiqueteuse, étireur, étireuse, expéditeur, expéditrice, expert, experte, fabuliste, façadier, facilitateur, facilitatrice, façonneur, façonnier, façonnière, facteur, factrice, fagoteur, fagoteuse, fagotier, fagotière, faïencier, faïencière, faneur, faneuse, farinier, farinière, faucheur, faucheuse, fauconnier, fendeur, fendeuse, ferblantier, fermier, fermière, ferrailleur, ferrailleuse, ferreur, ferronnier, feudiste, feuillagiste, ficelier, figurant, figurante, figuriniste, figuriste, filandier, filandière, filateur, fildeferiste, fileur, fileuse, filmeur, filmeuse, financier, finisseur, finisseuse, fiscaliste, fleuriste, floriculteur, floricultrice, flotteur, flutiste, fonctionnaire, fondeur, fondeuse, fontainier, footballeur, forain, foraine, forceur, forestier, foreur, foreuse, forfaitiste, forgeron, formateur, formatrice, fossoyeur, fossoyeuse, fouacier, foudrier, fouleur, foulon, foulonnier, fourbisseur, fournier, fourreur, fourreuse, fraiseur, fraiseuse, frappeur, frappeuse, freineur, fresquiste, freteur, frigoriste, fripier, fripière, friteuse, friturier, fromager, fromagère, fructiculteur, fruitière, fruitier, fumiste, funambule, gabarier, gabarrier, gabier, gâcheur, gâcheuse, gagiste, gainier, gainière, galeniste, galeriste, galibot, galonnier, galonnière, gambiste, gantier, gantière, garagiste, garanceur, garanceuse, garde, gardeur, gardeuse, gardien, gardienne, gargotière, gargotier, garnisseur, garnisseuse, gaucho, gaufreur, gaufreuse, gaveur, gaveuse, gazetier, gazetière, gazier, gazière, geisha, gemmeur, gemmeuse, gemmologue, gendarme, généalogiste, généraliste, généticienne, généticien, genévrier, géodésienne, géodésien, géographe, geôlier, geôlière, géologue, géomètre, géophysicienne, géophysicien, gériatre, gérontologue, gestionnaire, giletier, giletière, gitologue, glaceur, glaceuse, glacier, glacière, glaciologue, glaneuse, gobeletière, gobeletier, goémonier, goémonière, golfeur, golfeuse, gondolier, gondolière, gonfalonier, gonfanonier, goudronneur, goudronneuse, goudronnier, gouteur, gouteuse, gouvernante, grainetier, grainetière, graineur, grainier, grainière, graisseur, grammairienne, grammairien, grammatiste, granitier, graphiste, graphologue, gratteur, gratteuse, gravatier, graveur, graveuse, gravier, gréeur, greffeur, greffeuse, greffier, greffière, greneur, grenier, grignoteur, grillageur, groom, grossiste, groupeur, groupeuse, grutier, grutière, guetteur, guichetière, guichetier, guide, guillotineur, guitariste, gymnaste, gynécologue, gypsier, habilleur, habilleuse, hagiographe, haleur, haleuse, handicapeur, harengère, harmoniciste, harmoniste, harnacheur, harpiste, haubergeon, haveur, heaumier, héliciculteur, hélicicultrice, hélicier, héliograveur, héliograveuse, hépatologue, héraut, herbager, herboriste, hercheur, herscheur, hippologue, historien, historienne, hockeyeur, hockeyeuse, hongreur, hongroyeur, horloger, horlogère, horticulteur, horticultrice, hortillon, hote, hotelier, hotelière, hotesse, hotteur, hotteuse, houppier, huchet, huchier, huilier, huissier, huissière, humoriste, hybrideur, hybrideuse, hypnologue, hypnotiseur, hypnotiseuse, ichtyologiste, ichtyologue, iconographe, iconologiste, iconologue, illusionniste, illustrateur, illustratrice, ilotier, ilotière, imageur, imagier, imagière, imitateur, imitatrice, imposeur, impresario, imprimeur, imprimeuse, incrusteur, incrusteuse, indexeur, indexeuse, indianiste, industrielle, industriel, infectiologue, infirmier, infirmière, infographiste, informaticien, ingénieriste, ingénieur, inséminateur, inspecteur, inspectrice, installateur, installatrice, instituteur, institutrice, instructeur, instructrice, instrumentiste, intendante, intendant, intermittent, interprète, introducteur, ivoirier, ivoirière, jardinier, jardinière, jardiniste, jaugeur, joaillière, joaillier, jockey, jongleur, jongleuse, journalier, journaliste, juge, junior, jupier, jupière, jurat, juré, juriste, kinesiste, kinésithérapeute, kiosquière, kiosquier, laborantin, laborantine, laboureur, laceur, laceuse, lâcheur, lad, laineur, laineuse, lainier, lainière, laitier, laitière, lamaneur, lamier, lamineur, lamineuse, lampier, lampiste, lancier, lanternier, lapicide, lapidaire, laquais, laqueur, laqueuse, latiniste, lavandière, laveur, laveuse, layetier, layeur, lecteur, lectrice, legionnaire, légiste, legumier, lessiveuse, lessivier, lettreur, lettreuse, leveur, leveuse, levurie, levurier, lexicographe, lexicologue, libraire, librettiste, licier, liégeur, liégeuse, lieur, lieuse, liftier, liftière, limeur, limeuse, limonadier, limonadière, linger, lingère, linguiste, linier, linière, linotypiste, liquoriste, liseur, liseuse, lisseur, lisseuse, lissier, lithographe, livreur, livreuse, logisticien, logisticienne, logographe, logopède, logothète, lotisseur, loueur, loueuse, louvetier, ludologue, ludothécaire, lunetier, lunetière, lunettier, lunettière, lustreur, lustreuse, luthier, luthière, luthiste, lutteur, lutteuse, machiniste, maçon, maçonne, magasinier, magasinière, magicien, magicienne, magistrat, magistrate, magnanarelle, magnanière, magnanier, magnarelle, magnétiseur, magnétiseuse, maieure, maieur, maieuticien, maître, maîtresse, malletière, malletier, malteur, maltotier, mammalogiste, manadier, manager, manageur, manageuse, manieur, manipulateur, manipulatrice, mannequin, manoeuvre, manoeuvrière, manouvrier, manucure, manufacturier, manufacturière, maquettiste, maquignon, maquilleur, maquilleuse, maraicher, maraichère, marathonienne, marathonien, marbreur, marbreuse, marbrier, marcaire, marchand, marchande, marchandiseur, marcheur, mareyeur, mareyeuse, margarinier, margeur, margeuse, margoulin, marieur, marieuse, marin, marinier, marinière, marionnettiste, marketeur, marketeuse, marneur, maroquinier, maroquinière, marqueteur, marqueur, marqueuse, marteleur, masseur, masseuse, massier, mastologue, matelassier, matelassière, matelot, mateur, mathématicien, matrone, mayeure, mayeur, mécanicien, mécanicienne, mécanographe, médailleur, médecin, médiateur, médiatrice, mégissier, mélangeur, mélangeuse, mélodiste, menager, menagère, ménagiste, ménestrel, ménétrier, menuisier, menuisière, mercaticien, mercaticienne, merchandiser, mercier, mercière, messager, mesureur, mesureuse, metallier, metallière, metallographe, metallurgiste, métayère, métayer, météorologiste, météorologue, métreur, métreuse, métrologiste, métrologue, meulier, meunier, meunière, militaire, mime, minerviste, mineur, miniaturiste, ministre, minotier, minotière, mireur, mireuse, miroitière, miroitier, missilier, mitron, mixeur, modèle, modeleur, modeleuse, modélisateur, modélisatrice, modéliste, modiste, moireur, moireuse, moissonneur, moissonneuse, moniteur, monitrice, monnayeur, monteur, monteuse, mortaiseur, morutier, morutière, mosaïste, motociste, motoriste, mouleur, mouleuse, moulineur, moulineuse, moulinier, moulinière, mouliste, moulurier, moulurière, mousse, moutardier, moutonnier, muezzin, mulassier, muletier, muletière, muséographe, musicien, musicienne, musicologue, mycologue, myrmecologue, mytiliculteur, mytilicultrice, nageur, nageuse, naisseur, nattier, nattière, naturaliste, nautonier, nautonière, navigateur, navigatrice, négociante, négociant, négociateur, neoniste, néphrologue, netsurfeur, netsurfeuse, nettoyeur, nettoyeuse, neurologue, nez, nielleur, nielleuse, nivologue, nocher, notaire, notateur, notatrice, noueur, nourrice, nourrisseur, novice, noyauteur, nuiteuse, nurse, nutritionniste, obstétricienne, obstétricien, océanaute, oculariste, oculiste, odontologiste, oenologue, officier, officière, oiseleur, oiseleuse, oiselier, oiselière, oleiculteur, oleicultrice, oliveur, oliveuse, ombudsman, omnipraticien, oncologiste, oncologue, onirologue, opérateur, opératrice, ophtalmologue, opticien, opticienne, optométriste, orchestrateur, orchestratrice, ordinaticienne, ordinaticien, orfèvre, organicien, organicienne, organier, organière, organisateur, organisatrice, organiste, orienteur, orienteuse, ornemaniste, ornithologue, orpailleur, orpailleuse, orthodontiste, orthopédiste, orthophoniste, orthoptiste, ostéopathe, ostéopraticien, ostréiculteur, ostréicultrice, otorhino, outilleur, outplacer, outplaceur, outplaceuse, ouvreur, ouvreuse, ouvrier, ouvrière, oxycoupeur, packager, packageur, pailler, pailleteur, pailleteuse, pailleur, pailleuse, palefrenière, palefrenier, paléographe, paléologue, paléontologue, palettiseur, palissonneur, paloteur, paludier, paludière, paludologue, palynologue, panetier, panseur, panseuse, pantomime, pantouflière, pantouflier, papetier, papetière, papillonneur, parachutiste, parasitologue, pareur, pareuse, parfumeur, parfumeuse, parolier, parolière, parqueteur, parqueur, parqueuse, parquier, parurier, parurière, passementier, passementière, passeur, pastelliste, pasteur, pasteure, pasteurisateur, pastilleur, pastilleuse, pastourelle, patineur, patineuse, pâtissier, pâtissière, pâtre, patron, patronne, patronnier, patronnière, pattière, pattier, paveur, paveuse, paysagiste, paysan, paysanne, péager, péagère, péagiste, peaussier, pêcheur, pêcheuse, pédiatre, pédicure, pédologue, pédopsychiatre, peigneur, peigneuse, peignier, peintre, pelletier, pelletière, pendulier, peon, pepinieriste, percepteur, perceptrice, perceur, perceuse, perchiste, perchman, perforateur, perforatrice, perliculteur, perlicultrice, perlier, perlière, perruquière, perruquier, personnage, peseur, pétrisseur, pétrisseuse, pétrographe, pétrolier, pétrolière, pharmacienne, pharmacien, philologue, phlébologue, phoniatre, photographe, photograveur, physicienne, physicien, physiologiste, physionomiste, phytogeographe, pianiste, picador, pigiste, pileur, pileuse, pilote, pipier, pipière, piqueur, piqueuse, piroguier, pisciculteur, pisteur, pisteuse, pistoleur, pizzaiolo, placeur, placeuse, placier, placière, plagiste, planétologue, planeur, planteur, planteuse, plaquiste, plasticien, plasticienne, plasturgiste, plâtrier, plieur, plieuse, plisseur, plisseuse, plombeur, plombier, plongeur, plongeuse, plumassière, plumassier, plumeur, plumeuse, pneumologue, pocheuse, podiatre, podologue, poêlier, poêlière, poète, poétesse, poinçonneur, poinçonneuse, pointeau, pointeur, pointeuse, poissonnier, poissonnière, policeman, policier, policière, polisseur, polisseuse, pompeur, pompier, pompière, pompiste, ponceur, ponceuse, pontier, pontière, pontonnier, pope, populiculteur, populicultrice, porcelainier, porcelainière, porchère, porcher, porion, portefaix, porteur, porteuse, portier, portière, portraitiste, poseur, poseuse, posticheur, posticheuse, postier, postière, potier, potière, praticien, praticienne, précepteur, préceptrice, préfet, préfète, préparateur, préparatrice, préposé, préposée, présentateur, présentatrice, président, présidente, presseur, presseuse, pressier, prêteur, prêtre, prévisionniste, prévôt, primeur, primeuriste, principal, principale, priseur, priseuse, privée, privé, procureur, procureure, producteur, productrice, professeur, professeure, profileur, programmateur, programmatrice, programmeur, programmeuse, projeteur, projeteuse, promoteur, promotrice, prospecteur, prospectrice, prote, prothésiste, prototypiste, proviseur, psy, psychanalyste, psychiatre, psychologue, publicitaire, puériculteur, puéricultrice, puisatier, pupitreur, pupitreuse, pyrotechnicien, pyrotechnicienne, qualiticien, qualiticienne, questeur, quincaillier, quincaillière, rabatteur, rabatteuse, rabbin, raboteur, raboteuse, raccommodeur, raccommodeuse, racleur, radariste, radio, radioastronome, radiologue, radionavigant, raffineur, raffineuse, ramasseur, ramasseuse, ramendeur, ramendeuse, rameur, ramoneur, rapsode, raseur, raseuse, rateleur, rateleuse, raucheur, ravaleur, ravaudeur, ravaudeuse, ravitailleur, rayonneur, réalisateur, réalisatrice, réassureur, rebouteuse, recenseur, recenseuse, réceptionnaire, réceptionniste, receveur, receveuse, recherchiste, récolteur, récolteuse, recors, recruteur, recruteuse, recteur, rectifieur, rectifieuse, rectrice, récupérateur, récupératrice, rédacteur, rédactrice, redresseur, redresseuse, régent, régente, régisseur, régisseuse, registraire, régleur, régleuse, regrattière, regrattier, régulateur, régulatrice, rejointoyeur, rejointoyeuse, relationniste, relecteur, relectrice, releveur, relieur, relieuse, remailleur, remailleuse, remisier, remisière, remmailleur, remmailleuse, remonteur, remouleur, remouleuse, rempailleur, rempailleuse, remplisseur, remplisseuse, rentoileur, rentoileuse, rentrayeur, rentrayeuse, renvideur, renvideuse, réparateur, réparatrice, repasseur, repasseuse, répétiteur, répétitrice, reporter, reporteur, reportrice, représentante, représentant, repriseuse, réserviste, résinière, résinier, responsable, restaurateur, restauratrice, rétameur, retordeur, retoucheur, retoucheuse, réviseur, réviseuse, rhabilleur, rhabilleuse, rhéteur, rhumatologue, rinceuse, riveur, riveuse, riziculteur, rizicultrice, robeuse, robinetier, roboticien, roboticienne, rocailleur, rôdeur, rogneur, rogneuse, romancier, romancière, romaniste, rosiériste, rôtisseur, rôtisseuse, rouennier, rouennière, rouleur, roulier, routeur, routeuse, routier, routière, rubanier, rubanière, rugbyman, sableur, sableuse, sabotier, sabotière, sabreur, sacristain, sagard, saigneur, saigneuse, saleur, saleuse, saliculteur, salicultrice, salinier, salinière, salmoniculteur, salonnier, salonnière, salpetrière, salpetrier, sandalier, sandalière, santonnier, santonnière, sapeur, sapiteur, sardinier, sardinière, sarodiste, sasseur, sasseuse, satineur, satineuse, saucier, saucière, saunier, saunière, saurisseur, saurisseuse, sauveteur, savetier, savetière, savonnier, savonnière, saxophoniste, sbire, scaphandrier, scénariste, scénographe, scieur, scieuse, scoliaste, scribe, scripte, sculpteur, secouriste, secrétaire, sellier, semeur, semeuse, séranceur, séranceuse, serriste, serrurier, serrurière, sertisseur, servant, servante, serveur, serveuse, sexologue, shampouineur, shampouineuse, sherif, sherpa, signaleur, sismologue, sitologue, sociologue, soignante, soignant, soigneur, soigneuse, solier, soliste, sommelier, sommelière, sondeur, sondeuse, sonneur, soudeur, soudeuse, souffleur, soufreur, soufreuse, souscripteur, souscriptrice, soutier, spadassin, spationaute, speakerine, speaker, spéléo, spéléologue, spéléonaute, stadier, stadière, staffeur, standardiste, stationnaire, statisticienne, statisticien, stéarinier, stenciliste, steno, sténodactylo, sténographe, sténotypiste, steward, stomatologiste, stomatologue, storiste, stripteaseur, stripteaseuse, stucateur, stylicien, stylicienne, styliste, sujet, superviseur, surveillant, surveillante, sylviculteur, sylvicultrice, tabaculteur, tabacultrice, tabletier, tabletière, tacheron, taillandier, tailleur, tailleuse, talonneur, tamiseur, tamisier, tamisière, tanneur, tanneuse, tapisseur, tapisseuse, tapissier, tapissière, taraudeur, taraudeuse, tatoueur, tatoueuse, taulier, taulière, taupier, taupière, tavernier, tavernière, taxateur, taxatrice, taxidermiste, taximan, taxiste, technicien, technicienne, teilleur, teilleuse, teinturière, teinturier, téléacteur, téléactrice, téléaste, téléconseiller, télégraphiste, télémétreur, télémétreuse, téléopérateur, téléphoniste, téléreporter, télévendeur, télévendeuse, téléxiste, tenancier, tenancière, tendeur, tendeuse, tennisman, tenniswoman, terminologue, terrassier, testeur, testeuse, thanatologue, théâtreuse, thermicienne, thermicien, thonier, tilleur, tilleuse, timbreur, timbreuse, timonier, tireur, tisserand, tisserande, tisseur, tisseuse, titreur, titreuse, toiletteur, toiletteuse, toilier, toilière, tolier, tolière, tombeur, tondeur, tondeuse, tonnelier, topographe, tordeur, tordeuse, toreador, torera, torero, torréfacteur, toucheur, toucheuse, toueur, toueuse, toupilleur, tourbière, tourbier, tourier, tourneur, tourneuse, traceur, traceuse, tractoriste, trader, traducteur, traductrice, tragédienne, tragédien, traiteur, trameur, trameuse, traminot, trancheur, trancheuse, transformiste, transitaire, transporteur, transporteuse, trapéziste, trappeur, trappeuse, traqueur, traqueuse, trayeur, trayeuse, tréfileur, tréfileuse, trempeur, trésorier, trésorière, tresseur, tresseuse, tribun, tricoteur, tricoteuse, trieur, trieuse, tripier, tripière, trompettiste, tronçonneur, trousseur, trufficulteur, trufficultrice, truquiste, tubiste, tuilier, tulliste, tuteur, tutrice, tuyauteur, tuyauteuse, typographe, ubiquiste, ufologue, urbaniste, urgentiste, urgentologue, urologue, usineur, usineuse, usinier, usurière, usurier, vacataire, vacher, vachère, vaguemestre, vaisselier, valet, vanneur, vanneuse, vannier, vannière, veilleur, veloutier, vendangeur, vendangeuse, vendeur, vendeuse, veneur, ventriloque, vépéciste, verdier, verdurière, verdurier, vergetier, vérificateur, vérificatrice, vérifieur, vérifieuse, vernisseur, vernisseuse, verrier, verrière, vétérinaire, vibraphoniste, vicaire, vidangeur, vidangeuse, vidéaste, videur, videuse, vigie, vigile, vigil, vigneron, vigneronne, viguier, vinaigrier, vinificateur, vinificatrice, violiste, violoncelliste, violoniste, virolière, virolier, virologue, visagiste, visitateur, visitatrice, visiteuse, viticulteur, viticultrice, vitrier, vitrière, vivandière, vivandier, voilier, voiturier, voiturière, voiturin, volailler, volailleur, volcanologue, voltigeur, voltigeuse, voyagiste, voyer, vulcanologue, wagonnier, wattman, webdesigner, webmarketeur, webmarketeuse, webmaster, webmestre, webplaner, zingueur, zoologiste, zoologue.

B.2. Abstract Concepts

prier prier, adorer, conjurer, louer, supplier, implorer, solliciter, revendiquer, enjoindre culte, religion culte, admiration, adoration, adulation, église, amour, célébration, cérémonie, catholicisme, confession, croyance, déification, dévotion, dévouement, fétichisme, foi, hommage, idolâtrie, liturgie, messe, mystère, office, pèlerinage, passion, piété, pratique, prière, révérence, religion, respect, rite, théogonie.

B.3. Places

habitation village, agglomération, bourg, bourgade, feux, garnison, hameau, bastide, cité, citadelle, habitant, habitante, locataire, métropole, peuple, peuplement, commerce, cabaret, cambuse, guinguette, hôtel, hôtellerie, palace, pension, restau, restaurant, pharmacie, drugstore, officine, boutique, affaire, atelier, attirail, bahut, bande, banneton, bazar, bric-à-brac, camarilla, chapelle, clan, clique, crémerie, débit, maffia, magasin, outillage, réserve, vitrine, vivier, mairie, capitole, capitoul, commune, municipalité, douane, laboratoire, arrière-boutique, cabinet, importation, exportation, académie, école, assemblée, athénée, campus, centre, collège, constitution, cours, département, faculté, laboratoire, lycée, organisme, préfecture, séminaire, société, abri, édifice, église, appartement, artisanal, asile, établissement, baraque, bas-lieu, bâtiment, bâtisse, bercail, bicoque, boîte, bouge, branche, building, bungalow, cabane, cahute, campagne, cassine, chacunière, chalet, château, chaumière, chez-soi, clapier, clinique, construction, couronne, couvert, demeure, descendance, domesticité, domestique, domicile, dynastie, entreprise, ferme, feu, firme, foyer, gîte, gens, habitacle, hôpital, home, hutte, immeuble, institut, institution, intérieur, lares, lieu, lignée, loge, logement, logis, maison, maisonnée, maisonnette, ménage, manoir, manufacture, masure, monde, naissance, nid, nom, origine, palais, pénates, parents, pavillon, pigeonnier, place, prison, propriété, race, réduit, résidence, retraite, séjour, sanctuaire, serviteur, taudis, temple, trône chambre alcôve, alvéole, antichambre, assemblée, association, cabine, cagibi, cambriole, carrée, case, cavité, cellule, chambre, chambrée, chambrette, compartiment, conseil, corps, cour, crèche, creux, dortoir, galetas, gourbi, habitation, local, mansarde, nursery, parlement, pièce, piaule, poêle, reposée, salle, studette, studio, taule, tourelle, tribunal, turne, salon, bal, boudoir, bringue, cercle, club, entourage, exposition, fiesta, hall, living, living-room, musée, nouba, raout, redoute, sauterie chemin rue, allée, artère, asphalte, avenue, boulevard, chaussée, chemin, cheminée, galerie, impasse, loggia, passage, pavé, ruelle, accès, barrage, billet, bouffée, boyau, brèche, bribe, canal, chenal, circulation, col, conduit, conjoncture, corridor, couloir, défilé, dégagement, déplacement, détroit, endroit, extrait, flânerie, foulée, fragment, franchissement, fuite, gorge, goulet, gué, issue, moment, morceau, ouverture, péage, page, pèlerinage, partie, pas, passée, passe, passerelle, pâturage, percée, piste, port, randonnée, route, sédentarisation, saut, sentier, seuil, sillage, stage, strophe, suture, trafic, transit, transition, traversée, traverse, trouée, tunnel, vadrouille, va-et-vient, venelle, venue, vestibule, voie, pont, appontement, aqueduc, bau, bordé, bordage, dunette, embelle, gaillard, passavant, plate-forme, pontil, quai, spardeck, superstructure, wharf nature forêt, affluence, arbre, bois, breuil, feuillage, frondaison, futaie, jungle, labyrinthe, perçoir, perchis, sylve, taillis, taraud, montagne, accident, accumulation, élévation, alpage, amas, éminence, amoncellement, assemblage, chaîne, chaînon, cime, colline, contrefort, djebel, entassement, fatras, fouillis, hauteur, haut, mamelon, monceau, mont, monticule, pic, pile, piton, sierra, sommet, tas, mer, étendue, baille, déluge, espace, essaim, flot, flots, fourmillement, large, marine, océan, onde, plaine, reflux, rivage, île, îlot, archipel, atoll, havre, oasis, affluent, bain, collier, cours, eau, fleuve, flux, gave, oued, ravine, ruisseau, torrent, marais, paysage, bergerie, bucolique, campagne, carte, décor, horizon, localisation, panorama, sous-bois, verdure localisation à, bord, ailleurs, auprès, autour, abords, adjacent, touchant, limitrophe, contre, collé, avoisinant, côtés, côté, proximité, approchant, rapproché, attenant, imminent, jouxte, loin, près, presque, proche, sur, voisin, riverain, depuis, dès, par, pour, dedans, dehors, en, hors, proche, éloigné, distant, adresse, but, destination, sens, trimard, nord, arctique, boréal, borée, hyperboréen, nordique, polaire, septentrion, sud, antarctique, austral, méridional, est, aurore, orient, ouest, occident, occidental actions aborder, aboutir, échoir, apparaître, approcher, arriver, atteindre, avancer, débarquer, dériver, descendre, entrer, marcher, monter, partir, parvenir, rappliquer, diriger, remonter, redescendre, abouler, déplacer, rapprocher, transporter, suivre, surgir, survenir, venir, abandonner, échapper, éclipser, aller, éloigner, émigrer, appareiller, bouger, circuler, déménager, embarquer, enfuir, fuir, nager, promener, provenir, quitter, repartir, tourner, trotter, vider, voyager, traverser, pénétrer, sillonner, transir, retourner, habiter, louer, domicilié, domicilier, cabaner, camper, crécher, demeurer, exister, fréquenter, gîter, hanter, loger, nicher, obséder, occuper, percher, peupler, posséder, poursuivre, préoccuper, régner, résider, séjourner, rencontrer, siéger, travailler, visiter, vivre, rapatriement, exfiltration, réinsertion transports paquebot, barque, bateau, batelet, caboteur, canoë, caraque, caravelle, cargo, chalutier, dériveur, embarcation, felouque, jonque, navire, paquebot, péniche, périssoire, pirogue, radeau, rafiot, steamer, steamboat, skiff, submersible, transatlantique, trirème, vaisseau, vapeur, voilier, aérodyne, aéronef, aéroplane, aéroscaphe, airbus, appareil, avion-taxi, bac, aérien, bombardier, canadair, charter, chasseur, avion, giravion, hydravion, jet, tacot, taxi, transport, U.L.M., voiture, attelage, auto, berline, cabriolet, carriole, charrette, fardier, fourgon, train, véhicule, locomotive, automotrice, coucou, locomotrice, machine, motrice, bagnole, chignole, coche, fiacre, limousine, roadster, tire, tramway, vélo, deux-roues, bécane, bicyclette, moto, cyclomoteur, enduro, engin, motocycle, motocyclisme, scooter, solex, motocyclette, vélomoteur, embarquement.

Summary

Keywords

text processing, named entity recognition, NER, literature, French, distant reading

Citation

Bornet C and Kaplan F (2017) A Simple Set of Rules for Characters and Place Recognition in French Novels. Front. Digit. Humanit. 4:6. doi: 10.3389/fdigh.2017.00006

Received

12 October 2016

Accepted

08 March 2017

Published

31 March 2017

Volume

4 - 2017

Edited by

Simonetta Montemagni, Consiglio Nazionale Delle Ricerche (CNR), Italy

Reviewed by

Mike Kestemont, University of Antwerp, Belgium; Fr Frontini, Université Paul-Valéry Montpellier 3, France; Sara Tonelli, Fondazione Bruno Kessler, Italy

Updates

Copyright

*Correspondence: Cyril Bornet,

Specialty section: This article was submitted to Digital Literary Studies, a section of the journal Frontiers in Digital Humanities

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics