We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
THE CORPUS DE RÉFÉRENCE DU FRANÇAIS CONTEMPORAIN (CRFC) AS THE FIRST GENRE-DIVERSE MEGA-CORPUS OF FRENCH.
- Authors
Siepmann, Dirk; Bürgel, Christoph; Diwersy, Sascha
- Abstract
The Corpus de référence du français contemporain (CRFC) is a new purpose-built genrediverse corpus for investigating modern French. The 310-million-word corpus is the first collection of French to incorporate a substantial amount of spontaneous speech (approx. 30m words) and 'pseudo-spoken' data (approx. 125m words); it is evenly divided between spoken/pseudo-spoken and written sources. The present article begins by comparing the CRFC with previous corpora of French before discussing the design of the corpus and a number of pilot studies based on it, including an investigation into oral uses of common lexemes which have gone unrecorded in dictionaries and a reanalysis of the French subjunctive.
- Subjects
CORPORA; FRENCH language; SPEECH; PILOT projects; LEXEME; POLYGLOT dictionaries; FRENCH language -- Dictionaries; ENGLISH language dictionaries
- Publication
International Journal of Lexicography, 2017, Vol 30, Issue 1, p63
- ISSN
0950-3846
- Publication type
Article
- DOI
10.1093/ijl/ecv043