International audienceConcerning French, it would be natural to turn to the French National Library, which is rich in 14 million documents including 11 million books on the Tolbiac site. This would be comparable to Google Books offer, if access was similarly electronic. Unfortunately the number of documents accessible on the Internet, mainly in the Gallica base, is far from reaching that figure. In reality, the most reliable texts of Gallica, aside from newer ones transmitted by publishers in digital form, are those coming from the Frantext legacy. Those owe nothing to scanning, whose invention in 1974 by Ray Kurzweil is after the initial capturing, carried out by keyboardists on perforated tape. This manual input, duly revised and correcte...
International audienceThe World Wide Web is the greatest information space unseen until now, distrib...
This paper presents some of the design considerations of BREF, a large read-speech corpus for French...
Au terme d’une carrière de cinquante ans entièrement consacrée à la statistique linguistique, l’aute...
International audienceConcerning French, it would be natural to turn to the French National Library,...
International audienceCette communication présentera Frantext (www.frantext.fr), l’une des plus anci...
National audienceThis paper examines to what extent the massive availability of textual data in digi...
In France, as indeed in the whole world, numerous programs or initiatives have been launched during ...
Large means: more than 100 works printed before 1800 in English, Latin, French, German, etc. (no Sla...
The World Wide Web is the greatest information space unseen until now, distributed all over the worl...
International audienceThe editorialization means publishing practices and accessibility of content o...
International audienceQuantitative analysis of cultural history has begun with the appearance of mas...
International audiencePoint n'est besoin de présenter l'Institut de la langue française, où gît un t...
This paper presents some of the computerized linguistic resources of the Research Laboratory ATILF ...
Readability aims to assess the difficulty of texts based on various linguistic predictors (the lexic...
International audienceEn décembre 2010, un article a paru dans Science, qui rendait compte d'une ent...
International audienceThe World Wide Web is the greatest information space unseen until now, distrib...
This paper presents some of the design considerations of BREF, a large read-speech corpus for French...
Au terme d’une carrière de cinquante ans entièrement consacrée à la statistique linguistique, l’aute...
International audienceConcerning French, it would be natural to turn to the French National Library,...
International audienceCette communication présentera Frantext (www.frantext.fr), l’une des plus anci...
National audienceThis paper examines to what extent the massive availability of textual data in digi...
In France, as indeed in the whole world, numerous programs or initiatives have been launched during ...
Large means: more than 100 works printed before 1800 in English, Latin, French, German, etc. (no Sla...
The World Wide Web is the greatest information space unseen until now, distributed all over the worl...
International audienceThe editorialization means publishing practices and accessibility of content o...
International audienceQuantitative analysis of cultural history has begun with the appearance of mas...
International audiencePoint n'est besoin de présenter l'Institut de la langue française, où gît un t...
This paper presents some of the computerized linguistic resources of the Research Laboratory ATILF ...
Readability aims to assess the difficulty of texts based on various linguistic predictors (the lexic...
International audienceEn décembre 2010, un article a paru dans Science, qui rendait compte d'une ent...
International audienceThe World Wide Web is the greatest information space unseen until now, distrib...
This paper presents some of the design considerations of BREF, a large read-speech corpus for French...
Au terme d’une carrière de cinquante ans entièrement consacrée à la statistique linguistique, l’aute...