Pedro Ortiz Suarez
Pedro Ortiz Suarez
Home
Publicaciones
Presentaciones
Proyectos
Contacto
CV
Claro
Oscuro
Automático
Español
Español
Deutsch
English
Français
7
A Data-driven Approach to Natural Language Processing for Contemporary and Historical French
We determine that the importance of the pre-training dataset size was largely overestimated, as we are able to repeatedly show that language models can be pre-trained with corpora of a modest size.
Pedro Ortiz Suarez
PDF
Citar
Theses
TEL
Citar
×