Liliana Tolchinsky, M. Antònia Martí, Mariona Taulé (CLiC-UB)
M. Antònia Martí <amartiub.edu>
CESCA is a Catalan corpus of school writing produced by 2,400 schoolchildren between the ages of five and sixteen. Each informant wrote several types of text: vocabulary lists, narratives, definitions, and jokes.
The corpus supports studies of language development, literacy, the relationship between spoken language and spelling, the linguistic analysis of spontaneous language, and related topics.
No other corpus of this size and with these characteristics exists for Catalan. It allows the evolution of language to be studied using the same parameters across a large population.
The development of CESCA was funded by the following projects from the Generalitat de Catalunya (AGAUR): 2006ARIE-10058, 2007ARIE-00005, and 2008ARIE-00053.