Preterit-imperfect-accuracy

Data for a project focused on accuracy development in L2 Spanish preterit-imperfect marking

Codebook

Code	Description
Corpus	The corpus from which the data was sourced.
Text_ID	The identifying key for each text analyzed.
Participant_ID	The identifying key for each participant. Note, this code was only used for the CEDEL2 data because every text in COWS-L2H came from a unique participant. Hence, the 'Text_ID' is equivalent to the 'Participant_ID' for the COWS-L2H data.
Course_Level	The course in which the student was enrolled when they produced the text. Note, this code was only used for the COWS-L2H data because all of the students derived from the same university; the CEDEL2 students are from various different universities.
Proficiency	The Spanish proficiency level of the participants.
Modality	Spoken or written- the modality in which the text was produced.
Token	The token (form) produced.
Lemma	The lemma of the token produced.
Appropriate, Ambiguous, Inappropriate	In these cells, we marked a '1' to classify the tense-aspect use in question as appropriate, inappropriate or ambiguous (i.e., we were unable to determine its appropriateness).
Marked_form	The tense-aspect classification of the token produced.
Obligatory_form	The tense-aspect classification of what should have been produced in that context (i.e., what is obligatory for suppliance).
Marked_form_simplified	Same as 'Form,' but with only 4 levels: preterit, imperfect, present, or other.
Obligatory_form_simplified	Same as 'Obligatory_form,' but with only 4 levels: preterit, imperfect, present, or other.
Frequency	The log-transformed, summed token frequencies of all preterit and imperfect forms of each lemma. Frequency data was extracted from the EsPal corpus (Duchon et al., 2013).
Regularity	The morphological regularity of the verb in the preterit or imperfect, based on Camps (2005).

Citation

Minnillo, S., Sánchez-Gutiérrez, C., Ruiz, A., Morgan, E. & González, C. (2024). Predictors of accuracy in L2 Spanish preterit-imperfect production. International Journal of Learner Corpus Research, 10(2).

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
LICENSE		LICENSE
Minnillo et al. 2024 Models.html		Minnillo et al. 2024 Models.html
README.md		README.md
cedel2_data_accuracy.csv		cedel2_data_accuracy.csv
cows_data_accuracy.csv		cows_data_accuracy.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Preterit-imperfect-accuracy

Codebook

Citation

About

Releases

Packages

Languages

License

sminnillo/Preterit-imperfect-accuracy

Folders and files

Latest commit

History

Repository files navigation

Preterit-imperfect-accuracy

Codebook

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages