EHME: a new word database for research in Basque language

  1. Acha, Joana 1
  2. Laka Mugarza, Itziar 1
  3. Landa, Josu 1
  4. Salaburu Etxeberria, Pello 1
  1 Universidad del País Vasco/Euskal Herriko Unibertsitatea, Lejona, Spain


The Spanish Journal of Psychology

ISSN: 1138-7416

Year of publication: 2014

Volume: 17

Pages: 1-10

Type: Article

DOI: 10.1017/SJP.2014.79 (open access)


This article presents EHME, the frequency dictionary of Basque structure, an online program that enables researchers in psycholinguistics to extract word and nonword stimuli based on a broad range of statistics concerning the properties of Basque words. The database consists of 22.7 million tokens. In addition to the classical indexes (word frequency, orthographic structure, orthographic similarity, bigram and biphone frequency, and syllable-based measures), the available properties include morphological structure frequency and word-similarity measures. Measures are indexed at the lemma, morpheme, and word levels. We include reliability and validation analyses. The application is freely available and enables the user to extract words based on concrete statistical criteria, as well as to obtain statistical characteristics from a list of words.
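As an illustration of two of the classical indexes named in the abstract, the sketch below computes token-weighted bigram frequencies and one common orthographic similarity measure (neighbors that differ by a single substituted letter) over a toy word list. It is not EHME's implementation: the function names and the frequency counts are hypothetical, and each character is treated as one letter, which ignores Basque digraphs such as "tx".

```python
from collections import Counter

# Toy frequency list: real Basque words, but the counts are invented
# for illustration and are NOT taken from EHME.
toy_counts = {"etxe": 120, "esne": 45, "elur": 30, "egun": 200, "egur": 25}


def bigram_frequencies(counts):
    """Sum word frequencies over every adjacent letter pair (token-weighted)."""
    totals = Counter()
    for word, freq in counts.items():
        for i in range(len(word) - 1):
            totals[word[i:i + 2]] += freq
    return totals


def orthographic_neighbors(target, lexicon):
    """Return same-length words differing from target in exactly one position."""
    return sorted(
        w for w in lexicon
        if len(w) == len(target)
        and sum(a != b for a, b in zip(w, target)) == 1
    )


bigrams = bigram_frequencies(toy_counts)
neighbors = orthographic_neighbors("egur", toy_counts)
```

With this toy list, "egur" has the neighbors "egun" and "elur", while "etxe" has none. A real frequency dictionary may define these measures differently (for example, type-based rather than token-based counts, or position-specific bigrams).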


