language model
ALBETO and DistilBETO: Lightweight Spanish Language Models
In recent years there have been considerable advances in pre-trained language models, where non-English language versions have also …
José Cañete, Sebastián Donoso, Felipe Bravo-Marquez, Andrés Carvallo, Vladimir Araujo
Jan 1, 2022
Evaluation Benchmarks for Spanish Sentence Representations
Due to the success of pre-trained language models, versions of languages other than English have been released in recent years. This …
Vladimir Araujo, Andrés Carvallo, Souvik Kundu, José Cañete, Marcelo Mendoza, Robert E. Mercer, Felipe Bravo-Marquez, Marie-Francine Moens, Alvaro Soto
Jan 1, 2022
Spanish Pre-Trained BERT Model and Evaluation Data
The Spanish language is one of the top 5 spoken languages in the world. Nevertheless, finding resources to train or evaluate Spanish …
José Cañete, Gabriel Chaperon, Rodrigo Fuentes, Jou-Hui Ho, Hojin Kang, Jorge Pérez
Jan 1, 2020
Spanish Word Embeddings
Jorge Pérez, José Cañete, Cristian Cardellino, FastText team
Jan 1, 2020
GLUES: General Language Understanding Evaluation in Spanish
The GLUES benchmark collects tasks from different sources to evaluate Spanish language models in a unified fashion, with the aim of supporting the growth of the Spanish NLP community.
Apr 27, 2016
Spanish Corpora
A repository that compiles Spanish-language corpora totaling 3B words.
José Cañete
Apr 27, 2016