A large-scale Greek language corpus is used to train word embeddings and related linguistic resources, with a live web tool provided for their interactive exploration.
Word embeddings are useful components in many NLP tasks. In this paper, we present word embeddings and other linguistic resources trained on the largest digital Greek language corpus to date. We also present a live web tool for testing the Greek word embeddings, which offers "analogy", "similarity score" and "most similar words" functions. Through this explorer, users can interact with the Greek word vectors.
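The three explorer functions named above can be illustrated with a minimal sketch. It assumes cosine similarity as the scoring measure and vector offsets for analogies, which is the usual way such functions are built on word vectors; the abstract does not confirm how the web tool computes them. The toy three-dimensional vectors and the function names `similarity`, `most_similar` and `analogy` are hypothetical, invented for illustration, and are not taken from the paper's resources.

```python
import math

# Toy 3-dimensional vectors, invented for illustration only.
# Real embeddings have hundreds of dimensions and are learned from a corpus.
VECTORS = {
    "άντρας": [1.0, 0.0, 0.0],     # man
    "γυναίκα": [1.0, 1.0, 0.0],    # woman
    "βασιλιάς": [1.0, 0.0, 1.0],   # king
    "βασίλισσα": [1.0, 1.0, 1.0],  # queen
    "παιδί": [0.0, 1.0, -1.0],     # child
}


def _cosine(u, v):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v)


def _nearest(query, exclude, topn):
    """Rank vocabulary words by cosine similarity to a query vector."""
    scored = [
        (word, _cosine(query, vec))
        for word, vec in VECTORS.items()
        if word not in exclude
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:topn]


def similarity(word_a, word_b):
    """'Similarity score': cosine similarity between two words."""
    return _cosine(VECTORS[word_a], VECTORS[word_b])


def most_similar(word, topn=3):
    """'Most similar words': nearest neighbours, excluding the word itself."""
    return _nearest(VECTORS[word], {word}, topn)


def analogy(a, b, c, topn=1):
    """'Analogy': a is to b as c is to ?, using the offset b - a + c."""
    query = [vb - va + vc for va, vb, vc in zip(VECTORS[a], VECTORS[b], VECTORS[c])]
    return _nearest(query, {a, b, c}, topn)


if __name__ == "__main__":
    print(similarity("άντρας", "γυναίκα"))
    print(most_similar("βασιλιάς"))
    print(analogy("άντρας", "γυναίκα", "βασιλιάς"))
```

With trained embeddings, the dictionary would be replaced by vectors loaded from file; the ranking logic stays the same, although a vectorised implementation would be preferable for a full vocabulary.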