Charformer: Fast Character Transformers via Gradient-based Subword Tokenization (arXiv:2106.12672)
This model is still in development, and the weights are updated regularly.
Character-level encoder-decoder T5 transformer.
Model architecture based on https://arxiv.org/abs/2106.12672
This model does not use a subword tokenizer; it reads text one character at a time.
Use vocab.json for the char → id mapping.
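A minimal sketch of how the char → id mapping might be applied. It assumes vocab.json is a flat JSON object whose keys are single characters and whose values are integer ids; that layout, the helper names (`load_vocab`, `encode`, `decode`), and the optional `unk_id` fallback are illustrative assumptions, not part of the released code.

```python
import json
from typing import Dict, List, Optional


def load_vocab(path: str = "vocab.json") -> Dict[str, int]:
    """Load the char -> id mapping (assumed to be a flat JSON object)."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def encode(text: str, vocab: Dict[str, int], unk_id: Optional[int] = None) -> List[int]:
    """Map each character of `text` to its id.

    `unk_id` is a hypothetical fallback for characters missing from the
    vocabulary; if it is None, an unknown character raises KeyError.
    """
    ids = []
    for ch in text:
        if ch in vocab:
            ids.append(vocab[ch])
        elif unk_id is not None:
            ids.append(unk_id)
        else:
            raise KeyError(f"character {ch!r} is not in the vocabulary")
    return ids


def decode(ids: List[int], vocab: Dict[str, int]) -> str:
    """Invert the mapping to turn a list of ids back into text."""
    id_to_char = {i: ch for ch, i in vocab.items()}
    return "".join(id_to_char[i] for i in ids)
```

With the real file, usage would look like `ids = encode("merhaba", load_vocab("vocab.json"))`. Whether the model expects extra ids (padding, end of sequence) around the character ids is not stated here, so check vocab.json before feeding the result to the model.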
This model was developed by Orkun Gedik with the academic and technical support of Gazi University AI center and computer engineering department, Ankara. The project benefited from Gazi University’s intellectual contributions, which played an important role in the design, training, and evaluation of the model.