roneneldan/TinyStories
Viewer • Updated • 2.14M • 89.8k • 982
How to use AISE-TUDelft/Custom-Activations-BERT-ReLU with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("fill-mask", model="AISE-TUDelft/Custom-Activations-BERT-ReLU") # Load model directly
from transformers import AutoTokenizer, AutoModelForMaskedLM
tokenizer = AutoTokenizer.from_pretrained("AISE-TUDelft/Custom-Activations-BERT-ReLU")
model = AutoModelForMaskedLM.from_pretrained("AISE-TUDelft/Custom-Activations-BERT-ReLU")Basemodel: roBERTa
Configs: Vocab size: 10,000 Hidden size: 512 Max position embeddings: 512 Number of layers: 2 Number of heads: 4 Window size: 256 Intermediate-size: 1024
Results: