How to use LilaBoualili/colbert-distilbert-margin_mse-T2-msmarco-encoder-only with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("feature-extraction", model="LilaBoualili/colbert-distilbert-margin_mse-T2-msmarco-encoder-only")

# Load model directly
from transformers import AutoTokenizer, AutoModel
tokenizer = AutoTokenizer.from_pretrained("LilaBoualili/colbert-distilbert-margin_mse-T2-msmarco-encoder-only")
model = AutoModel.from_pretrained("LilaBoualili/colbert-distilbert-margin_mse-T2-msmarco-encoder-only")

We provide the encoder of a DistilBERT-based ColBERT model trained for full retrieval. The model was trained with Margin-MSE on MSMARCO-Passage, using an ensemble of three BERT_Cat (concatenated BERT scoring) teachers. For more details, see the full model card.
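To make the training objective concrete, here is a minimal pure-Python sketch of a Margin-MSE loss. It penalizes the squared difference between the student's score margin (positive minus negative passage) and the teacher's margin. The function names, the toy scores, and the choice to average the three teachers' scores are assumptions for illustration only; the card does not state how the ensemble is aggregated.

```python
def ensemble_mean(teacher_scores):
    """Average per-example scores across teachers (assumed aggregation)."""
    return [sum(column) / len(column) for column in zip(*teacher_scores)]


def margin_mse(student_pos, student_neg, teacher_pos, teacher_neg):
    """Mean squared error between student and teacher score margins."""
    squared_errors = [
        ((sp - sn) - (tp - tn)) ** 2
        for sp, sn, tp, tn in zip(student_pos, student_neg, teacher_pos, teacher_neg)
    ]
    return sum(squared_errors) / len(squared_errors)


# Toy scores: one row per teacher, one column per training example (hypothetical values).
teacher_pos = ensemble_mean([[8.0, 6.0], [10.0, 7.0], [9.0, 8.0]])
teacher_neg = ensemble_mean([[2.0, 3.0], [4.0, 5.0], [3.0, 4.0]])

# Hypothetical student scores for the same two (query, positive, negative) triples.
student_pos = [5.0, 4.0]
student_neg = [1.0, 2.0]

loss = margin_mse(student_pos, student_neg, teacher_pos, teacher_neg)
print(loss)
```

Because only the margins are compared, the student is free to score on a different scale than the teachers as long as the gap between positive and negative passages matches.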
This encoder-only model is used as the oracle for distilling term topic embeddings in our ECIR'23 paper.
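For context, ColBERT-style models score a query against a passage by late interaction: each query token embedding is matched to its most similar passage token embedding, and the maxima are summed. The sketch below illustrates that scoring rule on toy vectors in pure Python. The vectors and function names are hypothetical stand-ins, and the card does not say whether this encoder-only checkpoint's outputs should be normalized or projected before scoring.

```python
def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def maxsim_score(query_vecs, passage_vecs):
    """Sum over query tokens of the best-matching passage token similarity."""
    return sum(max(dot(q, d) for d in passage_vecs) for q in query_vecs)


# Toy per-token embeddings (hypothetical values, 2 dimensions for readability).
query_vecs = [[1.0, 0.0], [0.0, 1.0]]
passage_vecs = [[0.5, 0.5], [1.0, 0.0], [0.0, 0.2]]

score = maxsim_score(query_vecs, passage_vecs)
print(score)
```

With real inputs, the per-token vectors would come from the encoder's token-level hidden states rather than hand-written lists.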