Skip to content
Snippets Groups Projects
Unverified Commit b285c6f8 authored by Benjamin Clavié's avatar Benjamin Clavié Committed by GitHub
Browse files

fix(ColbertRerank): calculate ColBERT similarity per token rather than vs...

fix(ColbertRerank): calculate ColBERT similarity per token rather than vs pooled query embeds (#11335)

fix: calculate ColBERT similarity per token rather than vs pooled query embedding
parent ab367af7
No related branches found
No related tags found
No related merge requests found
......@@ -53,9 +53,7 @@ class ColbertRerank(BaseNodePostprocessor):
# Query: [batch_size, query_length, embedding_size] -> [batch_size, query_length, 1, embedding_size]
# Document: [batch_size, doc_length, embedding_size] -> [batch_size, 1, doc_length, embedding_size]
query_encoding = self._tokenizer(query, return_tensors="pt")
query_embedding = (
self._model(**query_encoding).last_hidden_state.mean(dim=1).unsqueeze(0)
)
query_embedding = self._model(**query_encoding).last_hidden_state
rerank_score_list = []
for document_text in documents_text_list:
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment