Viewing a single comment thread. View all comments

sonudofsilence OP t1_irw765w wrote

Yes, i know but in this way the embedding of a word will be created according only to the tokens of the sentence in which it is found, right?

1

ExchangeStrong196 t1_irw93ux wrote

Yes. In order to ensure the contextual token embedding attends to longer text, you need to use a model that accepts larger sequence lengths. Check out Longformer

1