To go the knowledge around the relative dependencies of various tokens appearing at different destinations while in the sequence, a relative positional encoding is calculated by some sort of learning. Two well known forms of relative encodings are:Prompt fine-tuning needs updating hardly any parameters though acquiring efficiency akin to whole mode