• ☆ Yσɠƚԋσʂ ☆@lemmygrad.ml
    link
    fedilink
    arrow-up
    19
    ·
    1 month ago

    Yeah, these things are not fundamentally different from Markov chains. Basically, it has a huge multidimensional graph of tokens, and all it’s doing is predicting the next likely token. So, when you introduce specific tokens into the input then it helps focus it in a particular direction.