• jacksilver@lemmy.world
    link
    fedilink
    English
    arrow-up
    4
    ·
    5 hours ago

    LLMs do suck at math, if you look into it, the o1 models actually escape the LLM output and write a python function to calculate the output, I’ve been able to break their math functions by asking for functions that use math not in the standard Python library.

    I know someone also wrote a wolfram integration to help solve LLMs math problems.