Hugging Face released IDEFICS, an 80B open-access visual language model replicating DeepMind’s unreleased Flamingo. Built entirely on public data, it’s the first of its size available openly. Part of its training utilized OBELICS, a dataset with 141M web pages, 353M images, and 115B text tokens from Common Crawl.

  • AsAnAILanguageModel@sh.itjust.worksOP
    link
    fedilink
    English
    arrow-up
    2
    ·
    1 jaar geleden

    I tried the demo for a bit and it makes mistakes every time, but gets enough things right to be promising! I wonder how this will evolve in the coming months.