Retentive Network: A Successor to Transformer for Large Language Models (arxiv.org)
Posted by haxor@derp.foo to Hacker News@derp.foo · English · 1 year ago
Cross-posted to: hackernews@lemmy.smeargle.fans, technews@radiation.party, localllama@sh.itjust.works, machinelearning@kbin.social