Before we start, let’s just get the basics out of the way - yes, stealing the work of hundreds of thousands if not millions of private artists without their knowledge or consent and using it to drive them out of business is wrong. Capitalism, as it turns out, is bad. Shocking news to all of you liberals, I’m sure, but it’s easy to call foul now because everything is wrong at once - the artists are losing their jobs, the slop being used to muscle them out is soulless and ugly, and the money is going to lazy, talentless hacks instead. With the recent implosion of the NFT space, we’re still actively witnessing the swan song of the previous art-adjacent grift, so it’s easy to be looking for problems (and there are many problems). But what if things were different?

Just to put my cards on the table, I’ve been pretty firmly against generative AI for a while, but I’m certainly not opposed to using AI or Machine Learning on any fundamental level. For many menial tasks like Optical Character Recognition and audio transcription, AI algorithms have become indispensable! Tasks like these are grunt work, and by no means is humanity worse off for finding ways to automate them. We can talk about the economic consequences or the quality of the results, sure, but there’s no fundamental reason this kind of work can’t be performed with Machine Learning.

AI art feels… different. Even ignoring where companies like OpenAI get their training data, there are a lot of reasons AI art makes people like me uneasy. Some of them are admittedly superficial, like the strange proportions or extra fingers, but there’s more to it than that.

The problem for me is baked into the very premise - making an AI to do our art only makes sense if art is just another task, just work that needs to be done. If sourcing images is just a matter of finding more grist for the mill, AI is a dream come true! That may sound a little harsh, and it is, but it’s true. Generative AI isn’t really art - art is supposed to express something, or mean something, or do something, and Generative AI is fundamentally incapable of functioning on this wavelength. All the AI works with is images - there’s no understanding of ideas like time, culture, or emotion. The entirety of the human experience is fundamentally inaccessible to generative AI simply because experience itself is inaccessible to it. An AI model can never go on a walk, or mow a lawn, or taste an apple, it’s just an image generator. Nothing it draws for us can ever really mean anything to us, because it isn’t one of us. Often times, I hear people talk about this kind of stuff almost like it’s just a technical issue, as if once they’re done rooting out the racial bias or blocking off the deepfake porn, then they’ll finally have some time to patch in a soul. When artist Jens Haaning mailed in 2 blank canvases titled “Take the Money and Run” to the Kunsten Museum of Modern Art, it was a divisive commentary on human greed, the nature of labor, and the nonsequitir pricing endemic to modern art. The knowledge that a real person at that museum opened the box, saw a big blank sheet, and had to stick it up on the wall, the fact that there was a real person on the other side of that transaction who did what they did and got away with it, the story around its creation, that is the art. If StableDiffusion gave someone a blank output, it’d be reported as a bug and patched within the week.

All that said, is AI image generation fundamentally wrong? Sure, the people trying to make money off of it are definitely skeevy, but is there some moral problem with creating a bunch of dumb, meaningless junk images for fun? Do we get to cancel Neil Cicierega because he wanted to know how Talking Heads frontman David Byrne might look directing traffic in his oversized suit?

Maybe just a teensy bit, at least under the current circumstances.

I’ll probably end up writing a part 2 about my thoughts on stuff like data harvesting and stuff, not sure yet. I feel especially strongly about the whole “AI is just another tool” discourse when people are talking about using these big models, so don’t even get me started on that.

  • KobaCumTribute [she/her]@hexbear.net
    link
    fedilink
    English
    arrow-up
    3
    ·
    3 months ago

    The problem with fetishizing the specific methods and intentionality of details is how that squares with available tools and materials and the purpose of a piece in the first place. That is, how intentional are details in a photograph? How much thought was put into some fuzzy background bit that’s only there because something had to be there? How intentional is some procedurally filled environment in a 3d render? How does mass produced rote decorations like Roman mosaics and statues, or older lost-wax casting fit in? So much of art is in practice just pragmatic methodology: things are reused to save labor and resources, rote actions or patterns are used because they simplify the process of creation or are part of a standard abstract language of a genre, things can be present just because they were available to use, large chunks can be handed off to other workers that are just performing some rote task to follow a description, etc.

    Hitting the “give treat” button on some proprietary off-site treatbox with no controls can only produce nonsense, but the actual mechanical stuff that facilitates that can - in an open source and locally-run package - be used in the same manner as prefab models in a render or already-existing places in a photograph.

    the vast majority of this material is indisputably tripe

    At this point the majority of people interacting with generative AI are techbro dipshits, and most AI art is being churned out by treatboxes with a simple prompt as the only control. Even among the “enthusiast” community most people just use local prompt-only treatboxes because even a basic flowchart UI is “too complicated” for them. I said this in another thread, but I’m heavily reminded of the original proliferation of highly accessible 3d renderer software and how that created a flood of low quality garbage where it’s just some prefab model standing in some prefab set with bad lighting and a bad pose, that people tried to turn into “comics” that are just a long progression of single panel pages with speech bubbles. Even now CGI is something that either has to be carefully hidden and blended into film or animation or which occupies its own realm of stylization and accepted conventions, because anything else looks jarring as hell and bad.

    Generative AI is that cranked up to 11: a tool where the barrier to reach “this is a picture with a character in it, sort of doing something I guess” is basically nothing. And just like CGI, most uses of it are bad and the good uses are more trying to shortcut off it and then hide its presence than use it directly.