Really great review, a question or comment - the more developed these become, the more they become weaponized - i.e. for example perhaps, generating con/text from Russia or Israel - the more we will get problematic and indeterminate/unrecognizable flooding. Things like this are already disrupting political processes, propaganda, etc. I'm more concerned about this than about generation, as a result of the generation itself becoming the ghost or carapace of human production - or testimony..

A toy and a marvel, but similar are being used here in relation to the upcoming elections etc.

- Alan

On Tue, 14 May 2024, Mez Breeze via NetBehaviour wrote:

Date: Tue, 14 May 2024 20:48:41 +1000
From: Mez Breeze via NetBehaviour <[email protected]>
To: NetBehaviour for networked distributed creativity
    <[email protected]>
Cc: Mez Breeze <[email protected]>
Subject: [NetBehaviour] ChatGPT-4o: Breakthrough or Bust?

Hihi All.

So I've just written this Patreon post all about OpenAi's latest release,
ChatGPT-4o. I'm curious to see what peeps here make of it:

--

_ChatGPT-4o: Breakthrough or Bust?_

ChatGPT-4o [or Omni as the cool kids at OpenAI term it] sauntered into the
release-spotlight earlier today, with OpenAi writing on their website that
it?s:

??a step towards much more natural human-computer interaction?it accepts as 
input
any combination of text, audio, and image and generates any combination of
text, audio, and image outputs. It can respond to audio inputs in as little
as 232 milliseconds, with an average of 320 milliseconds, which is similar
to human response time in a conversation.?

Released alongside a bevy of slick videos, ChatGPT-4o is touted to be the
next big thing with multiple video promos parading its new multimodal chops.
One such video shows one version of GPT-4o narrating visual scenes to its
visually impaired AI mate [while it asks questions in turn] all while the
human testers fidgeted impatiently on the sidelines, barely masking their
urge to fast-forward to the good bits.

The grand unveiling of GPT-4o could?ve been lifted straight from a sci-fi
script. We watched [some in awe, some with cynicism] as these videos
attempted to paint a future where AIs chat about the d?cor in an empty room
and then sing a duet about the process. And yet it's hard not to chuckle at
the not-so-subtle desperation in the human testers who seem hell-bent on
skipping/interrupting the more stilted voice-scripted parts of the AI
dialogue and hurry to reach the 'money shot' of the demo [ie the concrete
feature they were trying to showcase]. It begs the question: was the fanfare
a bit premature? After all, this wasn?t the GPT-5 release party everyone had
RSVP?d to.

Diving into the nitty-gritty, the AI?s voice tech is frankly impressive,
reminding us of those Figure 01 Speech-to-Speech Reasoning clips from a
while back. The voices are spot-on: the AI sounds like a person, with
natural human-emulated cadences, sub-vocalisations, and tonalities [the
laughter is weirdly realistic]. What was far less impressive is the
replication of bias in the gendered aspects the simulated speech with the
female voices being far more sexualised/flirty than the male ones. It's
especially disappointing given the potential ramifications for how this will
impact users and perpetuate current gender biases. In 2024, one would hope
we?d be past such gendered gimmicks [but?.no].

Now let?s talk timing: releasing ChatGPT-4o for free might seem like a clever
move, but the cynics among us might sniff something fishy. Rumour has it
OpenAI?s nearly chewed through the entire web for data to feed its language
models, so why not throw open the gates and let the global crowd serve up
fresh fodder? It?s a clever ploy about scraping up every last crumb of human
interaction to power their data-hungry tech.

Let?s not be too harsh, though. OpenAI?s latest toy is a bit of a marvel: in
some ways it?s like watching a new species come to life. But as we ooh and ahh
over this latest OpenAI release, are we so dazzled by the prospect of
talking chat [and vision-processing] bots that we overlook the less
glamorous implications, like privacy erosion and data exploitation? [And let?s
not kid ourselves, wasn't everyone really hanging out for GPT-5?]

In essence ChatGPT-4o is a bit of a mixed bag. It's an impressive party
trick, sure, but when the release-glitter settles we?re left pondering what it
really brings to the table. It?s a step forward no doubt, but also a sidestep,
a fancy detour on the road to more profound innovations. The real kicker isn?t
what this AI can do but what it represents in the grand scheme of things
[all up a blend of breakthrough and bias, of marvels and missed
opportunities]. So let?s marvel at the spectacle but remain skeptical of the
smoke and mirrors. After all [in the current world of current AI
release-hype] every new release is a double-edged sword, and it?s up to us to
suss out which edge to grab.

--
| mezbreezedesign.com




_______________________________________________
NetBehaviour mailing list
[email protected]
https://lists.netbehaviour.org/mailman/listinfo/netbehaviour

Reply via email to