Bluesky Thread

OpenAI Livestream: Announcing ChatGPT Agent

successor to Operator, it stands up an entire VM in the cloud with a GUI, web browser, terminal, & private data sources to accomplish tasks
fwiw i do not currently have access (Plus, $20/mo)
they implied that the model was custom trained for this product, via RL on tools
looks like you can interrupt or add more information while it's busy working
yes, it's fully collaborative, in that it will gladly use the user as a "tool" to get more information or clarification.

but unlike deep research, it's not a fixed workflow, it only asks when it needs to
for important steps like "send email", it asks for confirmation

but it's only an RL-trained safeguard, not a hard guarantee. it might skip the confirmation, but since the behavior is trained in, you can probably feel confident it'll ask when it should
oh, i missed this before: it has an image generation tool. So it'll make full-on PowerPoint slides, complete with custom graphics
benchies
Humanity's Last Exam & FrontierMath
agentic benchies
WebArena & BrowseComp
never-heard-of-these benchies
SpreadsheetBench & internal banking benchmark
is there a love interest going on here?
she's sitting super close to Sam
43 likes 7 reposts
