lingo.lol is one of the many independent Mastodon servers you can use to participate in the fediverse.
A place for linguists, philologists, and other lovers of languages.

Server stats:

61
active users

#operator

0 posts0 participants0 posts today

L'altre dia vaig veure un tut on indicaven com fer per buscar tota la teva activitat de cop (tuts, m'agrades, respostes...) fent servir un #operador a la #cerca i paraula clau ara no el trobo ni me'n recordo 😞 / The other day I read a toot where they indicated how to search all your #activity at once (toots, likes, replies...) using an #operator and keyword in the #searchbox and now I can't find it or remember it 😞 #mastodontips #ajuda #autocerca #autosearch #Help #INeedSomebody #NotJustAnybody

Continued thread

«OpenAI launches Operator, an AI agent that can operate your computer:
New research "Computer-Use Agent" AI model can jump in and help users with on-screen tasks.»

I am critical of whether this is really useful and whether it can even benefit from people's knowledge ~(°-°)~

🤖 arstechnica.com/ai/2025/01/ope

Ars Technica · OpenAI launches Operator, an AI agent that can operate your computerBy Benj Edwards
#ai#openai#computer

#OpenAI launches #Operator, an #AIagent that can operate your computer

Operator watches on-screen content while you use your computer and executes tasks through simulated keyboard and mouse inputs.

> I don’t trust it enough to give it hands-off control. Especially since they #hallucinate.
#ai #security

arstechnica.com/ai/2025/01/ope

Ars Technica · OpenAI launches Operator, an AI agent that can operate your computerBy Benj Edwards

It looks like LLM-producing companies that are massively #crawling the #web require the owners of a website to take action to opt out. Albeit I am not intrinsically against #generativeai and the acquisition of #opendata, reading about hundreds of dollars of rising #cloud costs for hobby projects is quite concerning. How is it accepted that hypergiants skyrocket the costs of tightly budgeted projects through massive spikes in egress traffic and increased processing requirements? Projects that run on a shoestring budget and are operated by volunteers who dedicated hundreds of hours without any reward other than believing in their mission?

I am mostly concerned about the default of opting out. Are the owners of those projects required to take action? Seriously? As an #operator, it would be my responsibility to methodically work myself through the crawling documentation of the hundreds of #LLM #web #crawlers? I am the one responsible for configuring a unique crawling specification in my robots.txt because hypergiants make it immanently hard to have generic #opt-out configurations that tackle LLM projects specifically?

I reject to accept that this is our new norm. A norm in which hypergiants are not only methodically exploiting the work of thousands of individuals for their own benefit and without returning a penny. But also a norm, in which the resource owner is required to prevent these crawlers from skyrocketing one's own operational costs?

We require a new #opt-in. Often, public and open projects are keen to share their data. They just don't like the idea of carrying the unpredictable, multitudinous financial burden of sharing the data without notice from said crawlers. Even #CommonCrawl has safe-fail mechanisms to reduce the burden on website owners. Why are LLM crawlers above the guidelines of good #Internet citizenship?

To counter the most common argument already: Yes, you can deny-by-default in your robots.txt, but that excludes any non-mainstream browser, too.

Some concerning #news articles on the topic:

Replied in thread

@simon @heiseonline

(6/n)

...afterwards:

"The [#AI] system started realising that while they did identify the threat at times, the human #operator would tell it not to kill that threat, but it got its points by killing that threat. So what did it do? It killed the operator. It killed the operator because that person was keeping it from accomplishing its objective.”

He went on: “👉We trained the system – ‘Hey don’t kill the operator – that’s bad.👈..."

I am looking for a #kuberentes #operator that takes custom resources and spits out multiple custom resources based on the logic taken from a config (or another custom resource). Have you seen something like that? Any keywords to search with?

Use case - take generic "workload CR", create Namespace Resource, SA Resource, NewRepoOnCodeberg CR, etc..