Computer use is a great idea. It gets the job done when nothing else will.
If you're a person trying to get their job done at a big company, but half your job is in 1-2 proprietary tools or is stuck behind an API you can't program against, computer use can allow you, a non-techie, to do your job more efficiently.
I think it's an awesome way to circumvent gate keepers and the IT department to let people accomplish their goals.
It does. I used to be an ahk "script kiddie" and know it front and back. It's sort of burnt into my brains. As a result, I can prompt really really well, notice issues at a glance, and I have a sheer volume of scripts locally for all sorts of tasks some from as far back as 2014. From tiling window managers to OCR all the way to simple hotkeys/hotstrings. I let it grep in that folder and build out whatever I want using those primitives. This gives actually 1-shot immediately usable 100% working scripts even with GPT3.5 level models, as opposed to the iterations needed for typical development.
Example: adding copyright text box to bottom of every slide
Yeah, it's not that computer use is the most theoretically optimal paradigm, but there's a reasonable case that given the constraints of modern software systems and how they're built, that it's the most realistically optimal paradigm.
Computer use is very useful for developing GUI applications since claude code can build and test the entire app end-to-end (accessibility APIs exist but depending on the UI framework of your choosing you can run into walls very fast).
I run it in a VM using a headless wayland compositor, I'd never trust even fable with access to my real system.
I think there's a sweet spot- a lot of the time you're probably better off with "reverse engineer this web page and build me an API or personalized chrome extension to meet my needs".
I have an agent doing price checks for me for an item on a certain website. Instead of blasting through a zillion tokens processing the DOM over and over, it loaded the page once and figured out how to download a json with the price.
It curls the page. I think the approach it took won't actually wouldn't work in my local browser- its getting the value from some conversion reporting code that I'm guessing my ublock extension would hide.
How are folks using “computer use” to click things on intranet portals that are behind an SSO?
Even this OP example shows visitors a url and enter this search term… that is port of useless.
How can I automate things behind an SSO wall? Even if it means I manually authorize it once and watch it do things on its own..
I've never used Gemini computer use, but I assume it's the same:
Claude computer use takes control of your whole computer inputs (mouse and keyboard) plus screenshots. You just log in, tell Claude you're logged in, and let it get to work. It'll use the browser you're logged in with.
The chrome extension is a little better because it only takes control of its own chrome tabs (again: you just log in.)
For anyone trying to figure out how to build a society where no one wants to be a criminal, I highly recommend When Brute Force Fails: How to Have Less Crime and Less Punishment by Mark Kleiman.
There are evidence-backed ways of reducing criminality.
One counterintuitive way of reducing crime is to increase the likelihood of being caught, to have small-but-increasing consequences for committing crimes, and to increase the swiftness of sentencing.
For example, if you are caught drinking and driving, you immediately spend 1-2 days in jail.
Long sentences are not very productive at reducing crime or at least are a very inefficient way to do so.
An intuitive way to understand it is imagining that there was a system where if you stole something, you 100% of the time got hit with a charge to your account of the item value + $10. No one would steal again even if the penalty for getting caught was relatively nothing. Because the feedback loop is so short and guaranteed.
No ones life would be ruined over a dumb choice and yet they would change their behavior very fast.
It’s still the same. If you steal something and have no money you lose the item and get some small penalty, perhaps a day in jail. If there is absolutely no chance you’d get away with it, everyone quickly realises there is no point.
On the machine I have (ResMed AirSense 10) the data gets written to an SD card I can pull out and import into OSCAR to see what the machine detected about my sleep apnea events.
I also (not always, but when I'm testing changes) use a continuous pulse oximeter to test my SpO2 blood oxygen % over the entire night. The data from this can either be combined with the data from the CPAP machine, or used on its own to test things like how I do with the collar and no CPAP machine, since the SpO2 drops are the dangerous thing I'm trying to avoid by eliminating the apnea events.
I use the Wellue SleepU for SpO2 monitoring but there are a lot of similar devices (the kind with finger tip sensors are generally more accurate than the wristwatch based solutions).
I think it does. Crops pull up a set amount of water. If it's concentrated, then they'll pull up a lot of heavy metals. If it's at very low levels, then they won't.
We live in a much, much cleaner world than we did 50 years ago. Legislation and environmental rules have worked. There are some areas where it could obviously be better, but also some areas where regulation is too strict (blocking housing, renewables, transit) and the system is evolving to address those.
I think the loss of local media has made it harder for misdeeds to come to light, but I don't want to throw up my hands and cede everything to commercial interests et al.
I think a look at other countries would do well. There are many with much tighter regulations (e.g. EU countries, Singapore, Japan) and they seem to have good results with that.
> We live in a much, much cleaner world than we did 50 years ago. Legislation and environmental rules have worked.
I think prevention of pollution is one area where very tight regulation is absolutely needed, and this seems to be an argument for that.
Of course regulation can be weaponized and used as a tool to serve entrenched interests as well - but this is then more a problem with the overall political system. Also, I think a proof that this is the case is necessary instead of assuming it by default.
This is written for the 3 models (Sonnet, Haiku, Opus 3). While some lessons will be relevant today, others will not be useful or necessary on smarter, RL’d models like Sonnet 4.5.
> Note: This tutorial uses our smallest, fastest, and cheapest model, Claude 3 Haiku. Anthropic has two other models, Claude 3 Sonnet and Claude 3 Opus, which are more intelligent than Haiku, with Opus being the most intelligent.
Yes, Chapters 3 and 6 are likely less relevant now. Any others? Specifically assuming the audience is someone writing a prompt that’ll be re-used repeatedly or needs to be optimized for accuracy.
That's not just them saving it locally to like `~/.claude/conversations`? Feels weird if all conversations are uploaded to the cloud + retained forever.
If you're a person trying to get their job done at a big company, but half your job is in 1-2 proprietary tools or is stuck behind an API you can't program against, computer use can allow you, a non-techie, to do your job more efficiently.
I think it's an awesome way to circumvent gate keepers and the IT department to let people accomplish their goals.
reply