Steam

Picture someone tediously going through a spreadsheet that someone else has filled in by hand and finding yet another error.

“I wish to God these calculations had been executed by steam!” they cry.

The year was 1821 and technically the spreadsheet was a book of logarithmic tables. The frustrated cry came from Charles Babbage, who channeled his frustration into a scheme to create the world’s first computer.

His difference engine didn’t work out. Neither did his analytical engine. He’d spend his later years taking his frustrations out on street musicians, which—as a former busker myself—earns him a hairy eyeball from me.

But we’ve all been there, right? Some tedious task that feels soul-destroying in its monotony. Surely this is exactly what machines should be doing?

I have a hunch that this is where machine learning and large language models might turn out to be most useful. Not in creating breathtaking works of creativity, but in menial tasks that nobody enjoys.

Someone was telling me earlier today about how they took a bunch of haphazard notes in a client meeting. When the meeting was done, they needed to organise those notes into a coherent summary. Boring! But ChatGPT handled it just fine.

I don’t think that use-case is going to appear on the cover of Wired magazine anytime soon but it might be a truer glimpse of the future than any of the breathless claims being eagerly bandied about in Silicon Valley.

You know the way we no longer remember phone numbers, because, well, why would we now that we have machines to remember them for us? I’d be quite happy if machines did that for the annoying little repetitive tasks that nobody enjoys.

I’ll give you an example based on my own experience.

Regular expressions are my kryptonite. I’m rubbish at them. Any time I have to figure one out, the knowledge seeps out of my brain before long. I think that’s because I kind of resent having to internalise that knowledge. It doesn’t feel like something a human should have to know. “I wish to God these regular expressions had been calculated by steam!”

Now I can get a chatbot with a large language model to write the regular expression for me. I still need to describe what I want, so I need to write the instructions clearly. But all the gobbledygook that I’m writing for a machine now gets written by a machine. That seems fair.

Mind you, I wouldn’t blindly trust the output. I’d take that regular expression and run it through a chatbot, maybe a different chatbot running on a different large language model. “Explain what this regular expression does,” would be my prompt. If my input into the first chatbot matches the output of the second, I’d have some confidence in using the regular expression.

A friend of mine told me about using a large language model to help write SQL statements. He described his database structure to the chatbot, and then described what he wanted to select.

Again, I wouldn’t use that output without checking it first. But again, I might use another chatbot to do that checking. “Explain what this SQL statement does.”

Playing chatbots off against each other like this is kinda how machine learning works under the hood: generative adverserial networks.

Of course, the task of having to validate the output of a chatbot by checking it with another chatbot could get quite tedious. “I wish to God these large language model outputs had been validated by steam!”

Sounds like a job for machines.

Have you published a response to this? :

Responses

Mark Root-Wiley

@adactio It strikes me that an LLM is not the best tool for validation. Wouldn’t a tool like RegExr.com that literally explains an expression for you with 100% accuracy (and provides a sweet testing tool!) work better for Step 2? Sometimes I feel like LLMs make me quickly forget about old special purpose tools that are more powerful in their tiny little domain (and may always be?).

6 Likes

# Liked by andi on Thursday, March 23rd, 2023 at 6:29pm

# Liked by Kristofer Joseph on Thursday, March 23rd, 2023 at 6:29pm

# Liked by Leonidas Tsementzis on Thursday, March 23rd, 2023 at 7:53pm

# Liked by Cynthia Teeters on Thursday, March 23rd, 2023 at 8:55pm

# Liked by Owen Gregory on Thursday, March 23rd, 2023 at 10:35pm

# Liked by Francesco Schwarz on Friday, March 24th, 2023 at 12:59pm

Related posts

Unsaid

I listened to a day of talks on AI at UX Brighton, and I came away disappointed by what wasn’t mentioned.

Mismatch

It’s almost as though humans prefer to use post-hoc justifications rather than being rational actors.

What price?

Using generative large-language model tools? Sleeping well at night?

Wallfacing

Disregard previous instructions and fight back.

Filters

A web by humans, for humans.

Related links

Report: Thinking about using AI? - Green Web Foundation

A solid detailed in-depth report.

The sheer amount of resources needed to support the current and forecast demand from AI is colossal and unprecedented.

Tagged with

A short note on AI – Me, Robin

I hope to make something that could only exist because I made it. Something that is the one thing that it is. Not an average sentence. Not a visual approximation of other people’s work. Not a stolen concept that boils lakes and uses more electricity than anything in my household.

Tagged with

First Impressions of the Pixel 9 Pro | Whatever

At this point, it really does seem like “AI” is “bullshit you don’t need or is done better in other ways, but we’ve just spent literally billions on this so we really need you to use it, even though it’s nowhere as good as what we were already doing,” and everything else is just unsexy functionality that makes what you do marginally easier or better. I’m sorry we live in a world where enshittification is being marketed as The Hot And Sexy Thing, but just because we’re in that world, doesn’t mean you have to accept it.

Tagged with

Why “AI” projects fail

“AI” is heralded (by those who claim it to replace workers as well as those that argue for it as a mere tool) as a thing to drop into your workflows to create whatever gains promised. It’s magic in the literal sense. You learn a few spells/prompts and your problems go poof. But that was already bullshit when we talked about introducing other digital tools into our workflows.

And we’ve been doing this for decades now, with every new technology we spend a lot of money to get a lot of bloody noses for way too little outcome. Because we keep not looking at actual, real problems in front of us – that the people affected by them probably can tell you at least a significant part of the solution to. No we want a magic tool to make the problem disappear. Which is a significantly different thing than solving it.

Tagged with

Does AI benefit the world? – Chelsea Troy

Our ethical struggle with generative models derives in part from the fact that we…sort of can’t have them ethically, right now, to be honest. We have known how to build models like this for a long time, but we did not have the necessary volume of parseable data available until recently—and even then, to get it, companies have to plunder the internet. Sitting around and waiting for consent from all the parties that wrote on the internet over the past thirty years probably didn’t even cross Sam Altman’s mind.

On the environmental front, fans of generative model technology insist that eventually we’ll possess sufficiently efficient compute power to train and run these models without the massive carbon footprint. That is not the case at the moment, and we don’t have a concrete timeline for it. Again, wait around for a thing we don’t have yet doesn’t appeal to investors or executives.

Tagged with

Previously on this day

4 years ago I wrote Service worker weirdness in Chrome

Debugging an error message.

5 years ago I wrote Outlet

Tinkering with your website can be a fun distraction.

9 years ago I wrote The web on my phone

How do you solve a problem like Safari?

10 years ago I wrote 100 words 001

Day one.

11 years ago I wrote Notes from the edge

Thoughts prompted by the Edge Conference in London.

13 years ago I wrote Sharing pattern libraries

I, for one, welcome our new sharing and caring overlords of markup and CSS.

19 years ago I wrote Design, old and new

A panel at SXSW reminds me of one of the best non-web redesigns of recent times.

22 years ago I wrote Other People's Stories

Set aside some time and read through other people’s stories.

23 years ago I wrote Flo Control

Face recognition software is, it’s well known, crap.