Technology

technology@lemmy.world

82285 readers

4473 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

Turns out Generative AI was a scam (garymarcus.substack.com)

submitted 1 week ago by floofloof@lemmy.ca to c/technology@lemmy.world

11 comments fedilink hide all child comments

cross-posted from: https://lemmy.bestiver.se/post/952771

Comments

you are viewing a single comment's thread
view the rest of the comments

[–] Curious_Canid@piefed.ca 0 points 1 week ago (2 children)

LLMs are not capable of creating anything, including code. They are enormous word-matching search engines that try to find and piece together the closest existing examples of what is being requested. If what you're looking for is reasonably common, that may be useful. If what you're looking for is obscure, you may get things that don't apply. And the LLM cannot tell the difference. They can be useful but, unlike an LLM, you need to understand the context to use them safely.

I think the most interesting thing about LLMs is actually what they tell us about the repetitive nature of most of what we do.

[–] partial_accumen@lemmy.world 1 point 1 week ago (1 child)

LLMs are not capable of creating anything, including code. They are enormous word-matching search engines that try to find and piece together the closest existing examples of what is being requested. If what you’re looking for is reasonably common, that may be useful.

Just for common understanding, you're making blanket statements about LLMs as though those statements apply to all LLMs. You're not wrong if you're generally speaking of the LLM models deployed for retail consumption like, as an example, ChatGPT. None of what I'm saying here is a defense about how these giant companies are using LLMs today. I'm just posting from a Data Science point of view on the technology itself.

However, if you're talking about the LLM technology, as in a Data Science view, your statements may not apply. The common hyperparameters for LLMs are to choose the most likely matches for the next token (like the ChatGPT example), but there's nothing about the technology that requires that. In fact, you can set a model to specifically exclude the top result, or even choose the least likely result. What comes out when you set these hyperparameters is truly strange and looks like absolute garbage, but it is unique. The result is something that likely hasn't existed before. I'm not saying this is a useful exercise. Its the most extreme version to illustrate the point. There's also the "temperature" hyperparamter which introduces straight up randomness. If you crank this up, the model will start making selections with very wide weights resulting in pretty wild (and potentially useless) results.

What many Data Scientists trying to make LLMs generate something truly new and unique is to balance these settings so that new useful combinations come out without it being absolute useless garbage.

[–] Curious_Canid@piefed.ca 1 point 1 week ago

I write software for a living and I have worked directly with LLM backend code. You aren't wrong about the exceptions, but I think they actually reinforce my main point. If you play with the parameters you can make all kinds of things happen, but all of those things are still driven by the existing information it already has or can find. It can mash things together in random new ways, but it will always work with components that already exist. There is no awareness of context or meaning that would allow it to make intelligent choices about what it mashes together. That will always be driven by the patterns it already knows, positively or negatively.

It's like doing chemistry by picking random bottles from the shelf and dumping them into a beaker to see what happens. You could make an amazing discovery that way, but the chances of it happening are very, very low. And even if it does happen, there's an excellent chance that you won't recognize it.

I'm in favor of using LLMs for tasks that involve large-scale data analysis. They can be quite helpful, as long as the user understands their limitations and performs due diligence to validate the results.

Unfortunately what we are mostly seeing are cases where LLMs are used to generate boilerplate text or code that is assembled from a vast collection of material that someone who actually knew what they were doing had previously created. That kind of reuse is not inherently bad, but it should not be confused with what competent writers or coders do. And if LLMs really do take over a lot of routine daily tasks from people, the pool of approaches to those tasks will stagnate, and eventually degenerate, as LLMs become the primary sources of each others' solutions.

LLMs may very well change the world, but not it in the ways most people expect. Companies that have invested heavily in them are pushing them as the solutions to the wrong problems.