I’ll tell you why this happens. You might use ChatGPT for a bit and your initial impressions will be great: it does what I ask of it! You might be aware that it makes mistakes sometimes, but when you use it interactively you don’t really notice them, because you catch and correct them as you go.
Now if LLMs are just as effective as your experience suggests, they are indeed extremely useful and you absolutely should see if they can help you.
It’s only when you attempt to build a product — and it could be one person writing one Python script — that uses LLMs in an automated way with minimal human input that you really get insight into LLMs’ strengths and their limitations. You realize they can be useful, but you sometimes have to baby them a lot (a rough sketch of what that babying looks like is below).
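To make the "babying" concrete, here is a minimal sketch of the kind of guard rails unattended use tends to need. Everything in it (call_llm, extract_fields, the expected JSON keys) is a hypothetical stand-in for whatever client and schema you actually use, not any particular product's code:

    import json

    def call_llm(prompt: str) -> str:
        # Hypothetical placeholder for whatever LLM client you use.
        raise NotImplementedError

    def extract_fields(text: str, max_retries: int = 3) -> dict:
        # Ask the model for structured JSON, then "baby" it:
        # validate the reply and retry when it comes back malformed.
        prompt = (
            "Return ONLY a JSON object with keys 'name' and 'date' "
            "extracted from this text:\n" + text
        )
        for _ in range(max_retries):
            raw = call_llm(prompt)
            try:
                data = json.loads(raw)
                if isinstance(data, dict) and "name" in data and "date" in data:
                    return data
            except json.JSONDecodeError:
                pass  # model ignored the format; ask again
        raise ValueError("model never produced valid output")

Used interactively you would just rephrase and re-ask; in an unattended script, this validate-and-retry loop is the babysitting.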
How many people get to step two? That’s a select few. Most people are stuck in the dreamy phase of trying out interactive LLMs.
This is a recurring issue with all new technology. Heck, it happens with new software frameworks.
The other problem I find is that LLMs are changing so fast that what you evaluated 6-12 months ago might be completely different now with newer models.
So an assessment of strengths and weaknesses can quickly become outdated, as the strengths grow and the weaknesses diminish.
The first batch of LLMs people tried in 2023 had a lot of weaknesses. At the end of 2024, we can see improvements in speed and in the complexity of the output, and people are creating frameworks on top of the LLMs that further increase their value. We went from thousands of tokens of context to millions pretty fast.
I can see myself dividing problems up into 4 groups:
1. LLMs currently solve the problem
2. They don't solve it now, but we are within a couple of iterations of next-generation models or frameworks being able to solve it
3. LLMs are still years away from being able to solve this effectively, so wait and implement it when they can.
4. LLMs will never solve this.
I think a lot of people building products are in group 2 right now.
This definitely resonates, but I'm left wondering why there hasn't been a collective "sobering up" on this front. Not on a personal/team/company level, but just in terms of the general push to cram AI into everything. For how much longer will new AI features assault us in software where they ostensibly won't be that useful?
It seems that the effort required to make an LLM work robustly within a single context (spreadsheet, Word doc, email, whatever) is so gargantuan (honestly) that the returns, or even the initial manpower, wouldn't be there. So any new AI feature feels more or less like bloat, and if not fully useless, then at least a bit anxiety-inducing, in that you have no clue how much you can rely on it.
Very few managers get quick promotions for NOT rolling out a high-visibility AI enhancement. LLMs can theoretically fit into an amazing diversity of products. Even if just 10% of managers say yes and the other 90% say no, that's still a lot of shoehorning every year in an attempt to book a “win” for a promotion.
Totally. And every time someone sobers up, there is a cabal of people saying "we've sunk however many $$$ into this, it's the core feature of the xx roll-out... drink up, the hype party continues, like it or not..." So now you see phenomena like the one-time premier-tier-subscriber-only feature of Copilot on GitHub now pushed to everyone, prompts to use the generative AI in iStock on every page, and compulsory "use Copilot to write your draft" prompts on every new doc in MS Word, because I don't think companies are able to grok the widespread disinterest in much of it. I'm still waiting for one that will be non-networked and sit on my desktop to do my tax returns and haggle with phone company bots.