No, it was never promised as unlimited — it's always had usage limits: 20× the usage of their regular Pro plan, capped at 50 sessions per month (a session being a 5-hour window), although I don't know whether they ever enforced this.
Location: London, UK
Remote: Yes
Willing to relocate: No
Technologies: TypeScript, React, Next.js, PHP/Laravel, Generative AI, Photoshop/Sketch/After Effects
Website: https://dvy.io
LinkedIn: https://linkedin.com/in/dvyio
Email: david@davidbarker.me
I'm a multidisciplinary designer-developer with deep curiosity and a passion for building intuitive, human-centered products, particularly those leveraging generative AI.
My professional roles have typically involved much more than just coding, spanning product design, strategy, marketing, and customer support. I thrive in small, ambitious teams where I can make a tangible impact.
Outside of work, I've built successful side projects, including:
- Balance, a free web app that anonymously helps people with acute anxiety (https://balance.dvy.io/)
I have a similar domain - https://hackernewsalerts.com - but it's for tracking replies to comments and posts you've made. It's in maintenance mode at the moment; it didn't gather as much interest as I'd hoped. I've open-sourced it.
Very expensive, but I've been using it with my ChatGPT Pro subscription and it's remarkably capable. I'll give it 100,000-token codebases and it'll find nuanced bugs I completely overlooked.
(Now I almost feel bad considering the API price vs. the price I pay for the subscription.)
If you're in the habit of breaking problems down into Sonnet-sized pieces, you won't see a benefit. The win is that o1 pro lets you stop breaking things down one level up from what you're used to.
It may also have a larger usable context window, not totally sure about that.
> lets you stop breaking down one level up from what you're used to.
Can you provide an example of what you mean by this? I provide very verbose prompts where I know what needs to be done and just let AI “do” the work. I’m curious how this is different?
Sonnet 3.7 and O1 Pro both have 200K context windows. But O1 Pro has a 100K output window, and Sonnet 3.7 has a 128K output window. Point for Sonnet.
I routinely put 100K+ tokens of context into Sonnet 3.7 in the form of source code, and in Extended mode, given the right prompt, it will output perhaps 20 large source files before having to make a "continue" request (for example, if it's asked to convert a web app from templates to React).
I'm curious whether O1 Pro actually exceeds Sonnet 3.7 in Extended mode for coding or not. Looking forward to seeing some benchmarks.
I am very curious how 3.7 and o1 pro perform in this regard:
> We evaluate 12 popular LLMs that claim to support contexts of at least 128K tokens. While they perform well in short contexts (<1K), performance degrades significantly as context length increases. At 32K, for instance, 10 models drop below 50% of their strong short-length baselines. Even GPT-4o, one of the top-performing exceptions, experiences a reduction from an almost-perfect baseline of 99.3% to 69.7%.
Has anyone ever tried to restructure a ~10K-token text? For example, structuring a 45-minute to 1-hour interview transcript in an organized way without losing any detailed numbers, facts, or supporting evidence. I find that none of OpenAI's models is capable of this task: they keep trying to summarize and end up omitting details. I don't think such a task requires much intelligence, but surprisingly OpenAI's "large"-context models can't manage it.
There actually were almost no benchmarks for o1 pro before because it wasn't on the API. o1 pro is a different model from o1 (yes, even o1 with high reasoning).
I regularly push 100k+ tokens into it, so most of my codebase or large portions of it. I use the Repo Prompt product to construct the code prompts. It finds bugs and solutions at a far better rate than the others. I also speak into the prompt to describe my problem, and find spoken language is interpreted very well.
I also frequently download all the source code of libraries I'm debugging, and when running into issues, pass that code in along with my own broken code. It's very good.
How long is its thinking time compared to o1?
The naming would suggest that o1-pro is just o1 with more time to reason. The API pricing makes that less obvious. Are they charging for the thinking tokens? If so, why is it so much more expensive if there are just more thinking tokens anyways?
I think o1 pro runs multiple instances of o1 in parallel and selects the best answer, or something of the sort. And you do actually always pay for thinking models with all providers, OpenAI included. It's especially interesting if you remember the fact that OpenAI hides the CoT from you, so you're in fact getting billed for "thinking" that you can't even read yourself.
I don't have the answers for you; I just know that if they charged $400 a month I would pay it. It seems like a different model to me. I never use o3-mini or o3-mini-high, just GPT-4o or o1 pro.
They won't. Your use cases won't be something the AI can't do itself, so why would they sell it to you instead of replacing you with it?
AGI means the value of a human is the same as that of an LLM, but the energy requirements of a human are higher than those of an LLM, so humans won't be economical any more.
Actually, I think humans require much less energy than LLMs. Even raising a human to adulthood would be cheaper, from a calorie perspective, than running an AGI algorithm (probably). It's the whole reason why the premise of The Matrix was ridiculous :)
Some quick back-of-the-envelope math says it would take around 35 MWh to get to 40 years old (at 2000 kcal per day).
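For anyone who wants to check that figure, here's a rough sketch of the arithmetic, using only the assumptions stated above (2000 kcal/day for 40 years):

```javascript
// Rough check of the ~35 MWh estimate.
// Assumptions from the comment above: 2000 kcal/day, 40 years.
// Conversions: 1 kcal = 4184 J, 1 MWh = 3.6e9 J.
const joules = 2000 * 4184 * 365 * 40; // ≈ 1.22e11 J
const mwh = joules / 3.6e9;
console.log(mwh.toFixed(1)); // ≈ 33.9, so ~35 MWh is the right ballpark
```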
I read an article once claiming that an early draft/version, cut for time or narrative complexity, had human brains being used as raw compute for the machines, with the Matrix being the idle process that kept the minds sane and functional for their ultimate purpose.
I've read a file that claimed to be that script; it made more sense for the machines to use human brains to control fusion reactors than for humans to be directly used as batteries.
(And way more sense than how the power of love was supposed to be a nearly magical power source in #4. Boo. Some of the ideas in that film were interesting, but that bit was exceptionally cliché.)
I'd love to read that file. Of course, we're close (really close?) to being able to just ask an LLM to give us a personalized version of the script to do away with whatever set of flaws bother us the most.
One of the ways I experiment with LLMs is to get them to write short stories.
Two axes: quality and length.
They're good quality. Not award winning, but significantly better than e.g. even good Reddit fiction.
But they still struggle with length, despite what the specs say about context length. You might manage the script length needed for a kid's cartoon, but not yet a film.
I'll see if I can find another copy of the script; what I saw was long enough ago that my computer had a PPC chip in it.
Beige proto-iMac. I had a 5200 as a teen and upgraded to either a 5300 or a 5400 for a few years at university — the latter broke while I was there and I upgraded again to an eMac, but I think this was before then.
HA! I used REALbasic a bit back in the day, then spent my time comparing it to LiveCode, back then called Revolution. Geoff Perlman and I once co-presented at WWDC to compare the two tools.
OpenAI doesn’t have the pre-existing business, relationships, domain knowledge, etc to just throw AGI at every possible use case. They will sell AGI for some fraction of what an equivalent human behind a computer screen would cost.
“AGI” is also an under-specified term. It will start out (maybe it's already there) equivalent to, say, a human in an overseas call center, but over time improve to the equivalent of a Fortune 500 CEO or Nobel prize winner.
“ASI”, on the other hand, will just recreate entire businesses from scratch.
There could be something to what you wrote. If AGI were to be achieved by a model, why would they give access to it via an API? Why not just sell what it can do, e.g. business services? That would be far more of a moat.
I do something similar, but with "raw" markdown plus the filename, so all my prompts basically end up like this:
Do blah blah blah while taking blah and blah into account. Here is my current code:
File `file1.js`:
```javascript
console.log('I am number one!')
```
File `file2.js`:
```javascript
console.log("I am number two :(")
```
Not sure if I'm imagining it, but when I tried with and without the markdown code blocks, it seemed to do better when I used them, so I wrote a quick CLI that takes a directory path plus a prompt and creates something like that automatically for me. Often I send identical prompts to ChatGPT, DeepThink, and Claude, compare the approaches, and continue with the one that works best for that particular problem, so having something reusable really saves time.
Edit: fuck it, in case people are curious how my little CLI works, I threw it up here: https://github.com/victorb/prompta (beware of bugs and whatnot, I've quite literally hacked this together without much thought)
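For anyone curious about the general shape without reading the repo, a minimal sketch of that kind of concatenation script could look like the following. This is plain Node and purely illustrative; it is not the actual prompta code, and the .js-only filter and script name here are my own assumptions:

```javascript
#!/usr/bin/env node
// Minimal sketch: given a directory and a prompt, print the prompt followed by
// every .js file as a fenced code block labelled with its filename.
// (Illustrative only; not the actual prompta implementation.)
const fs = require('fs');
const path = require('path');

const [dir, ...promptParts] = process.argv.slice(2);
const prompt = promptParts.join(' ');

// Recursively collect every file path under a directory.
function walk(d) {
  return fs.readdirSync(d, { withFileTypes: true }).flatMap((entry) => {
    const full = path.join(d, entry.name);
    return entry.isDirectory() ? walk(full) : [full];
  });
}

const files = walk(dir).filter((f) => f.endsWith('.js'));

let out = `${prompt}\n\nHere is my current code:\n`;
for (const file of files) {
  out += `\nFile \`${path.relative(dir, file)}\`:\n`;
  out += '```javascript\n' + fs.readFileSync(file, 'utf8').trimEnd() + '\n```\n';
}
process.stdout.write(out);
```

Usage might then look like `node concat.js ./src "Do blah blah blah while taking blah and blah into account"`, with the output pasted into whichever chat UI you're comparing.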
Yeah, like a lightweight version of my prompta CLI :)
What I end up with is one .md file that uses variables like "$SRC", "$TESTS" and "$DOCS" inside it, which get replaced when you run `prompta output`, and then there is also a JSON file that defines what those variables actually get replaced with.
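To make that concrete (the variable names are from the comment above, but the exact template and JSON shape here are only illustrative, not prompta's actual schema), the .md file might look something like:

```markdown
Review the code below and suggest fixes.

Source files:
$SRC

Tests:
$TESTS

Docs:
$DOCS
```

with a JSON file along the lines of `{ "SRC": "src/", "TESTS": "tests/", "DOCS": "docs/" }` telling `prompta output` which paths to expand each variable into.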
Bit off-topic, but I'm curious how your repository ends up with 8023 lines of something for concatenating files, while my own CLI sits at 687 lines (500 of those are Rust) but has a lot more functionality :)
Not OP, but practically all of those lines are from a package-lock.json file (6755 lines) and a changelog (541 lines). It looks like the actual source is 179 lines long.
I tried the web demo (https://repomix.com/) and it seems to generate unnecessarily complex "packs" for no reason, which probably hurts LLM performance too. Why are there "Usage Guidelines" and "File Format" explanations in this, when it's supposed to just be the code, "packed"? Better to just have the contents plus filenames; it'll infer the directory structure and everything else from that.
They may be strange defaults, but both of those are options. Remove the file summary and the directory structure (both featured in the UI and in the CLI tool) and voilà, it's in your "better" state. There are also additional compression options beyond those two tweaks.
Claude 3.5 Sonnet is great, but on a few occasions I've gone round in circles on a bug. I gave it to o1 pro and it fixed it in one shot.
More generally, I tend to give o1 pro as much of my codebase as possible (it can take around 100k tokens) and then ask it for small chunks of work which I then pass to Sonnet inside Cursor.
It disappoints me when otherwise intelligent people take him at his word at this point. Even ignoring his descent into political madness and conspiracy, he's simply not trustworthy.
Fool me once, shame on Elon. Fool me 194 times, shame on me.
They appear to have removed reference to this 50-session cap in their usage documents. (https://gist.github.com/eonist/5ac2fd483cf91a6e6e5ef33cfbd1e...)
So even the mystery people Anthropic references, who supposedly ran it "in the background, 24/7", would still have had to stay within the usage limits.