Build a web tool that displays the Hacker
News homepage (fetched from the Algolia API)
but filters out specific search terms,
default to "llm, ai" in a box at the top but
the user can change that list, it is stored
in localstorage. Don't use React.
Then four follow-ups:
Rename to "Hacker News, filtered" and add a
clear label that shows that the terms will
be excluded
Turn the username into a link to
https://news.ycombinator.com/user?id=xxx -
include the comment count, which is in the
num_comments key
The text "392 comments" should be the link,
do not have a separate thread link
Add a tooltip to "1 day ago" that shows the
full value from created_at
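For context, a minimal sketch of what the core of such a tool might look like, assuming a comma-separated exclusion box persisted under a hypothetical localStorage key (`excluded`); the real generated app may differ:

```javascript
// Parse a comma-separated exclusion list ("llm, ai") into lowercase terms.
function parseTerms(input) {
  return input.split(',').map(t => t.trim().toLowerCase()).filter(Boolean);
}

// Naive substring filter, as described in the prompt: hide any story
// whose title contains one of the terms anywhere.
function filterStories(stories, terms) {
  return stories.filter(story =>
    !terms.some(term => story.title.toLowerCase().includes(term))
  );
}

// Persist the list in localStorage (guarded so the functions above
// also work outside a browser).
const DEFAULT_TERMS = 'llm, ai';
function loadTerms() {
  const hasStorage = typeof localStorage !== 'undefined';
  return parseTerms((hasStorage && localStorage.getItem('excluded')) || DEFAULT_TERMS);
}
```

In the browser this would be wired to a fetch of Algolia's front-page endpoint (`https://hn.algolia.com/api/v1/search?tags=front_page`), rendering each surviving hit with its `num_comments` value as the link text and the username linking to `https://news.ycombinator.com/user?id=...`.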
That exclusion filter seems to be just a very dumb substring test? Try filtering out "a" and almost everything disappears. That means filtering out "ai" filters out "I used my brain not a computer".
I’ve built a site that does the same sort of exclusion filtering, with a lot more bells and whistles. Very much in the spirit of “what if HN stayed HN, but had actual, very useful features.”
I'm not sure what I'm supposed to see there. From my point of view, this is a low-effort, vibe-coded app that doesn't solve the problem the OP had; it solves a different one. You'd need to at least train a small classifier based on something like BERT to actually address the issue.
What I showed in my comment just demonstrates that this doesn't solve the problem OP had.
Huge respect for all your articles and work on llms, but this example should have been using AI to create a tool that uses AI to intelligently filter hacker news :)
Probably would work better as a userscript, so you don't have to rely on a random personal website never going down just to use HN. I don't have a ChatGPT account, but I am curious whether it could do that automatically too.
Interesting idea, we could consider that as an alternative implementation to https://www.hackernews.coffee/. While we are planning on making it open-source, a userscript would be an even more robust solution, although it would need a personal API key to one of the services.
An interesting example of both LLMs' strengths and weaknesses. It is strong because you wrote a useful tool in a few minutes. It is weak because this tool is strongly coupled to the problem: filtering HN. It's an example of the more general problem of people wanting to control what they see. This has existed at least since the classic usenet "killfiles", but is an area that, I believe, has been ripe for a comprehensive local solution for some time.
OTOH, narrow solutions validate the broader solution, especially if there are a lot of them. Although in that case you invite a ton of "momentum" issues with ingrained user bases (and heated advocacy), hopelessly incompatible data models and/or UX models, and so on. It's an interesting world (in the Chinese curse sense) where such tools can be trivially created. It's not clear to me that fitness selection will work to clean up the landscape once it's made.
Not sure what a local solution would look like when what you see is on websites; maybe a browser extension? We just made a similar reskin as a website, and it works great, but it is ultimately another site you have to go to. It's another narrow solution with some variation (we use AI to do the ranking rather than keyword filtering), but I'm interested in the form factors that might give maximal control to a user.
It is strong because you believed it created something of value. Did it work? Maybe. But regardless of whether it worked, you still believed in the value, and that is the "power" of AIs right now: that humans believe that they create value.
I mean, many people who "hate AI" don't think that LLMs are useless for everything. I'm very unconvinced by e.g. using LLMs for coding, but that they'd be good at tagging content, sentiment analysis, etc.? That's not really hard to believe.
This is neat, but with the given filters you autoselected (just the phrases "llm" and "ai"), of the 14 stories I see when I visit the page, 4 of them (more than 25%!) are still stories about AI. (At least one of them can't be identified by this kind of filtering because it doesn't use any AI-related words in its headline, arguably maybe two.)
This is interesting, but I found it amusing that you wrote "I built..." and "o3 knocked it out in a couple of minutes...", without irony, about a tool to keep us from being inundated with AI/LLM stuff.
One decision I had to make was whether the site should update in real time or be curated only. Eventually, I chose the latter because my personal goal is not to read every new link, but to read a few and understand them well.
I had to switch away from Algolia - the problem is they only model "show items on the homepage" using a tag that's automatically applied to exactly 30 items, which means any filtering knocks that down to e.g. 15.
I switched to using the older firebase API which can return up to 500 item IDs from the current (paginated) homepage - then I fetch and filter details of the first 200.
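A rough sketch of that approach, using Firebase's real `topstories.json` and `item/{id}.json` endpoints (batching and error handling omitted; the actual implementation may differ):

```javascript
const FIREBASE = 'https://hacker-news.firebaseio.com/v0';

// Keep only stories whose titles avoid every excluded term.
function excludeByTitle(stories, terms) {
  return stories.filter(s =>
    s && s.title && !terms.some(t => s.title.toLowerCase().includes(t))
  );
}

// Fetch up to 500 front-page IDs, hydrate the first `limit`, then filter.
async function filteredFrontPage(terms, limit = 200) {
  const ids = await (await fetch(`${FIREBASE}/topstories.json`)).json();
  const items = await Promise.all(
    ids.slice(0, limit).map(id =>
      fetch(`${FIREBASE}/item/${id}.json`).then(r => r.json())
    )
  );
  return excludeByTitle(items, terms);
}
```

In practice you would batch or throttle the 200 item fetches rather than firing them all at once with `Promise.all`.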
AI solving the too-much-AI complaint is heart-warming. We're at the point where we will start demanding organic and free-range software, not this sweatshop LLM one-shot vibery.
I think there is a fundamental disconnect in this response. What the user is asking for is for a procedural and cultural change. What you’ve come up with is a technical solution that kind of mimics a cultural change.
I don’t think it’s wrong, but I also don’t think we can really “AI generate” our way into better communities.
Simonw’s response is the right response. You should not bend the community to your will simply because you do not like the majority of the posts. Obviously many people do like those posts, as evidenced by them making the front page. Instead, find ways to avoid the topics you do not desire to read without forcing your will on people who are happy with the current state.
Let me stop folks early, don’t make comparisons to politics or any bullshit like that. We’re talking only about hacker news here.
Such a tone-deaf response... you're like the biggest enemy of people who want a break from AI/LLM stuff. Even in a thread devoted to filtering out AI, they can't get away from you.
There is literally an input box to put terms you want to exclude...
The prompt asks for "filters out specific search terms", not "intelligently filter out any AI-related keywords." So yes, a good example of the power of vibe coding: the LLM built a tool according to the prompt.
That is not what the prompt I saw above asked for. It took him a few minutes. Write your own with a semantics-based filter instead of a keyword-based filter if that's what you want.
Clearly the US needs a constitutional amendment to preserve the right to keep and bear AI tools. Then we can arm the victims of AI tools with their own AI tools, for self-defense. If we're lucky, AI will send its AI thoughts and AI prayers in carefully calculated quantities.
Add the buzzword when you see a story you don't like. Or settle with it filtering 90% of the AI content and just don't click on whatever remains, I doubt you expect the top story to be interesting to you 100% of the time.
Our brain decodes info based on context and extrapolation. This submission we're commenting on could be about filtering out any data, not just AI stuff: politics, crypto, AI, etc. Or more specific terms like "Trump", "fracking", "bitcoin".
In any of these scenarios, with a tool designed to filter out content based on limited context, when would you ever be perfectly satisfied?
would you like AI to help you build the perfect context-filter model?
And certainly in our anti-politics filter we’d want to include the filtering of stories that promote the extreme political position that tech is somehow detached from politics! (Especially Silicon Valley startup tech that owes so much to the local politics and economy of California).
Which is to say, filtering politics out is absurd, one person’s extreme politics is another’s default view of the universe.
Isn't it enough to bury yourself under the rock? - you want the fact of your having done so concealed from you also? But what about the fact of wanting that?
...Yes? This is how this tool is coded. Machines do what one codes them to do, not what one wants them to do. If you're interested in making a more intelligent tool you can do it. This tool does exactly what @simonw says it does.
A tool was offered that can accomplish what you want, with a very small amount of added effort on your part.
No, you do not have to "stay up to date on AI stories"—if you see one, add the keyword to the list and move on. There are not as many buzzwords as you seem to be implying, anyways.
If you are dissatisfied, you are welcome to build your own intelligent version (but I am not sure this will be straightforward without the use of AI).
Of course I can discern that. I think it sounds stupid and childish, and makes someone appear less intelligent. Overused and misused word. But this is now derailing the thread.
I’m with you here - it’s a completely superfluous word that the young have adopted as some form of belonging ritual. It has no purpose, adds no emphasis and is just poor English masquerading as a statement.
Isn’t knocking out CUDA going to take out a significant chunk of GPGPU stuff with it? I can see wanting to avoid AI stuff, for sure, but I can’t imagine not wanting to hear anything about the high-bandwidth half of your computer…
Not at all. I think you misunderstood the point I was making here.
I think the idea of splitting Hacker News into AI and not AI is honestly a little absurd.
If you don't want to engage with LLM and AI content on Hacker News, don't engage with it! Scroll right on past. Don't click it.
If you're not going to do that, then since we are hackers here and we solve things by building tools, building a tool that filters out the stuff you don't want to see is trivial. So trivial I could get o3 to solve it for me in literally minutes.
(This is a pointed knock at the "AI isn't even useful crowd", just in case any of them are still holding out.)
There's a solid meta-joke here too, which is that filtering on keywords is a bad way to solve this problem. The better way to solve this problem... is to use AI!
So yeah, I'm not embarrassed at all. I think my joke and meta joke were pretty great.
5 prompts? Not impressed. I can give a human (you) one prompt, and then that human will go off, create the site, promote it on social media, read and incorporate feedback, and then discuss potential future iterations.
Perhaps you should add a privacy policy or just release the source rather than assume people will trust your site. Why do you do these demos if you aren't upfront about all the things the LLMs didn't do?
I don't think I need a privacy policy since the app is designed so that nothing gets logged anywhere - it works by hitting the Algolia API directly from your browser, but the filtering happens locally and is stored in localStorage so nobody on earth has the ability to see what you filtered.
You should try to get other people to make your demos is all I'm saying. I don't know why you keep inserting yourself either. Why didn't someone else post the thing you made? Were they waiting for you to do it or do you think people aren't smart enough to do it? I'm just trying to understand why every damned LLM story has to feature you. In what ways could you avoid such a filter of your posts?
let simonw be prolific, lots of people enjoy his content and that is why his comments in the post are always ranked in the top. This animosity isn't constructive.
It shows you the Hacker News page with ai and llm stories filtered out.
You can change the exclusion terms and save your changes in localStorage.
o3 knocked it out for me in a couple of minutes: https://chatgpt.com/share/68766f42-1ec8-8006-8187-406ef452e0...