docandrew · 2025-10-29T19:17:56 1761765476

Hype aside, if you can get an answer to a computing problem, with error bars, in significantly less time, this could be a game changer for workloads where precision just isn’t that important (such as LLMs).

alyxya · 2025-10-29T20:13:16 1761768796

Precision actually matters a decent amount in LLMs. Quantization is used strategically in places that’ll minimize performance degradation, and models are smart enough that some loss in performance still gives a good model. I’m skeptical about how well this would turn out, though it’s probably always possible to remedy precision loss with a sufficiently larger model.

fastball · 2025-10-29T22:06:38 1761775598

LLMs are inherently probabilistic. Things like ReLU throw out a ton of data deliberately.

alyxya · 2025-10-29T22:30:02 1761777002

No, that isn’t throwing out data. Activation functions perform a nonlinear transformation to increase the expressivity of a function. If you do two matrix multiplications without a ReLU in between, your function contains less information than with a ReLU in between.

fastball · 2025-10-30T02:41:53 1761792113

How are you calculating "less information"?

shwaj · 2025-10-30T06:01:28 1761804088

I think what they meant was:

Two linear transformations compose into a single linear transformation. If you have y = W2(W1*x) = (W2*W1)*x = W*x where W = W2*W1, you've just done one matrix multiply instead of two. The composition of linear functions is linear.

The ReLU breaks this because it's nonlinear: ReLU(W1*x) can't be rewritten as some W*x, so W2(ReLU(W1*x)) can't collapse either.

Without nonlinearities like ReLU, many layers of a neural network could be collapsed into a single matrix multiplication. This inherently limits the function approximation that it can do, because linear functions are not very good at approximating nonlinear functions. And there are many nonlinearities involved in modeling speech, video, etc.
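
To make that concrete, here is a minimal sketch in plain Python. The 2x2 weights and the input vector are arbitrary values picked for illustration, not taken from any real model. It checks that two stacked linear layers match a single precomputed matrix, and that inserting a ReLU breaks a property every linear map has, namely f(-x) = -f(x).

```python
# Minimal sketch: tiny hand-picked matrices, no external libraries.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

def matvec(w, x):
    """Apply matrix w to vector x."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]

def relu(v):
    """Elementwise max(0, t)."""
    return [max(0.0, t) for t in v]

# Arbitrary illustrative weights (assumption, not from the thread).
W1 = [[1.0, -1.0], [-1.0, 1.0]]
W2 = [[1.0, 1.0], [0.0, 1.0]]
W = matmul(W2, W1)  # the single collapsed matrix

x = [1.0, 0.0]
neg_x = [-t for t in x]

# Without a nonlinearity, two layers equal one matrix multiply.
two_layers = matvec(W2, matvec(W1, x))
one_layer = matvec(W, x)

def with_relu(v):
    """W2 * ReLU(W1 * v): two layers with a ReLU in between."""
    return matvec(W2, relu(matvec(W1, v)))

# Any linear map satisfies f(-x) == -f(x); the ReLU version does not.
f_x = with_relu(x)
f_neg_x = with_relu(neg_x)
is_odd = f_neg_x == [-t for t in f_x]

print(two_layers == one_layer)  # linear layers collapse
print(is_odd)                   # ReLU network is not linear
```

Since no matrix W can reproduce a function that fails f(-x) = -f(x), the ReLU version cannot be collapsed the way the purely linear one can.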

docandrew · 2025-10-12T05:35:05 1760247305

I had a Palm Pre and really enjoyed this, shame it didn’t make it.

docandrew · 2025-10-10T21:22:05 1760131325

Feels like maybe this is retreading ground covered by Why3's WhyML, but perhaps I’m missing something.

https://www.why3.org/doc/whyml.html

lgas · 2025-10-11T05:03:25 1760159005

Presumably this is aimed at people that want to take advantage of it in Lean.

docandrew · 2025-08-27T17:31:07 1756315867

My favorite diagramming tool hands-down! It’s the only one that’s ever “clicked” for me, I use it all the time.

docandrew · 2025-07-20T02:43:48 1752979428

Maybe other folks’ vibe coding experiences are a lot richer than mine have been, but I read the article and reached the opposite conclusion of the author.

I was actually pretty impressed that it did as well as it did in a largely forgotten language and outdated platform. Looks like a vibe coding win to me.

grumpyprole · 2025-07-20T09:12:22 1753002742

Sure, it did OK with examples that are easily found in a textbook, like drawing a circle.

sixothree · 2025-07-20T03:57:07 1752983827

Here's an example of a recent experience.

I have a web site that is sort of a CMS. I wanted users to be able to add a list of external links to their items. When a user adds a link to an entry, the web site should go out and fetch a cached copy of the linked site. If there are errors, it should retry a few times. It should also capture a single-file MHTML copy as well as a full-page screenshot. The user should be able to refresh the cache, and the site should keep all past versions. The cached copy should be viewable in a modal. The task also involves creating database entities, DTOs, CQRS handlers, etc.

I asked Claude to implement the feature, went and took a shower, and when I came out it was done.

nico · 2025-07-20T04:19:23 1752985163

I'm pretty new to CC and have been using it in a very interactive way.

What settings are you using to get it to just do all of that without your feedback or approval?

Are you also running it inside a container, or setting some sort of command restrictions, or just yoloing it on a regular shell?

sixothree · 2025-07-25T16:29:51 1753460991

So CC has a planning mode. Shift-Tab twice to enter planning mode. I wrote out about a paragraph of text for this and it gave me back a todo list. I said "make it so" and it went and did it.

hammyhavoc · 2025-07-20T04:17:10 1752985030

Let us know how the security audit by human beings on the output goes.

sixothree · 2025-07-25T05:44:08 1753422248

It's really just a personal project for myself. Why else would I add that feature without any guardrails?

catmanjan · 2025-07-20T04:28:59 1752985739

The auditors are using LLMs too!

docandrew · 2025-07-13T16:45:53 1752425153

nginx and Roblox and Redis and Nmap and Neovim and CryEngine … the list goes on

There are a LOT of tools with embedded Lua scripting capabilities.

docandrew · 2025-07-13T16:39:53 1752424793

Not having to put length-1 everywhere is a good thing, actually.

const_cast · 2025-07-14T09:57:17 1752487037

Probably just use a .last method or something. Reads better too.

docandrew · 2025-06-15T10:25:48 1749983148

I think his point is that ORMs (and maybe DBs in general) are used for data persistence by folks who just don’t know any alternative.

TZubiri · 2025-06-15T22:10:16 1750025416

Yes that is my contention.

This Simpsons clip summarizes it in a more poetic style:

https://www.youtube.com/watch?v=2BT7_owW2sU

A common problem I've seen for autodidacts is that, since the advanced and modern concepts outnumber the fundamentals, they often find themselves learning advanced concepts before the basics.

This is especially common in programming with Stack Overflow or AIs, where devs look for the quickest and easiest-to-use solution, sweeping the code and complexity under the rug of the dependency layer so that their own code looks nice and clean.

It's hard to figure out as a beginner that the simplest and most basic solution is 10 lines of POSIX function calls, instead of three lines of "import solution" "setup solution" "use solution".
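
As a rough illustration of what "a few lines of POSIX calls" for persistence might look like, here is a sketch using Python's thin wrappers over open/write/fsync/rename. The function names `save` and `load` and the file name `state.bin` are made up for this example, and real code would need to think harder about error handling and directory syncing.

```python
import os
import tempfile

def save(path, data):
    """Write bytes to a temp file, flush to disk, then atomically rename."""
    tmp = path + ".tmp"
    fd = os.open(tmp, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o644)
    try:
        view = memoryview(data)
        while len(view) > 0:
            written = os.write(fd, view)  # may write fewer bytes than asked
            view = view[written:]
        os.fsync(fd)  # ask the OS to push the data to stable storage
    finally:
        os.close(fd)
    os.replace(tmp, path)  # rename over the old file in one step

def load(path):
    """Read the whole file back as bytes."""
    fd = os.open(path, os.O_RDONLY)
    try:
        chunks = []
        while True:
            chunk = os.read(fd, 65536)
            if not chunk:
                break
            chunks.append(chunk)
        return b"".join(chunks)
    finally:
        os.close(fd)

# Usage: round-trip some bytes through a throwaway directory.
with tempfile.TemporaryDirectory() as d:
    path = os.path.join(d, "state.bin")  # hypothetical file name
    save(path, b"first version")
    save(path, b"hello")
    roundtrip = load(path)

print(roundtrip)
```

Writing to a temporary file and renaming means a reader sees either the old contents or the new ones, never a half-written file.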

docandrew · 2025-03-02T02:07:06 1740881226

Ada for bigger projects, D for quick one-offs and more “scripty” work.

fuzztester · 2025-03-02T23:47:35 1740959255

I had played around with D some time ago, and wrote some small programs in it for fun and learning. I both liked and disliked things about the language.

There was some Russian dev running a systems tech company, I forget his name, living in Thailand, in Koh Samui or a similar place. He used D for his work, which was software products. I came across him on the net and saw a couple of his posts about D.

One was titled "Why D", and the other "D as a scripting language".

I thought both were good.

docandrew · 2025-03-04T00:01:11 1741046471

It’s a little like Go in that it compiles quickly enough to replace scripts while still yielding good enough performance for a lot of systems tasks. It predates Go, and I wish Google had just supported D; it’s a much nicer language IMO.

johnisgood · 2025-03-04T20:15:40 1741119340

What are you using Ada for?

docandrew · 2025-03-04T22:20:40 1741126840

Fun side projects mostly, my GH username is the same as here if you’re (morbidly) curious.

johnisgood · 2025-03-05T06:51:56 1741157516

Will do! :D

I did! A quick thought regarding https://github.com/docandrew/SPARKTLS: you might find https://github.com/Componolit/libsparkcrypto useful too, if you have not come across it already.

Nice projects BTW!

docandrew · 2025-02-15T02:47:09 1739587629

Move to EKS and you still need a k8s engineer, but one who also knows AWS, and you also pay the AWS premium for the hosting, egress, etc. It might make sense for your use case but I definitely wouldn’t consider it a cost-saving measure.