Evals for programming languages with formal verification. It's not clear how far we are from good coding performance in less popular languages in general, and formal verification adds some quirks on top of that.
Good point. The architectural solution that would come to mind is 2D text embeddings, i.e. we add two sets of sines and cosines to each token embedding instead of one. Apparently people have done it before: https://arxiv.org/abs/2409.19700v2
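A minimal sketch of what that could look like (my guess at the scheme, not necessarily what the linked paper does): split the channel dimension in half and encode two coordinates, say line number and column within the line, with a standard sinusoidal encoding each.

```python
import numpy as np

def sincos_1d(positions, dim):
    """Standard 1D sinusoidal encoding: half sines, half cosines."""
    assert dim % 2 == 0
    freqs = 1.0 / (10000 ** (np.arange(dim // 2) / (dim // 2)))
    angles = np.outer(positions, freqs)  # (n_tokens, dim/2)
    return np.concatenate([np.sin(angles), np.cos(angles)], axis=-1)

def sincos_2d(rows, cols, dim):
    """2D variant: each axis gets half the channels."""
    return np.concatenate(
        [sincos_1d(rows, dim // 2), sincos_1d(cols, dim // 2)], axis=-1
    )

# Toy example: 5 tokens laid out on 2 lines.
rows = np.array([0, 0, 0, 1, 1])  # line number of each token
cols = np.array([0, 1, 2, 0, 1])  # position within the line
pos_emb = sincos_2d(rows, cols, dim=512)  # added to token embeddings
print(pos_emb.shape)  # (5, 512)
```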
I think I remember one of the original ViT papers saying something about 2D embeddings on image patches not actually increasing performance on image recognition or segmentation, so it’s kind of interesting that it helps with text!
> We use standard learnable 1D position embeddings, since we have not observed significant performance gains from using more advanced 2D-aware position embeddings (Appendix D.4).
Although it looks like that was just ImageNet, so maybe this isn't that surprising.
They seem to have used a fixed input resolution for each model, so the learnable 1D position embeddings are equivalent to learnable 2D position embeddings where every grid position gets its own embedding. It's when different images may have a different number of tokens per row that the correspondence between 1D index and 2D position gets broken and a 2D-aware position embedding can be expected to produce different results.
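A toy sketch of that equivalence (the grid size and width are just illustrative ViT-ish numbers):

```python
import torch

H, W, dim = 14, 14, 768  # fixed grid: 196 patch tokens per image

# "2D-aware" table: one learnable vector per (row, col) grid cell
pos_2d = torch.randn(H, W, dim)

# "1D" table: one learnable vector per flat token index. At a fixed
# resolution this is just a reshape of the 2D table: same parameter
# count, same expressive power, nothing for the 2D version to gain.
pos_1d = pos_2d.reshape(H * W, dim)

row, col = 3, 7
flat = row * W + col  # bijection holds only because W is fixed
assert torch.equal(pos_1d[flat], pos_2d[row, col])

# With variable-width inputs (W differing per image), the same flat
# index lands on different (row, col) cells, and the equivalence breaks.
```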
At least in this instance, it came from my fleshy human brain. Although I perhaps used it to come off as smarter than I really am - just like an LLM might.