More

ATechGuy · 2026-02-13T00:57:56 1770944276

Would love to see performance numbers with nested virtualization, particularly that of IO-bound workloads.

ATechGuy · 2026-02-13T00:52:26 1770943946

> a website can expose functions like searchProducts(query, filters) or orderPrints(copies, page_size) with full parameter schemas

How would this not create backend load and abuse?

verdverm · 2026-02-13T06:51:08 1770965468

Why do you believe it to automatically do so?

ATechGuy · 2026-02-12T23:38:15 1770939495

Thanks for your insights!

QQ: does uxwizz also show AI agents visiting websites? have you seen traffic from AI agents?

XCSme · 2026-02-12T23:41:40 1770939700

I currently do not automatically detect the agents themselves, and most of the bot traffic is ignored, but for the traffic that is not ignored it usually shows as a 0s session (so you can filter for all sessions that have a 0s replay, which are usually bots).

I am still torn whether I should actually track bots/AI traffic or simply drop it, maybe I will add a toggle in the setting, maybe it's at least interesting to see how much spam the website gets and coming from where.

XCSme · 2026-02-12T23:42:37 1770939757

Oh, and as for the agents themselves browsing the website, if the agent uses an actual browser with JS enabled (like headless puppeteer), then you would actually be able to see how the agent browsed the website.

ATechGuy · 2026-02-13T00:20:31 1770942031

Yes, but how can you tell if it was an agent with full browser?

XCSme · 2026-02-13T00:25:31 1770942331

If they don't set the proper UserAgent, it's not trivial.

You could add some JS to test it and store it as a tag, maybe using something like: https://stackoverflow.com/a/78629469/407650

ATechGuy · 2026-02-13T00:35:49 1770942949

That's a good start I'd say, but I agree with you that detection is not trivial. I wonder if there's enough value in distinguishing between AI agents (with full browser) and humans. What use cases would it enable?

XCSme · 2026-02-13T13:43:52 1770990232

For the distinguishing part, it's hard to tell if it can be done anyway, as the agent browsers are still new and constantly changing, and it's up to them if they will correctly identify themselves or not (same as with crawler/bots, the main indication is still the source IP address).

There could be use cases, like tracking if your content was stolen/parsed by an AI, maybe future pay-per-request for LLMs, etc.

ATechGuy · 2026-02-04T22:54:50 1770245690

I agree. However, how to define these permissions when agent behavior is undefined?

ATechGuy · 2026-02-03T23:39:01 1770161941

Curious to know what made you DIY this?

ashishb · 2026-02-03T23:40:53 1770162053

Tell me a better alternative that allows me to run, say, 'markdown lint', an npm package, on the current directory without giving access to the full system on Mac OS?

ATechGuy · 2026-02-03T23:50:10 1770162610

sandbox-exec -f curr_dir_access_profile.sb markdownlint

ashishb · 2026-02-04T00:00:21 1770163221

So you have to install npm package markdownlint on your machine and let it run it's potentially dangerous postinstall step?

ATechGuy · 2026-02-04T00:35:08 1770165308

You can customize curr_dir_access_profile.sb to block access to network/fs/etc. Why is this not enough?

ashishb · 2026-02-04T00:52:11 1770166331

Some tools do require Internet access.

Further, I don't even want to take the risk of running 'npm install markdownlint' anymore on my machine.

ATechGuy · 2026-02-04T04:34:12 1770179652

I understand the concern. However, you can customize the profile (e.g., allowlist) to only allow network access to required domains. Also, looks like your sandboxing solution is Docker based, which uses VMs on a Mac machine, but will not use VMs on a Linux machine (weak security).

ashishb · 2026-02-04T05:17:50 1770182270

That's why I wrote my own sandbox. Everyone hand waives these concerns.

Further, I don't know why docker is weak security on Linux. Are you telling me that one can exploit docker?

KurSix · 2026-02-04T15:42:06 1770219726

dockerd is a massive root-privileged daemon just sitting there, waiting for its moment. For local dev it’s often just unnecessary attack surface - one subtle kernel bug or namespace flaw, and it’s "hello, container escape". bwrap is much more honest in that regard: it’s just a syscall with no background processes and zero required privileges. If an agent tries to break out, it has to hit the kernel head-on instead of hunting for holes in a bloated docker API

NamlchakKhandro · 2026-02-10T03:11:09 1770693069

then use podman instead.

ATechGuy · 2026-02-03T23:36:57 1770161817

These are all wrappers around VMs. You could DIY these easily by using EC2/serverless/GCP SDKs.

thundergolfer · 2026-02-04T04:05:32 1770177932

Modal engineer here. This isn’t correct. You can DIY this but certainly not by wrapping EC2 which is using the Nitro hypervisor and is not optimized for startup time.

Nearly all players in this space use Gvisor or Firecracker.

sebmellen · 2026-02-04T06:13:34 1770185614

Do you know Eric Zhang by chance? I went to school with him and saw that he was at Modal sometime back. Potentially the smartest person I’ve ever met… and a very impressive technical mind.

Super impressed with what you’ve all done at Modal!

thundergolfer · 2026-02-04T23:45:48 1770248748

yeh of course I worked with him for a few years! Agree, smartest person I've ever worked with, and there's a smart crowd at Modal.

easton · 2026-02-04T02:43:55 1770173035

You can and can’t, at least in AWS. For instance, you can’t launch a EC2 to a point you can ssh in less than 8-10 seconds (and it takes a while to get EBS to sync the entire disk from s3).

Many a time I have tried to figure a self scaling EC2 based CI system but could never get everything scaled and warm in less than 45 seconds, which is sucky when you’re waiting on a job to launch. These microvm as a service thingys do solve a problem.

(You could use lambda, but that’s limited in other ways).

ATechGuy · 2026-02-04T04:17:18 1770178638

To the commenters here: thanks for correcting me! So AWS is losing AI sandboxing market to GCP due to high cold start times of EC2...very interesting!

ATechGuy · 2026-02-03T20:05:50 1770149150

I will ask what I've asked before: how to know what resources to make available to agents and what policies to enforce? The agent behavior is not predefined; it may need access to a number of files & web domains.

For example, you said: > I don't expose entire /etc, just the bare minimum How is "bare minimum" defined?

> Inspecting the log you can spot which files are needed and bind them as needed. This requires manual inspection.

senko · 2026-02-03T20:45:18 1770151518

Article author here. I used trial and error - manual inspection it is.

This took me a few minutes but I feel more in control of what's being exposed and how. The AI recommended just exposing the entire /etc for example. It's probably okay in my case, but I wanted to go more precise.

On the network access part, I let it fully loose (no restrictions, it can access anything). I might want to tighten that in the future (or at least disallow 192.168/16 and 10/8), for now I'm not very concerned.

So there's levels of how tight you want to set it.

ATechGuy · 2026-02-03T20:52:32 1770151952

> I feel more in control of what's being exposed and how

Makes complete sense. Thanks for your insights!

aflag · 2026-02-03T20:21:40 1770150100

Ask the agent to bubblewrap itself

ATechGuy · 2026-02-03T19:30:36 1770147036

In the last one year, we have seen several sandboxing wrappers around containers/VMs and they all target one use case AI agent code execution. Why? perhaps because devs are good at building (wrappers around VMs) and chase the AI hype. But how are these different and what value do they offer over VMs? Sounds like a tarpit idea, tbh.

Here's my list of code execution sandboxing agents launched in the last year alone: E2B, AIO Sandbox, Sandboxer, AgentSphere, Yolobox, Exe.dev, yolo-cage, SkillFS, ERA Jazzberry Computer, Vibekit, Daytona, Modal, Cognitora, YepCode, Run Compute, CLI Fence, Landrun, Sprites, pctx-sandbox, pctx Sandbox, Agent SDK, Lima-devbox, OpenServ, Browser Agent Playground, Flintlock Agent, Quickstart, Bouvet Sandbox, Arrakis, Cellmate (ceLLMate), AgentFence, Tasker, DenoSandbox, Capsule (WASM-based), Volant, Nono, NetFence

ushakov · 2026-02-03T20:37:58 1770151078

why? because there’s a huge market demand for Sandboxes. no one would be building this if no one would be buying.

disclaimer: i work at E2B

ATechGuy · 2026-02-03T20:47:59 1770151679

I'm not saying sandboxes are not needed, I'm saying VMs/containers already provide the core tech and it's easy to DIY a sandbox. Would love to understand what value E2B offers over VMs?

kommunicate · 2026-02-03T22:08:50 1770156530

making a local sandbox using docker is easy, but making them work at high volume and low latency is hard

ATechGuy · 2026-02-04T04:20:25 1770178825

That's right. But they (E2B) rely on the underneath Cloud infra to achieve high scalability. Personally, I'm still not sure about the value they add on top of Cloud hosted VMs. GCP/AWS already offer huge discounts to startups, which should be enough for VM-based sandboxing of agents in the MVP phase.

ushakov · 2026-02-03T20:56:41 1770152201

we offer secure cloud VMs that scale up to 100k concurrent instances or more.

the value we sell with our cloud is scale, while our Sandboxes are a commodity that we have proudly open-sourced

ATechGuy · 2026-02-03T21:03:21 1770152601

> we offer secure cloud VMs that scale up to 100k concurrent instances or more.

High scalability and VM isolation is what the Cloud (GCP/AWS, that E2B runs on) offers.

kommunicate · 2026-02-03T22:06:48 1770156408

don't forget runloop!

messh · 2026-02-04T01:46:55 1770169615

And shellbox.dev

ATechGuy · 2026-02-03T18:57:39 1770145059

> allowNet: ["api.openai.com", "*.anthropic.com"],

How to know what domains to allow? The agent behavior is not predefined.

CuriouslyC · 2026-02-03T19:24:26 1770146666

The idea is to gate automatic secret replacement to specific hosts that would use them legitimately to avoid exfiltration.

falcor84 · 2026-02-03T19:26:55 1770146815

Well, this is the hard part, but the idea is that if you're working with both untrusted inputs and private data/resources, then your agent is susceptible to the "lethal trifecta"[0], and you should be extremely limiting in its ability to have external network access. I would suggest starting with nothing beyond the single AI provider you're using, and only add additional domains if you are certain you trust them and can't do without them.

[0] https://simonwillison.net/2025/Jun/16/the-lethal-trifecta/

ATechGuy · 2026-01-28T18:30:59 1769625059

What about VMs? They offer strong isolation, as they don't share kernels, and have long been a foundational piece for multi-tenant computing. Then, why would we put an extra layer on top and rebrand it as an AI agent sandboxing solution? I'm genuinely curious what pushes everyone to build their own and launch here Is it one of those tarpit ideas: driven by own need and easy to build?