I think you are describing Xata Agent :) https://github.com/xataio/agent It watches metrics and logs, queries pg_stat_statements for slow queries, then sends Slack notifications with suggestions, including index recommendations, though the suggestions can be more varied than that.
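To give a flavor, the slow-query side boils down to something like this (a sketch, not the agent's exact query; it assumes the pg_stat_statements extension is installed and uses the Postgres 13+ column names):

```sql
-- Top 10 statements by mean execution time, the kind of check an
-- agent can run periodically before suggesting indexes.
SELECT query,
       calls,
       round(mean_exec_time::numeric, 2)  AS mean_ms,
       round(total_exec_time::numeric, 2) AS total_ms
FROM pg_stat_statements
ORDER BY mean_exec_time DESC
LIMIT 10;
```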
We use event triggers both in pgroll [0] and pgstream [1] to capture DDL changes. It's a feature of pgroll that it captures the schema migration history, regardless of whether you make the changes via pgroll or with another tool. And pgstream uses them to put the schema changes into the logical replication stream.
One thing to be aware of is that many Postgres DBaaS providers don't allow event triggers, because they generally require superuser. But RDS and Aurora do support them, we (Xata) support them of course, and I think Supabase is working on adding support for them.
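If you haven't used event triggers before, here's a minimal sketch of DDL capture (illustrative only, not the actual pgroll/pgstream implementation; the `ddl_history` table and `log_ddl` function are made-up names):

```sql
-- Table to record captured DDL commands.
CREATE TABLE IF NOT EXISTS ddl_history (
  id              bigserial PRIMARY KEY,
  occurred_at     timestamptz NOT NULL DEFAULT now(),
  command_tag     text,
  object_type     text,
  object_identity text
);

CREATE OR REPLACE FUNCTION log_ddl() RETURNS event_trigger
LANGUAGE plpgsql AS $$
DECLARE
  r record;
BEGIN
  -- pg_event_trigger_ddl_commands() is only callable from a
  -- ddl_command_end event trigger.
  FOR r IN SELECT * FROM pg_event_trigger_ddl_commands() LOOP
    INSERT INTO ddl_history (command_tag, object_type, object_identity)
    VALUES (r.command_tag, r.object_type, r.object_identity);
  END LOOP;
END;
$$;

-- Creating the event trigger requires superuser on vanilla Postgres,
-- which is why many DBaaS providers block this.
CREATE EVENT TRIGGER capture_ddl ON ddl_command_end
  EXECUTE FUNCTION log_ddl();
```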
I would say it's primarily testing: either testing schema changes or testing your application changes. Copy-on-write branches help because you can spin up an environment fast with relevant data from production (anonymized).
You can try data subsetting tools; a few that I know of are Greenmask, Jailer, and Tonic Condenser, and I think there are more. They are not exactly built for your use case, but they might help a bit. The problem is that most of these tools can walk foreign keys, but only in one direction.
Also, since you said the FKs are not fully implemented, there just won't be enough information for these tools to work with. So you can either work on fixing the foreign keys or write more complex scripts.
Yeah, given the unavailability of FKs, I'd have to somehow map certain ID columns to the tables they refer to, accounting for slight naming convention differences.
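Something like this could be a rough starting point, assuming a `_id` suffix convention and a naive pluralization guess for the target table (both are assumptions; adjust for your actual schema):

```sql
-- Guess FK-like columns from a "_id" suffix and flag the ones that
-- have no declared FOREIGN KEY constraint. The "append s" guess for
-- the target table name is naive and just for illustration.
SELECT c.table_name,
       c.column_name,
       regexp_replace(c.column_name, '_id$', '') || 's' AS guessed_target_table
FROM information_schema.columns c
WHERE c.table_schema = 'public'
  AND c.column_name LIKE '%\_id'
  AND NOT EXISTS (
        SELECT 1
        FROM information_schema.table_constraints tc
        JOIN information_schema.key_column_usage kcu
          ON kcu.constraint_name = tc.constraint_name
         AND kcu.table_schema    = tc.table_schema
        WHERE tc.constraint_type = 'FOREIGN KEY'
          AND kcu.table_schema = c.table_schema
          AND kcu.table_name   = c.table_name
          AND kcu.column_name  = c.column_name
      );
```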
As it happens, we've just launched our new Xata platform (https://xata.io/) which has some of the key Neon features: instant copy-on-write branching and separation of storage and compute. As an extra twist, we can also do anonymization (PII masking) between your production database and developer branches.
The way we do copy-on-write branches is a bit different: we haven't made any modifications to Postgres, but instead do it entirely at the storage layer, which is a distributed system in itself. This also opens up some I/O performance opportunities.
While Xata has been around for a while, we're just launching this new platform, and it is in Private Beta. But we are happy to work with you if you are interested.
Several components are open source as their own projects (see below), which will allow you to reproduce most of the features on top of regular Postgres. But the storage part is not open source. We are considering a simpler implementation of it that would be realistic to self-host and still do copy-on-write branching.
These are the open source components:
* pgstream, for anonymizing data coming from the production branch
I think when people look at Neon, the Aurora-style disaggregated compute/storage architecture, allowing highly scalable read replicas on cloud storage, is the defining feature, and it's the only such project that offers it for Postgres. So the storage part is the point.
The PII masking aspect is very interesting and something we couldn't get when we decided on DBLab a month ago. What does the deployment model within AWS look like?
If you want to deploy the whole platform inside your own AWS account, we have a Bring Your Own Cloud model: https://xata.io/byoc
If you want to get anonymization from your RDS/Aurora instance into Xata branches, then you only run a CLI command (`xata clone`), which does something similar to pg_dump/pg_restore but with masking. It is based on our open source pgstream project.
1. Would you sign a BAA (for HIPAA) for the Pay As You Go plan? I can't find that anywhere on your site, except that Lite is HIPAA compliant (https://lite.xata.io/security).
2. FYI, I couldn't request access via the BYOC form, so I sent an email as instructed by the error message: "There was an error, please try again or contact us at info@xata.io."
Hi @tudorg - do Xata copy-on-write branches work like Neon's, in that you effectively get an isolated Postgres cluster, allowing you to test roles, extensions, creating/dropping DBs, altering schema & data of existing DBs, etc.? I looked in the docs but it wasn't clear to me.
Yes, exactly: you get a new isolated Postgres database, started from what is essentially a snapshot taken at the time of the fork. You can test any kind of change.
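For example, all of the following are plain Postgres statements you could try on a branch without touching production (the `orders` table and the role/database names here are hypothetical):

```sql
-- Role and extension changes, isolated to the branch:
CREATE ROLE reporting_ro LOGIN PASSWORD 'test-only';
CREATE EXTENSION IF NOT EXISTS pg_trgm;

-- Schema changes against a copy of production data:
ALTER TABLE orders ADD COLUMN fulfilled_at timestamptz;

-- Even database-level operations (must run outside a transaction block):
DROP DATABASE IF EXISTS scratch_db;
```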
Do you support HTTP or WebSocket connections like https://github.com/neondatabase/serverless? In my experience Neon is ultra fast that way in serverless environments, on the order of 1-5ms per query including the network roundtrip.
(Disclaimer: I work at Xata.)
Just wanted to mention that we also support anonymization, in case that’s something you're looking into:
https://xata.io/postgres-data-masking
This is great; I like in particular that there are extension plugins. I'll be looking at integrating this into the Xata Agent (https://github.com/xataio/agent) as custom tooling.
Xata.io looks very interesting!!! I was thinking about building an intelligent agent for pg-mcp as my next project, but it looks like you've already done a lot of the hard work. When thinking about the "AI Stack" I usually separate concerns like this: