maximus93's comments

maximus93 · on Dec 17, 2024

1. Fuelling Adaptive AI Models

Real-time data allows AI systems to respond instantly to dynamic changes, improving accuracy in areas like fraud detection, predictive maintenance, and personalised recommendations.

2. Breaking Data Silos

Data activation integrates fragmented data across platforms enabling AI to access rich, unified inputs for better insights and cross functional decision making.

3. Turning Insights into Action

Reverse ETL moves AI-generated insights from data warehouses back into operational tools (like CRMs) ensuring predictions drive real-world actions.

What other ways does data activation empowers AI Models?

maximus93 · on Dec 6, 2024

Prompt stuffing: Quick, dirty, and like cramming for an exam, works until it doesn’t.

Fine tuning: Great if you have static data and deep pockets for compute.

RAG inclusive Vector DB: The gold standard. Think of it as having your data whisper the answers to the LLM.

With AI Squared, you can keep your data fresh, dynamic, and external because nobody wants to retrain a model every time the boss changes their mind. :D

maximus93 · on Dec 5, 2024

Egoless engineering is such a great mindset—focusing on team goals over individual credit really makes collaboration smoother and ideas stronger. It’s amazing how much better things get when everyone’s just working toward the best solution, not personal recognition. Definitely something more teams should embrace!

maximus93 · on Dec 5, 2024

DuckDB really seems to be having its moment—projects like Evidence and DuckDB GSheets are super cool examples of its potential. And yeah, Postgres’s longevity is insane, it just keeps adapting.

On the AI front, vector databases like Pinecone and pgvector are exciting, but I’d love to see something even more integrated with AI workflows. The possibilities are huge. Curious to hear what others think!

vjerancrnjak · on Dec 5, 2024

I like Clickhouse more.

Unrelated, not sure if it is just me, but ever since LLMs became popular, I've been seeing an enormous amounts of special utf8 characters no one used regularly, like this em dash you used.

How is this simple to type? If you're on a phone keyboard, you have to switch to special characters, and then have to long-hold the dash and then slide to em dash.

arglebarnacle · on Dec 5, 2024

On a full keyboard it’s not too bad—just hold alt and tap 0151 on the numpad. Honestly I wish it was harder to type for stylistic reasons—it would help cure me of my em-dash addiction

vjerancrnjak · on Dec 5, 2024

Haha, you haven't used an em dash at all. You even used -- in some of your comments.

LLMs are everywhere.

airstrike · on Dec 6, 2024

- — are built into Mac OS with Opt+- and Opt+Shift+-. I use them all the time

there are also bindings for bullets • and probably other things I'm forgetting (or that may be conflicting with other bindings I have setup)

conductr · on Dec 5, 2024

I noticed that certain browsers started auto converting a double hyphen to an emdash as I type, no LLM needed, I think that’s just a timing coincidence

cfiggers · on Dec 5, 2024

Hi, not the person you asked, but I have an answer to the question.

I have an AutoHotkey script that auto-replaces "--" with "—" (and auto-disables it in contexts where it's likely intended, like SQL editors and my terminal).

I also auto-replace "`-" with "–" so I can conveniently do number ranges with the (objectively correct) n-dash to indicate a range.

elcritch · on Dec 5, 2024

On macs you can do `alt` and `-` to get a –. Even on a phone a proper emdash can add effect over a regular dash.

dhosek · on Dec 5, 2024

Mac people call it option, not alt. alt-minus gives – and alt-shift-minus gives —. Certainly, it’s much easier than the windows enter a numeric code thing which seems insane.

elcritch · on Dec 6, 2024

I've been a Mac guy for a long time. Guess calling it alt instead of option is just a hard habit to break. ;)

alwa · on Dec 6, 2024

My iPhone autocorrects two consecutive hyphens to an em dash. I’m fairly sure it’s not something I configured manually, so I assume it is or was a default. Possibly a component of the “smart punctuation” setting for the keyboard.

tejaskumar_ · on Dec 6, 2024

> I’d love to see something even more integrated with AI workflows

Do you mean a database still? Or something like Langflow or Dify? Curious what "something even more integrated" would look like as just a DB.

aravindputrevu · on Dec 6, 2024

Oh I think there are better vector stores than pinecone. For ex Marqo or extreme case Elasticsearch

tejaskumar_ · on Dec 6, 2024

What makes them better? Have we tried Astra DB or Milvus? Curious where that stands in relation to the others.

boundlessdreamz · on Dec 6, 2024

This comment was most likely generated using AI It is reusing phrases from previous comments -

https://news.ycombinator.com/item?id=42330710 and https://news.ycombinator.com/item?id=42330639

maximus93 · on Dec 2, 2024

Low code tools can indeed be effective for simple use cases or prototyping, but their limitations often surface when scaling or customizing is needed. As others have pointed out, a framework like Rails offers the flexibility to expand and adapt while maintaining structure.

Building the Multiwoven product based on Rails has been incredibly helpful in balancing rapid development with the ability to scale and customize as user demands evolve. It provides a structured yet flexible foundation, allowing us to adapt quickly without compromising on quality.

It’s about knowing when to leverage low code for speed and when to transition to more robust solutions for long-term scalability.

maximus93 · on Nov 22, 2024

Honestly, I think it’s somewhere in between. LLMs are great at spotting patterns in data and using that to make predictions, so you could say they build a sort of "world model" for the data they see. But it’s not the same as truly understanding or reasoning about the world, it’s more like theyre really good at connecting the dots we give them.

They dont do science or causality theyre just working with the shadows on the wall, not the actual objects casting them. So yeah, they’re impressive, but let’s not overhype what they’re doing. It’s pattern matching at scale, not magic. Correct me if I am wrong.

maximus93 · on Nov 21, 2024

Multiwoven is an open-source Reverse ETL platform that simplifies data activation for businesses of all sizes.

Tech Stack:

Backend: Ruby Frontend: React Infrastructure: Docker, Kubernetes

Areas Needing Help:

Documentation: Enhancing user and contributor guides. Code: Developing new connectors and improving existing ones. Design: Refining the UI/UX of our platform.

Level:

Beginner-Friendly: Documentation improvements and minor code enhancements. Advanced: Building new connectors and optimizing data pipelines.

Get Involved:

GitHub: (https://github.com/Multiwoven/multiwoven/) Join Our Community: Slack (https://join.slack.com/t/multiwoven/shared_invite/zt-2bnjye2...) We welcome contributors passionate about data integration and open-source development. Feel free to reach out if you're interested!

maximus93 · on Nov 21, 2024

Great discussion here! At AI Squared, we have also been exploring the evolving landscape of stream processing and SQL engines. While batch engines like DataFusion excel at handling static data, we recognize the challenges around integrating streaming capabilities and infrastructure seamlessly.

Our focus has been on simplifying data activation pipelines with tools like Multiwoven, which aims to bridge the gap between static and dynamic data needs by supporting connectors for both traditional databases and real-time platforms like Kafka. However, the need for more embedded, developer-friendly streaming solutions is clear, and it’s exciting to see the progress in projects like Arroyo, Materialize, and ClickHouse.

For us, the balance lies in usability and flexibility—how can we empower teams to embed robust data capabilities (whether streaming or batch) into their workflows without overloading on infrastructure complexity? As this ecosystem evolves, we’re optimistic about collaborating and contributing to solutions that make streaming SQL as accessible as traditional SQL.

Looking forward to seeing how this space develops—and kudos to the teams pushing boundaries! https://github.com/Multiwoven/multiwoven/