Gordon Murray’s Post

View profile for Gordon Murray

Staff AWS Systems Engineer | Modernizing & Rebuilding Cloud Infrastructure | Terraform | Automation | Security

Two years ago I built a small CDC pipeline using Flink and Hudi then mostly forgot about it. I noticed the repo still gets a few regular visits and clones. So tonight I updated it: Flink 1.19.1 and Hudi 1.0.2. Hudi is a good choice if you want your data lake to behave a bit more like a database, that is able to handle updates, deletes, and keep things consistent It’s a complete, working example with Docker Compose, MariaDB CDC, MinIO instead of S3, and a Flink SQL job handling real-time updates. Nice to see it’s still helping a few people out there. Repo: https://lnkd.in/ex7zMpzd

  • No alternative text description for this image

To view or add a comment, sign in

Explore content categories