(评论)
(comments)

原始链接: https://news.ycombinator.com/item?id=43705991

Bauplan是一个新的数据管道平台,旨在用代码优先的方法取代自定义框架和Notebook。用户可以在云端,基于对象存储,从他们的IDE运行SQL/Python函数。该系统强调版本控制、可组合性、时间旅行能力以及类似Git的分支功能。 该平台具有共同设计的抽象层和运行时,可在FaaS和数据操作方面实现优化,例如比AWS快15倍的函数重建速度。它为人工交互(CLI)和机器交互(SDK)提供了简单的API。开发人员正在寻求对其将数据工程工作流与熟悉的软件开发抽象(如表、函数、分支和CI/CD)对齐的方法的反馈。一位评论者要求提供一个10分钟的YouTube视频演示该产品的功能。


原文
Hacker News new | past | comments | ask | show | jobs | submit login
Bauplan – Git-for-data pipelines on object storage (bauplanlabs.com)
16 points by barabbababoon 1 hour ago | hide | past | favorite | 2 comments










Looking to get feedback for a code-first platform for data: instead of custom frameworks, GUIs, notebooks on a chron, bauplan runs SQL / Python functions from your IDE, in the cloud, backed by your object storage. Everything is versioned and composable: time-travel, git-like branches, scriptable meta-logic.

Perhaps surprisingly, we decided to co-design the abstractions and the runtime, which allowed novel optimizations at the intersection of FaaS and data - e.g. rebuilding functions can be 15x faster than the corresponding AWS stack (https://arxiv.org/pdf/2410.17465). All capabilities are available to humans (CLI) and machines (SDK) through simple APIs.

Would love to hear the community’s thoughts on moving data engineering workflows closer to software abstractions: tables, functions, branches, CI/CD etc.



I'd love to see a 10 minute YouTube video of the capabilities of this product.






Join us for AI Startup School this June 16-17 in San Francisco!


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact



Search:
联系我们 contact @ memedata.com