原文
| ||||||||||||||||||
| ||||||||||||||||||
![]() |
原始链接: https://news.ycombinator.com/item?id=43705991
Bauplan是一个新的数据管道平台,旨在用代码优先的方法取代自定义框架和Notebook。用户可以在云端,基于对象存储,从他们的IDE运行SQL/Python函数。该系统强调版本控制、可组合性、时间旅行能力以及类似Git的分支功能。 该平台具有共同设计的抽象层和运行时,可在FaaS和数据操作方面实现优化,例如比AWS快15倍的函数重建速度。它为人工交互(CLI)和机器交互(SDK)提供了简单的API。开发人员正在寻求对其将数据工程工作流与熟悉的软件开发抽象(如表、函数、分支和CI/CD)对齐的方法的反馈。一位评论者要求提供一个10分钟的YouTube视频演示该产品的功能。
| ||||||||||||||||||
| ||||||||||||||||||
![]() |
Perhaps surprisingly, we decided to co-design the abstractions and the runtime, which allowed novel optimizations at the intersection of FaaS and data - e.g. rebuilding functions can be 15x faster than the corresponding AWS stack (https://arxiv.org/pdf/2410.17465). All capabilities are available to humans (CLI) and machines (SDK) through simple APIs.
Would love to hear the community’s thoughts on moving data engineering workflows closer to software abstractions: tables, functions, branches, CI/CD etc.
reply