Skip Navigation
I just released my first OSS library! Introducing Aqueducts, a framework to build ETL pipelines using rust
  • I have a couple of years of experience writing functional scala as a backend web dev and switched to doing data engineering 2 years ago. Before that some C/C++ (this is where my rust interest came from).

    I definitely understand the feeling of learning from scratch, I had the same experience learning functional programming but having learnt that made learning rust much easier

  • I just released my first OSS library! Introducing Aqueducts, a framework to build ETL pipelines using rust
  • Oh no worries I am quite new to rust myself but am lucky to be able to use it at work and already got some experience with datafusion and delta-rs :). Accessing postgresql with this is not supported yet but am trying to figure out using OpenDAL for that which should hopefully make it quite easy to implement

  • I just released my first OSS library! Introducing Aqueducts, a framework to build ETL pipelines using rust
  • ETL stands for extract transform and load and it is a widely used architecture for data pipelines where you load some data from different sources (like an S3 or gcs bucket), apply some transformation logic to either aggregate the data or do some other data transformation like changing the schema and then output the result as a different data product.

    These pipelines are then usually run on a schedule or triggered to periodically output data for different time periods to be able to deal with large sets of data by breaking them down into more manageable pieces for a downstream data science team or for a team of data analysts for example.

    What this library is aiming at is to combine the querying capabilities of datafusion which is a query parser and query engine, with the delta lake protocol to provide a pretty capable framework to build these pipelines in a short amount of time. I've used both datafusion and delta-rs for some time and I really love these projects as they enable me to use rust in my day job as a data engineer which is usually a python dominated field.

    However they are quite complex as they cover a wide variety of usecases and this library tries to reduce the complexity using them by constraining them for the use case of building simple data pipelines.

  • I just released my first OSS library! Introducing Aqueducts, a framework to build ETL pipelines using rust
  • Basically yes. The usecases I have found so far at work is to build an API around this to dynamically register automatic reports for data analysts, clients and non devs. In general this also greatly speeds up dev time for any ETL that we need to deploy (am part of a data engineering team). Another usecase I found is that using the CLI tool we can create run books for our SRE team to run queries for debugging/data validation purposes. I think we'll find more as we go but another part of it was to simplify working with datafusion and deltalake as their APIs expose a lot of lower level stuff.

  • I just released my first OSS library! Introducing Aqueducts, a framework to build ETL pipelines using rust
    github.com GitHub - vigimite/aqueducts: Framework to build data pipelines declaratively

    Framework to build data pipelines declaratively. Contribute to vigimite/aqueducts development by creating an account on GitHub.

    GitHub - vigimite/aqueducts: Framework to build data pipelines declaratively

    This is my first try at anything open source so any feedback is welcome :)

    8
    Writing (serverless) data pipelines in rust

    Hey, I held a talk at the Vienna rust meetup in January about how we use rust to write data pipelines in our company. I really enjoy writing ETLs like this so I wanted to share

    1
    InitialsDiceBearhttps://github.com/dicebear/dicebearhttps://creativecommons.org/publicdomain/zero/1.0/„Initials” (https://github.com/dicebear/dicebear) by „DiceBear”, licensed under „CC0 1.0” (https://creativecommons.org/publicdomain/zero/1.0/)KA
    kato @programming.dev
    Posts 2
    Comments 4