Saturday, September 28, 2024

Growing agile ETL flows with Ballerina


sheets:Spreadsheet sheet = test spreadsheetClient->createSpreadsheet(sheetName);
_ = test spreadsheetClient->
   appendValue(sheet.spreadsheetId, ["Product", "Sales", "Date"], {sheetName: workSheetName});
foreach var {product, gross sales, date} in salesSummary {
   _ = test spreadsheetClient->
       appendValue(sheet.spreadsheetId, [product, sales, date], {sheetName: workSheetName});
}

Deploying and testing ETL flows

Growing particular person ETL duties as microservices permits all the ETL movement to be deployed in a Kubernetes cluster. Every ETL process is usually a pod within the Kubernetes deployment, making it potential to extend or lower the variety of pods of particular person ETL duties based mostly on the load. Nevertheless, organizations normally have a number of ETL flows, every with many duties. Moreover, these ETL flows will be owned by totally different groups. Subsequently, it’s essential to have correct CI/CD pipelines, permission fashions, monitoring capabilities, and a number of environments for improvement, testing, efficiency validations, and manufacturing.

Ballerina can work with all widespread CI/CD, monitoring, and deployment applied sciences, making it seamless to combine Ballerina-based ETL flows with a corporation’s present infrastructure. For instance, Ballerina ETL supply code will be maintained in GitHub, CI/CD actions will be carried out utilizing Jenkins, ETL flows will be deployed on Amazon EKS, and the executions will be monitored utilizing Prometheus and Grafana.

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles