
GitHub - oxnr/awesome-bigdata: A curated list of awesome big …
Apache Metron - a platform that integrates a variety of open source big data technologies in order to offer a centralized tool for security monitoring and analysis.
Awesome Open-Source Data Engineering - GitHub
This Awesome List aims at providing an overview of open-source projects related to data engineering. This is a community effort: please contribute …
GitHub - trinodb/trino: Official repository of Trino, the distributed ...
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
big-data-projects · GitHub Topics · GitHub
May 19, 2021 · GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.
big-data · GitHub Topics · GitHub
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in …
GitHub - pawl/awesome-etl: A curated list of awesome ETL …
Talend - "an open source application for data integration job design with a graphical development environment"; N8n - "Free and open fair-code licensed node based Workflow Automation Tool. …
IrBigDta/Awesome-Modern-Open-Source-Data-Engineering
This curated list brings together powerful open-source tools, frameworks, and resources for data engineering 🛠️ and data science 📈. It started with inspiration from pracdata's awesome-open …
Logica: language of Big Data - GitHub
Logica is an open source declarative logic programming language for data manipulation. Logica is a successor to Yedalog, a language created at Google earlier.
Apache Spark - A unified analytics engine for large-scale data
Spark is a unified analytics engine for large-scale data processing. It provides high-level APIs in Scala, Java, Python, and R (deprecated), and an optimized engine that supports …
High Performance Software-Defined Object Storage for Big Data …
High Performance Software-Defined Object Storage for Big Data and AI that supports Amazon S3 and OpenStack Swift - open-io/oio-sds