About 1,330,000 results
Open links in new tab
  1. GitHub - oxnr/awesome-bigdata: A curated list of awesome big …

    Apache Metron - a platform that integrates a variety of open source big data technologies in order to offer a centralized tool for security monitoring and analysis.

  2. Awesome Open-Source Data Engineering - GitHub

    Awesome Open-Source Data Engineering This Awesome List aims at providing an overview of open-source projects related to data engineering. This is a community effort: please contribute …

  3. GitHub - trinodb/trino: Official repository of Trino, the distributed ...

    Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io) trino.io java distributed-systems data-science sql database big …

  4. big-data-projects · GitHub Topics · GitHub

    May 19, 2021 · GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.

  5. big-data · GitHub Topics · GitHub

    3 days ago · The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in …

  6. GitHub - pawl/awesome-etl: A curated list of awesome ETL …

    Talend - "an open source application for data integration job design with a graphical development environment" N8n - "Free and open fair-code licensed node based Workflow Automation Tool.

  7. IrBigDta/Awesome-Modern-Open-Source-Data-Engineering

    This curated list brings together powerful open-source tools, frameworks, and resources for data engineering 🛠️ and data science 📈. It started with inspiration from pracdata's awesome-open …

  8. Logica: language of Big Data - GitHub

    Logica: language of Big Data Logica is an open source declarative logic programming language for data manipulation. Logica is a successor to Yedalog, a language created at Google earlier.

  9. Apache Spark - A unified analytics engine for large-scale data

    Apache Spark Spark is a unified analytics engine for large-scale data processing. It provides high-level APIs in Scala, Java, Python, and R (Deprecated), and an optimized engine that supports …

  10. High Performance Software-Defined Object Storage for Big Data …

    High Performance Software-Defined Object Storage for Big Data and AI, that supports Amazon S3 and Openstack Swift - open-io/oio-sds