A Spark SQL extension which provides SQL Standard Authorization for Apache Spark
ACL Management for Apache Spark SQL with Apache Ranger. This library has been contributed to https://github.com/apache/submarine as a sub-module, and that module can still be used individually. The project here will no longer be updated. If you have any questions please go to https://github.com/apache/submarine/tree/master/docs/submarine-security/spark-security/README.md to learn how to use and give feedback to the apache submarine community by following https://submarine.apache.org/community/contributors.html
A library that brings useful functions from various modern database management systems to Apache Spark
PostgreSQL and GreenPlum Data Source for Apache Spark
A Fully HiveServer2-like Multi-tenancy Spark Thrift Server Supporting Impersonation and Multi-SparkContext with Ranger Authorization (GO TO https://github.com/NetEase/kyuubi INSTEAD)
A curated list of awesome Apache Spark packages and resources.
Mirror of Apache Ranger
Python interface to Hive and Presto. 🐝
Scrapy, a fast high-level web crawling & scraping framework for Python.
Spark Library for Hadoop Upserts And Incrementals
Apache Kyuubi is a distributed multi-tenant JDBC server for large-scale data processing and analytics, built on top of Apache Spark
APM, Application Performance Monitoring System
Apache Spark - A unified analytics engine for large-scale data processing
Apache Kyuubi Site
Draft page for acah2021 conference
We get to decide what our story is.
Alerting and monitoring tool for Apache Spark
Apache IoTDB
Apache Parquet
Translating your AngularJS 1.x apps
Apache Iceberg
Free universal database tool and SQL client
Kent Yao' Blog
Godot Engine official documentation
A markdown parser for docutils
Apache Spark Website
JDK main-line development
Submarine is Cloud Native Machine Learning Platform.
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
Davinci is a DVaaS (Data Visualization as a Service) Platform
Wormhole is a SPaaS (Stream Processing as a Service) Platform
Mirror of Apache Zeppelin
NetEase Spark Courses
A simple, in-browser, markdown-driven slideshow tool.