:boom: :rocket: 封装spark读取kafka,sparkstreaming动态调节batch time;封装sparkstreaming 1.6 - kafka 010 用以支持 SSL;封装spark与其他组件;
:boom: :alien: :hotsprings::rocket:Encapsulated spark 与其他组件的结合api,方便使用,例如 es,hbase,kudu,kafka,mq等
kafka flink 例子,flink state 实现wordcount,并写入hbase
Spark Version Test Code.:boom:
java 人脸识别; Face recognition
Recommendation algorithm
SparkMeasure is a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics data.
storm的相关程序。包括kafka,hbase的结合
data Structure Sample Code .
grafana prometheus
Spark权威指南( Spark The Definitive Guide) -中文版翻译项目
rocksdb demo
kafka offset manager ,record to zookeeper
java 如何调用js脚本和python脚本(jython)
Spark RDD to read and write from HBase
rabbitmq consumer or producer util demo
High Performance Kafka Consumer for Spark Streaming. Compatible with every Spark and Kafka versions including latest Spark 2.2.0 and Kafka 0.11.0. Now supports Kafka Security. Offset management in Zookeeper. Reliable No-Dataloss gurantee. No dependency on HDFS or Checkpointing and WAL. In-built PID rate controller. Support Message Interceptor . Offset Lag checker.
Table of contents generator.
PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray.
Write your Spark data to Kafka seamlessly
Avro Data Source for Apache Spark
Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
大数据相关基础知识整理
Spark源码剖析
A tool for managing Apache Kafka.
Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit.ly/2oBJSpP) an Integrated BI platform on Apache Spark.