Lijie Xu

Institute of Software, CAS

SparkInternals 3786

Notes on the design and implementation of Apache Spark

SparkLearning Scala 140

Learning to write Spark examples

MyNotes Python 91

Self-written notes that may be useful

blogs 43

My blogs

SparkFaultBench Scala 12

A Spark Reliability Testing Suite

SparkProfiler Python 9

Profiling Spark Applications for Performance Comparison and Diagnosis

MySlides 9

Self-written slides that may be useful

SparkGC Scala 4 CSS 3

My Homepage

streaming-benchmarks * Java 2

Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...

Spark-core Scala 2

The core library from Spark

SparkBench Scala 2

Spark benchmark

enhanced-Eclipse-MAT Java 1

Extracting framework and user objects from a task's heap dump

Misc 1

Stores miscellaneous things

GraphExamples Scala 1

Examples of GraphX

spark-sql-perf * Scala 1

BenchmarkScripts Java 1

Benchmark scripts in master

FMEM Java 1

A Fine-grained Memory Estimator for MapReduce Jobs

SparkStreamingBench Scala 1

Testing SparkStreaming

MyPaper 1

My papers and technical reports

moa * Java 0

MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.

TR 0

My technical reports

SparkExamples Scala 0

Basic examples for learning Spark

AndroidGCProfiler Python 0

Profiling the GC activities in Android ART JVM * 0

My blog

GellyLearning 0

yahoo-streaming-benchmark * Java 0

An extension of Yahoo's Benchmarks

Mprof Java 0

A Memory Profiler for Diagnosing Memory Problems in MapReduce Applications

spark * Scala 0

Mirror of Apache Spark

hadoop-1.2.0-enhanced Java 0

Enhanced hadoop-1.2.0 by LJX

MemoryEstimator Python 0

MEMR Java 0

OOMJobs Java 0

MapReduce jobs that will cause OOME

GarageBand XML 0

My music project

fdps-vii * Scala 0

Code & data for Fast data processing with Spark V2

public * C++ 0

SailingLab's Petuum project.

parameter_server * C++ 0

A distributed machine learning framework.

OOMCases CSS 0

Real-world OOM cases in MapReduce jobs

Hadoop-0.20.2-LJX Java 0

A part of Hadoop-0.20.2 source code (Some MapReduce Framework related code has been modified by Lijie Xu).

SampleBenchmark Java 0

Hadoop Benchmark - Input splits are first sampled

DMEM Java 0

Dataflow and Memory Estimator for MR Jobs

HadoopJobInfoCollector Java 0

Fetch the configuration, timeline, counters, and log information from the JobTracker

PerformanceAnalysis Java 0

Visualize the metrics obtained from tasks' logs, pidstat, and the JVM

BigDataBenchmark Java 0

Representative MapReduce Job for Hadoop Benchmark

Language Rank Better-than Stars
Scala 17 99.77% 163
Python 1735 98.05% 100
CSS 2578 94.22% 3
Java 11027 86.28% 5
Updated on 2019-11-07 10:02:19