MapReduce, Spark, Java, and Scala for Data Algorithms Book
Updated Oct 14, 2024 (Java)
BBoxDB is a scalable, highly available, and distributed data store for multi-dimensional big data. The software supports operations like multi-dimensional range queries and spatial joins. In addition, data streams are supported.
Kafka Workers is a client library that unifies consuming records from Kafka and processing them with user-defined WorkerTasks.
Implementation of a partitioning mechanism on Apache Kafka and asynchronous communication between Vert.x microservices
Custom AEMO MMS Data Model CSV reader for Apache Spark
A message queue and broker system similar to Kafka or RabbitMQ.
Amazon Dynamo-style distributed key-value storage with partitioning, replication, and failure handling
Spring Batch job as a Spring Cloud Task
Java Utilities
Spring Batch common components for partitioned jobs
Spring Batch job as a Spring REST service
A tiny embedded Java engine for extremely fast partitioned immutable-after-construction databases
A partitioning algorithm for OWL
A JDBC application that runs queries in pgAdmin to simulate the functionality of an insurance company's database, using Apache Spark RDDs for query implementation.
Spring Data JPA application made to work with partitioned PostgreSQL tables
How many different ways can you find to calculate the sum of an array by multi-threading? Thread, Runnable, Callable, Future, ExecutorService
Command line tool to extract partitions and files from Atari disk images.
CDAP Plugins for Sinks that allow you to specify a list of fields, and leverage the values as partitions in the dataset.
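As a rough illustration of the multi-threaded array sum described above, the sketch below partitions an array into contiguous chunks and sums each chunk as a `Callable` submitted to an `ExecutorService`, then combines the `Future` results. The class name `PartitionedSum`, the method `sum`, and the `parts` parameter are hypothetical names chosen for this example; they are not taken from any of the listed repositories.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical example class, not part of any listed repository.
public class PartitionedSum {

    // Splits `data` into `parts` contiguous chunks and sums the chunks in parallel.
    public static long sum(int[] data, int parts)
            throws InterruptedException, ExecutionException {
        if (parts < 1) {
            throw new IllegalArgumentException("parts must be >= 1");
        }
        ExecutorService pool = Executors.newFixedThreadPool(parts);
        try {
            // Ceiling division so every element falls into some chunk.
            int chunk = (data.length + parts - 1) / parts;
            List<Future<Long>> partials = new ArrayList<>();
            for (int p = 0; p < parts; p++) {
                final int from = Math.min(p * chunk, data.length);
                final int to = Math.min(from + chunk, data.length);
                // Each task sums one half-open range [from, to).
                Callable<Long> task = () -> {
                    long s = 0;
                    for (int i = from; i < to; i++) {
                        s += data[i];
                    }
                    return s;
                };
                partials.add(pool.submit(task));
            }
            // Combine the partial sums; get() blocks until each task finishes.
            long total = 0;
            for (Future<Long> f : partials) {
                total += f.get();
            }
            return total;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        int[] data = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10};
        System.out.println(sum(data, 3)); // prints 55
    }
}
```

Using `Callable` with `Future` lets each partition return its partial sum directly, which avoids shared mutable state between threads; a `Runnable`-based variant would need a synchronized accumulator or per-thread result slots instead.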