Skip to content
#

distributed-training

Here is 1 public repository matching this topic...

Apache-Spark-and-AWS-EMR-Distributed-Neural-Network-Training-and-Sentence-Generation

A scalable deep learning pipeline designed for training sentence-generation neural networks using Apache Spark and DL4J on AWS EMR. The project leverages Spark RDDs for distributed data preprocessing, and runs on cloud infrastructure with S3 input/output support for large-scale NLP tasks

  • Updated May 8, 2025
  • Scala

Improve this page

Add a description, image, and links to the distributed-training topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the distributed-training topic, visit your repo's landing page and select "manage topics."

Learn more