Cascading Training


Course Summary

The Cascading Enterprise Developer training course gives Java developers the information they need to build applications with Cascading. It covers the Cascading Java API, the modeling framework Cascading works in, key features, benefits, use cases, and best practices. The three-day course is delivered as instructor-led training (ILT) and is split roughly evenly between lecture and hands-on labs that reinforce the material.


Duration

3 days

Intended Audience

The primary audience is Java developers and architects working in enterprises. The course is suitable for Java developers of all levels.


Prerequisites

  • Enterprise Java development experience
  • Development experience using the Eclipse or IntelliJ IDE
  • Familiarity with Gradle
  • Watch the Introduction to Cascading video
  • Before class, students are expected to:
    1. Set up Java SDK 1.6+ and Gradle
    2. Install VirtualBox and Vagrant, which are used to set up a single-node Hadoop cluster
    3. Install Eclipse or IntelliJ as their IDE
  • Bring your own laptop that meets these minimum specifications:
    • 4 GB RAM (8 GB preferred)
    • 5 GB disk space

Topics Covered

  • ETL processes to clean and prepare data
  • Processing data with filters, functions, aggregators and buffers
  • Creating custom filters, functions, aggregators and buffers
  • Merging and splitting data to optimize processing
  • Joining data sets on common fields
  • Best practices for using Cascading
  • Connecting to various data sources
  • Using built-in subassemblies and creating custom ones
  • Cascading implementation architecture
  • Testing and debugging practices
  • Enterprise deployment considerations
  • Hands-on training with 16 labs of practical examples


Price

$2,495 USD
