[Foundation Course] Apache Ignite Essentials: Key Design Principles for Building Data-Intensive Applications

This project is designed for a free instructor-led training on Ignite's essential capabilities and architecture internals. Check the complete schedule and join one of our upcoming training sessions.

Setting Up Environment

  • Java Development Kit (JDK), version 11, 17, or 21
  • Apache Maven 3.0 or later
  • Docker
  • Your favorite IDE, such as IntelliJ IDEA or Eclipse, or a simple text editor

Clone The Project

  1. Clone the training project with Git or download it as an archive:

    git clone https://github.com/GridGain-Demos/ignite-essentials-developer-training.git
  2. (Optional) Open the project in your favorite IDE, such as IntelliJ IDEA or Eclipse, or just use a simple text editor and the command-line instructions prepared for all the samples.

Sign up for GridGain's Nebula service

We'll use the Control Center component to execute SQL queries and view cluster internals.

  1. Open portal.gridgain.com in your browser
  2. Click the "Sign up" button
  3. Enter your details

Starting Ignite Cluster

Start a two-node Ignite cluster:

  1. Open a terminal window and navigate to the root directory of this project.

  2. Open src/main/resources/controlcenter.conf in your IDE or text editor

  3. Update the connector.username and connector.password values to the values you used to create your Nebula account.

  4. Start your nodes using Docker Compose:

    docker compose -f docker-compose.yml up
  5. Switch back to your browser and select "Attach Apache Ignite"

  6. In the "Connector" dropdown, select Ignite Essentials

  7. Enter http://node1:10300 as the URL of the REST API

  8. Click Continue

  9. Click Attach

  10. Initialise the cluster by clicking the Initialise button at the top-right of the screen

Creating Media Store Schema and Loading Data

Now you need to create the Media Store schema and load the cluster with sample data. Use the Ignite CLI tool to achieve that:

  1. Open a terminal window and navigate to the root directory of this project.

  2. Load the media store database:

    a. Start the Command Line Interface (CLI)

    docker run -e LANG=C.UTF-8 -e LC_ALL=C.UTF-8 -v ./config/media_store.sql:/opt/ignite/downloads/media_store.sql --rm --network ignite3_default -it apacheignite/ignite:3.0.0 cli

    b. Connect to the cluster.

    connect http://node1:10300

    c. Execute SQL command to load the sample data.

    sql --file=/opt/ignite/downloads/media_store.sql

Keep the connection open, as you'll use it for the following exercises.

Data Partitioning - Checking Data Distribution

With the Media Store database loaded, you can check how Ignite distributed the records within the cluster:

  1. Switch to your browser and select the "Tables" tab
  2. While on that screen, follow along with the instructor to see how the records are distributed across the partitions and nodes.
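What you see on the "Tables" screen reflects Ignite's partitioning scheme: every key is hashed to one of a fixed number of partitions, and the partitions are spread evenly across the nodes. The core idea can be sketched in a few lines of Java (the real cluster uses the rendezvous affinity function, and the partition count below is an illustrative assumption, not Ignite's actual default):

```java
import java.util.HashMap;
import java.util.Map;

public class PartitionSketch {
    // Illustrative partition count; Ignite's real default differs.
    static final int PARTITIONS = 25;

    // Map a key to a partition the way a hash-based affinity function would:
    // the same key always lands in the same partition, on every node.
    static int partitionFor(Object key) {
        return Math.abs(key.hashCode() % PARTITIONS);
    }

    public static void main(String[] args) {
        // Count how many of 1000 sequential track IDs land in each partition.
        Map<Integer, Integer> histogram = new HashMap<>();
        for (int trackId = 1; trackId <= 1000; trackId++) {
            histogram.merge(partitionFor(trackId), 1, Integer::sum);
        }
        System.out.println("partitions used: " + histogram.size()); // prints "partitions used: 25"
    }
}
```

Because the mapping depends only on the key, every node independently agrees on where a record lives without consulting a central directory.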

Affinity Co-location - Optimizing Complex SQL Queries With JOINs

Ignite supports SQL for data processing, including distributed joins, grouping, and sorting. In this section, you're going to run basic SQL operations as well as more advanced ones.

Querying Single Table

  1. In your browser, select the "Queries" tab

  2. Run the following query to find the 20 longest tracks:

    SELECT trackid, name, MAX(milliseconds / (1000 * 60)) as duration FROM track
    WHERE genreId < 17
    GROUP BY trackid, name ORDER BY duration DESC LIMIT 20;

Joining Two Colocated Tables

  1. Modify the previous query by adding information about the artist and genre. You do this with a LEFT JOIN against the Artist table and a JOIN against the Genre table:

    SELECT track.trackId, track.name as track_name, genre.name as genre, artist.name as artist,
    MAX(milliseconds / (1000 * 60)) as duration FROM track
    LEFT JOIN artist ON track.artistId = artist.artistId
    JOIN genre ON track.genreId = genre.genreId
    WHERE track.genreId < 17
    GROUP BY track.trackId, track.name, genre.name, artist.name ORDER BY duration DESC LIMIT 20;
  2. Try adding the phrase "EXPLAIN PLAN FOR" at the beginning of the above query to see how Ignite will execute it.

  3. Examine the output. Your instructor will give hints for what to look for. It will look something like this:

    Limit(fetch=[20]): rowcount = 20.0, cumulative cost = IgniteCost [rowCount=15318.06, cpu=77499.96615043783, memory=33461.76, io=178134.0, network=101068.0], id = 35293  
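The reason a join like this can run efficiently is affinity colocation: when both sides of the join are colocated on the join key, every matching pair of rows already sits in the same partition on the same node, so each partition joins its own data locally with no network shuffle. A toy sketch of that per-partition join, using hypothetical data and an illustrative partition count (not the Media Store schema or Ignite's actual join engine):

```java
import java.util.*;

public class ColocatedJoinSketch {
    static final int PARTITIONS = 8; // illustrative only

    static int partitionFor(int affinityKey) {
        return Math.abs(Integer.hashCode(affinityKey)) % PARTITIONS;
    }

    // Join tracks to artists purely within each partition. Because both sides
    // are bucketed (colocated) by artistId, no cross-partition lookup is needed.
    static List<String> partitionLocalJoin(Map<String, Integer> trackToArtistId,
                                           Map<Integer, String> artistNames) {
        // Bucket each side by partition, as the cluster would.
        Map<Integer, Map<String, Integer>> trackParts = new HashMap<>();
        Map<Integer, Map<Integer, String>> artistParts = new HashMap<>();
        trackToArtistId.forEach((track, artistId) ->
            trackParts.computeIfAbsent(partitionFor(artistId), p -> new HashMap<>()).put(track, artistId));
        artistNames.forEach((artistId, name) ->
            artistParts.computeIfAbsent(partitionFor(artistId), p -> new HashMap<>()).put(artistId, name));

        List<String> joined = new ArrayList<>();
        for (int p = 0; p < PARTITIONS; p++) {
            Map<String, Integer> localTracks = trackParts.getOrDefault(p, Map.of());
            Map<Integer, String> localArtists = artistParts.getOrDefault(p, Map.of());
            // The local join: every track finds its artist inside the same partition.
            localTracks.forEach((track, artistId) -> {
                String name = localArtists.get(artistId);
                if (name != null) joined.add(track + " by " + name);
            });
        }
        Collections.sort(joined);
        return joined;
    }

    public static void main(String[] args) {
        Map<String, Integer> tracks = Map.of("Track A", 1, "Track B", 2, "Track C", 1);
        Map<Integer, String> artists = Map.of(1, "Artist One", 2, "Artist Two");
        System.out.println(partitionLocalJoin(tracks, artists));
    }
}
```

If the two tables were not colocated, the matching rows could live on different nodes, and the engine would have to move data across the network before joining, which is what the EXPLAIN output helps you spot.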

Running Co-located Compute Tasks

Run training.ComputeApp, which uses the Apache Ignite compute capabilities to calculate the top-5 paying customers. The compute task executes on every cluster node, iterates through that node's local records, and returns a partial result to the application, which merges the partial results into the final answer.

  1. Build an executable JAR with the application's classes (or just start the app with IntelliJ IDEA or Eclipse):

    mvn clean package 
  2. Load the code into your cluster:

    a. Start the CLI.

    docker run -e LANG=C.UTF-8 -e LC_ALL=C.UTF-8 -v ./target/ignite-essentials-developer-training-1.0-SNAPSHOT.jar:/opt/ignite/downloads/ignite-essentials-developer-training-1.0-SNAPSHOT.jar --rm --network ignite3_default -it apacheignite/ignite:3.0.0 cli

    b. Connect to the cluster.

    connect http://node1:10300

    c. Deploy the code to the cluster.

    cluster unit deploy --version 1.0.0 --path=/opt/ignite/downloads/ignite-essentials-developer-training-1.0-SNAPSHOT.jar essentialsCompute
  3. Execute the ComputeApp program with the following command (or run it directly from your preferred IDE):

    mvn exec:java
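The merge step described above, combining each node's partial result into a global top-5, can be sketched as follows (hypothetical data and method names, not the app's actual code):

```java
import java.util.*;
import java.util.stream.*;

public class TopCustomersMerge {
    // Merge per-node partial results (customer -> total spent) into a global top-5.
    static List<Map.Entry<String, Double>> mergeTop5(List<Map<String, Double>> partials) {
        // Sum the per-node totals for each customer, since a customer's
        // purchases may be spread over several partitions.
        Map<String, Double> totals = new HashMap<>();
        for (Map<String, Double> partial : partials) {
            partial.forEach((customer, spent) -> totals.merge(customer, spent, Double::sum));
        }
        // Sort by total spent, descending, and keep the top five.
        return totals.entrySet().stream()
            .sorted(Map.Entry.<String, Double>comparingByValue().reversed())
            .limit(5)
            .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Two nodes each report totals computed over their local records only.
        Map<String, Double> node1 = Map.of("Ann", 40.0, "Bob", 25.0, "Cid", 10.0);
        Map<String, Double> node2 = Map.of("Ann", 5.0, "Dee", 30.0, "Eve", 20.0);
        System.out.println(mergeTop5(List.of(node1, node2))); // Ann leads with 45.0
    }
}
```

The heavy work (scanning records) happens where the data lives; only the tiny partial results cross the network, which is the point of colocated compute.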
