READEME

How to run

Note: The system's logic and AWS architecture were finalized, but due to time constraints, the code is in beta version and does not run smoothly.

Use the fatpom.xml: change bigpom.xml to pom.xml And pom.xml to originalPom.xml. In the cmd run:
mvn clean package (needed only if new dependencies are added)
mvn dependency:copy-dependencies (needed only if new dependencies are added)
mvn package
Swuch back to originalPom.xml (change it's name to pom.xml)
java -cp target/Assignment1-1.0-SNAPSHOT.jar:target/dependency/* Assignment1.LocalApplication input-sample-1.txt outputFileName.txt n terminate

Runing with Amazon Web Services

We used instance of type T2_NANO with ami: "ami-00e95a9222311e8ed".

System Overview

System Architecture Diagram Diagram as photo

The system consisting of the following components:

1. Local Application

2. EC2 Manager Node

Creates worker nodes, controls the data flow, monitors the number of workers, splits the input into tasks, distributes them to workers and combines the results.

3. EC2 Worker Nodes

4. S3 Storage

5. SQS Queues

The system uses S3 as a common storage and SQS as central method for communication. The design enables the Manager to assign tasks to the Workers by sending messages to a queue. The Workers, during their free time, can then retrieve and process these tasks.

The Manager only performs the simple task of defining the task and placing it in the queue, without needing to find a free worker or manage task distribution. The Workers are not disturbed while working and can easily check if there are additional tasks to process.

System Scalability

The Manager monitors the number of worker nodes and dynamically adjusts their count based on the workload.

The system is built such that all the components work with a common data storage and communicated using common queues. As such, the components do not interact directly resulting The amount of communication is linear in the number of components. This design ensures that the Manager node, or any other component, does not need to communicate with a large number of nodes, making the design scalable.

Additionally, the system uses S3 storage and SQS, which can store an adjustable amount of data and messages.

The main weakness of the design is the Manager, which operates as a single node. However, this weakness is mitigated because the Manager performs only simple and fast tasks. Additionally, it operates with several threads in parallel, enabling it to handle a higher workload.

System Persistence

The system is persistent and can handle the failure of of its components.

-If a worker node fails, it will stop taking new tasks. Additionally, thanks to the Amazon SQS visibility timeout process, the message will reappear in the queue, allowing another worker to take over the task. The Manager monitors the number of worker nodes and will launch a new instance to replace the one that failed.

Both S3 and SQS provide live backups and ensure durability of the data.

Data Durability in Amazon S3 Amazon SQS Documentation

If the manager fails, the data is still secure, but the process will only continue once a new manager node is manually created or the local application is re-runed.

System Decentralization

The system consists of multiple independent components, each operating on separate machines. These components communicate through shared storage (S3) and message queues (SQS), enabling them to function autonomously while exchanging data.

The system operates in parallel as tasks are processed concurrently by multiple Worker Nodes. The workers do not wait for others to finish and can independently retrieve additional tasks from the queue.

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
Resources		Resources
src/main/java/Assignment1		src/main/java/Assignment1
.gitignore		.gitignore
README.md		README.md
bigpom.xml		bigpom.xml
dependency-reduced-pom.xml		dependency-reduced-pom.xml
input-sample-1.txt		input-sample-1.txt
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

READEME

How to run

System Overview

The system consisting of the following components:

1. Local Application

2. EC2 Manager Node

3. EC2 Worker Nodes

4. S3 Storage

5. SQS Queues

System Scalability

System Persistence

System Decentralization

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

AmitNG2000/AWS-Cloud-Computing

Folders and files

Latest commit

History

Repository files navigation

READEME

How to run

System Overview

The system consisting of the following components:

1. Local Application

2. EC2 Manager Node

3. EC2 Worker Nodes

4. S3 Storage

5. SQS Queues

System Scalability

System Persistence

System Decentralization

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages