Diagram:
Monitor Notes:
- Kong and OSS (MinIO) instances are all in their own Docker containers.
- There are four Kong instances and one virtual IP. We are focusing on one Kong instance attached to OSS-3.
- Kubernetes-7-out/in: Traffic through the physical network interface of the node.
- Kubernetes-6-out: Request traffic.
- Kubernetes-6-in: Actual download traffic.
- Kong-MinIO in: Inbound flow to the Kong-MinIO container.
- Kong-MinIO out: Outbound flow from the Kong-MinIO container.
- The other OSS instances sit behind the other Kong instances.
There are four Kong instances, each attached to a MinIO instance. Each MinIO instance holds either a data block or a parity block; in total there are 2 data blocks and 2 parity blocks. Because a MinIO block contains only half of the data (due to EC:2), when a client request comes in, the local MinIO instance (call it MinIO-A) must retrieve the missing half from at least one other MinIO block. The bandwidth of a Kong instance is therefore used not only for responding to client requests but also for shipping data from one MinIO block to another. We want to know what proportion of a Kong instance's total output is spent on communication between MinIO blocks, so we set up an experiment and made calculations based on certain assumptions.
Objective:
Determine the system's upper bounds and the proportion of Kong's outbound traffic that is dedicated to client responses.
Assumptions:
- Data blocks and parity blocks are evenly distributed (the statistics of each MinIO container's output are nearly the same).
- The Kong instance we pick always holds half of the data locally and only needs to fetch the other half from one other MinIO block.
- For the calculation, use a simplified model: three Kong instances, each attached to a MinIO instance, with 2 data blocks and 1 parity block in total (EC:1). Since a MinIO block still contains only half of the data, when a client request comes in, the local MinIO block must retrieve the other half from at least one other MinIO block.
Because file size, transfer speed, and time cancel out of the ratio, these parameters are not critical and can be chosen freely. Under these assumptions, the proportion of Kong's outbound traffic dedicated to client responses is about 3/4: each of the three Kong instances sends one full file to its client (1 + 1 + 1 = 3 file units), while inter-MinIO communication adds two half-file transfers (1/2 × 2 = 1 file unit), giving 3 / (3 + 1) = 3/4. We set up an experiment to verify these assumptions.
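A quick back-of-the-envelope check of this ratio, as a minimal sketch under the simplified three-instance assumption above (file size is arbitrary since it cancels out):

```python
# Client-response share of Kong outbound traffic under the simplified
# 3-instance model (2 data blocks + 1 parity block, EC:1).
# File size is arbitrary -- it cancels out of the ratio.
FILE = 1.0  # one file unit

# Each of the 3 Kong instances returns one full file to its client.
client_out = 3 * FILE               # 1 + 1 + 1 = 3

# Inter-MinIO traffic: the local block holds half the data and fetches
# the other half, so two half-file transfers cross between blocks.
minio_out = 2 * (FILE / 2)          # 1/2 * 2 = 1

total_out = client_out + minio_out  # 4
print(client_out / total_out)       # 0.75 -> the ~3/4 proportion
```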
Experiment Configuration:
- Bandwidth: 1 Gbps (≈128 MB/s)
- File size: 4 × 500 MB files
- Total number of requests: 900
- Number of concurrent connections: 30
- Use a stress-testing tool (go-stress-testing) to send all 900 requests in a single run
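For reference, a minimal Python sketch of the same load pattern (900 requests over 30 concurrent connections); the URL and object path are placeholders, and the experiment itself used go-stress-testing:

```python
# Minimal sketch of the load pattern: 900 GET requests, 30 concurrent.
# The URL is hypothetical -- substitute the Kong virtual IP and the
# actual bucket/object path used in the experiment.
import concurrent.futures
import requests

URL = "http://KONG_VIP:8000/bucket/file-500mb.bin"  # placeholder
TOTAL_REQUESTS = 900
CONCURRENCY = 30

def download(i: int) -> bool:
    try:
        with requests.get(URL, stream=True, timeout=300) as resp:
            resp.raise_for_status()
            for _ in resp.iter_content(chunk_size=1 << 20):
                pass  # drain the body to generate real download traffic
        return True
    except requests.RequestException:
        return False  # failed requests explain Kong-MinIO in/out gaps

with concurrent.futures.ThreadPoolExecutor(max_workers=CONCURRENCY) as pool:
    results = list(pool.map(download, range(TOTAL_REQUESTS)))

print(f"ok={sum(results)} failed={results.count(False)}")
```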
Observations and Discrepancies:
- The ratio of outbound traffic across the MinIO blocks differs every run.
  - OSS0 and OSS3 send more because they store the actual data blocks.
  - OSS1 and OSS2 send less because they only store parity blocks.
  - The theoretical ratio is 5:2:2:5 because:
    - OSS0 and OSS3 each handle 3 outgoing requests from other OSS nodes (each node requests half of the data file) plus 1 internal request (the full data file): 3 × 1/2 + 1 = 5/2 file units.
    - OSS1 and OSS2 each handle only 1 internal request (the full data file): 1 file unit.
    - Scaling 5/2 : 1 : 1 : 5/2 to whole numbers gives 5:2:2:5.
This yields an effective service-bandwidth ratio of 8/11, so the 1 Gbps link (123 MB/s measured) translates to an actual client throughput of about 89 MB/s. However, the actual traffic distribution is not always 5:2:2:5; this variability might be due to uneven load balancing by Kong. Additionally, we are currently using only the Kong instance attached to OSS3. If we used a Kong instance attached to a parity block instead, we could achieve higher actual throughput, as that node would not also be sending data to other MinIO blocks.
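To make the arithmetic explicit, a small sketch reproducing the theoretical ratio and the throughput estimate from the numbers above:

```python
# Theoretical outbound traffic per OSS node, in file units.
# Data blocks (OSS0, OSS3): 3 half-file shard transfers + 1 full file.
# Parity blocks (OSS1, OSS2): 1 full file (internal request) only.
data_block = 3 * 0.5 + 1     # 2.5 file units
parity_block = 1.0           # 1 file unit

units = [data_block, parity_block, parity_block, data_block]
print([int(u * 2) for u in units])  # [5, 2, 2, 5] -> the 5:2:2:5 ratio

# Effective service bandwidth: 8/11 of the measured 123 MB/s link limit.
print(round(123 * 8 / 11, 1))       # 89.5 -> ~89 MB/s actual throughput
```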
- In some cases, the outbound and inbound flows of Kong-MinIO do not match.
  - Cause of the discrepancy: failed requests from the go-stress-testing tool. Using another stress-testing tool could resolve this problem.
Based on the monitor screenshot, kubernetes-6-in represents the downloaded file (the response to the client) and kubernetes-7-out represents the total output of the node. The observed device limit is 123 MB/s, close to the 128 MB/s theoretical maximum, so the results align with our calculations. The proportion of Kong's outbound traffic dedicated to client responses is approximately 85 MB/s out of 123 MB/s (≈0.69), which is close to 3/4 and slightly below the theoretical upper bound.
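Putting the measured and theoretical numbers side by side (values taken directly from the monitor readings above):

```python
# Observed client-response share vs. the theoretical bounds.
observed = 85 / 123        # measured: client responses / node output
print(round(observed, 3))  # 0.691
print(3 / 4)               # 0.75  -> simplified-model proportion
print(8 / 11)              # 0.727 -> effective service-bandwidth ratio
```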
Possible limitations:
- DNS round-robin may not distribute traffic evenly
- It is not possible to assign weights to DNS entries
Layer 4 load balancing: ref
Comparison of Using Different Stress-Testing Tools:
Tools to Consider:
- Python custom script
- go-stress-testing
- Apache Bench
- JMeter
A simplified comparison is shown in the image below; the detailed version is in comparison_of_ST_tools.md.