This dataset was created to simulate the occurrence of cold start in serverless computing. For this, the HealthFaaS framework [1] using a heart disease risk detection scenario is deployed in a serverless environment. To create the dataset, a real working environment was simulated and all requests were sent between 08:00 and 18:00 for 6 days. Triggers for the workload were created using Apache J Meter in HTTP format. The cold start dataset, consisting of 1728 data, contains the information described below.
-
DateTime: Shows the time the requests reach the server. It is in dd/mm/yy hh/mm/ss data format.
-
Hour: Shows the hour of the timestep in DateTime.
-
Day: Displays the current business day of the DateTime (1 - Monday, 2- Tuesday, 3- Wednesday, 4- Thursday, 5- Friday, 6- Saturday )
-
Latency: It indicates the time it takes for the request sent from the client to return after reaching the server. Therefore, the function includes execution time and cold start time. As a result of the observations, a cold start occurs if there are no requests to the server for more than 15 minutes or if more than 200 simultaneous requests come to the server. (In GCP Cloud Functions, containers are kept warm for up to 15 minutes to prevent cold start [2]. Sudden increases in application load may cause a cold start [3].)
-
Request: Shows the number of simultaneous requests sent to the Server. It varies between 1 - 250.
-
CPU Usage: This shows the percentage of RAM usage on the server to respond to the request sent by the client.
-
Ram Usage: This shows the percentage of memory usage on the server to respond to the request sent by the client.
Environment Parameters for GCP-Cloud Functions are as follows:
Name | Character |
---|---|
Trigger Type | HTTP |
RAM | 256Mb |
Software Language | Python 3.11 |
Region | Europe - west2b |
Figure 1. The Cold Start Dataset for All Variables
Figure 2. The Cold Start Dataset as Latency Variable
Figure 3. The Cold Start Dataset as Memory(RAM) Usage Variable
Figure 4. The Cold Start Dataset as CPU Usage Variable
In Figs 1,2,3 and 4, the cold start dataset is shown according to all variables, memory (RAM) and CPU Usage, respectively.
This dataset is part of the following publication, please cite when using this dataset:
M. Golec et al., "ATOM: AI-Powered Sustainable Resource Management for Serverless Edge Computing Environments," in IEEE Transactions on Sustainable Computing, doi: 10.1109/TSUSC.2023.3348157.
1- Golec, Muhammed, et al. "HealthFaaS: AI-based Smart Healthcare System for Heart Patients using Serverless Computing." IEEE Internet of Things Journal (2023).
2- Shilkov, Author: Mikhail. “Comparison of Cold Starts in Serverless Functions across AWS, Azure, and GCP.” Mikhail Shilkov, mikhail.io/serverless/coldstarts/big3/. Accessed 9 Oct. 2023.
3- Shahrad, Mohammad, et al. "Serverless in the wild: Characterizing and optimizing the serverless workload at a large cloud provider." 2020 USENIX annual technical conference (USENIX ATC 20). 2020.
BSD-3-Clause. Copyright (c) 2023, Muhammed Golec. All rights reserved.
See the License file for more details.