This repository contains the code for "SoMa: Identifying, Exploring, and Understanding the DRAM Communication Scheduling Space for DNN Accelerators". Please follow the instructions in the AE Appendix of our corresponding HPCA 2025 paper.
- First, you will need to install some Python libraries:

  ```bash
  pip install -r requirements.txt
  ```

  or

  ```bash
  conda install --file requirements.txt
  ```
- Optional: we use OpenMP for multi-threaded search. If you do not want this, simply comment out `-fopenmp` in the `Makefile`.
To build the project, run all experiments, and collect the results:

```bash
./build.sh
./run.sh --eta
./get_results.sh
```
After running `build.sh`, you can execute a single experiment using the command:

```bash
./build/soma 108 2 512 1 8 1 8 4 256 512 849779186 results/dse
```

The arguments are, in order:
- Network (`108`): Specifies the neural network to be used (full list below).
- Baseline Type (`2`): Must be `2`; other values are not supported.
- Sequence Length (`512`): Relevant for LLMs; ignored for CNNs.
- Number of Segments (`1`): Used when a network is too large and needs partitioning for scheduling.
- L2 Buffer Size (`8` MB): Defines the L2 buffer size in megabytes.
- Batch Size (`1`): Specifies the number of input samples processed at once.
- DRAM Bandwidth Ratio (`8`): Ratio of DRAM bandwidth (GB/s) to computational power (TOPS). Default is `1`.
- PE Array Dimension (`4`): Typically ranges from `4` to `16`.
- L2 Buffer Bandwidth (`256` GB/s): Specifies the bandwidth of the L2 buffer.
- MAC Units per PE (`512`): Determines the number of multiply-accumulate (MAC) units per PE. TOPS is calculated as `TOPS = 2 * mac_num * PE_ARRAY_Dim^2 / 1024` (see the worked example after this list).
- Random Seed (`849779186`): Used for the random number generator.
- Results Folder (`results/dse`): Specifies where the experiment results are stored.
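As a quick sanity check, here is the arithmetic for the example command above (a minimal worked example using only the parameter values and formulas listed; nothing here is a measured result):

```bash
# TOPS = 2 * mac_num * PE_ARRAY_Dim^2 / 1024
echo $((2 * 512 * 4 * 4 / 1024))   # mac_num=512, PE dim=4  -> 16 TOPS
# DRAM bandwidth (GB/s) = DRAM bandwidth ratio * TOPS
echo $((8 * 16))                   # ratio=8, 16 TOPS       -> 128 GB/s
```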
The full list of supported networks:

- `0`: Darknet19
- `1`: VGG19
- `2`: ResNet50
- `3`: GoogLeNet
- `4`: ResNet101
- `5`: DenseNet
- `6`: Inception-ResNet-V1
- `7`: GNMT
- `8`: LSTM
- `9`: ZFNet
- `10`: Transformer
- `11`: Transformer Cell
- `12`: PNASNet
- `13`: ResNeXt50
- `14`: ResNet152
- `15`: Transformer Big Cell
- `16`: RetinaNet-ResNet50
- `17`: U-Net
- `18`: RandWire Small
- `19`: RandWire Large
- `101`: GPT-J 6B (Decode)
- `102`: GPT-J 6B (Prefill)
- `103`: LLaMa 2 70B (Decode)
- `104`: LLaMa 2 70B (Prefill)
- `105`: BERT Base
- `106`: BERT Large
- `107`: GPT-2 Small (Decode)
- `108`: GPT-2 Small (Prefill)
- `109`: GPT-2 XL (Decode)
- `110`: GPT-2 XL (Prefill)
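For example, to run the same configuration on GPT-2 XL in prefill mode, only the network ID changes to `110`; the remaining arguments below simply reuse the example values above and may need adjusting (e.g., the segment count) for larger models:

```bash
./build/soma 110 2 512 1 8 1 8 4 256 512 849779186 results/dse
```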
For unsupported models, the program will throw an error: `Model not supported`.
The repository is organized as follows:

- Header files for the project.
- Python scripts used for processing and analyzing the experiment results.
- C++ source code implementing the SoMa framework.
Note on GPT: for GPT-2 we actually explore only one block, so the data-processing script multiplies the corresponding latency by the number of blocks. For GPT-2 Small in the DSE experiments, the number of blocks is 12.
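As a minimal illustration of that convention (the single-block latency below is a made-up placeholder, not a measured result):

```bash
# GPT-2 Small has 12 blocks, so the processing script scales the
# single-block latency by the block count.
single_block_ms=3    # hypothetical placeholder value
num_blocks=12
echo $((single_block_ms * num_blocks))   # -> 36 (ms end-to-end)
```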
If you encounter any issues during the AE process, please contact:
Jingwei Cai (Tsinghua University) 1148821791@qq.com
Xuan Wang (Xi'an Jiaotong University, Institute for Interdisciplinary Information Core Technology) wangxuanxjtu@163.com