Commit 632c7d0

docs: Update NorthPole blog post description & alt text for images
1 parent 59ab2c2 commit 632c7d0

1 file changed, with 5 additions and 4 deletions
content/blog/northpole-ibm-neuromorphic-ai-hardware/index.md

@@ -1,10 +1,7 @@
  ---
  title: "NorthPole, IBM's latest Neuromorphic AI Hardware"
- description: "Translating the NorthPole paper from IBM to human language."
+ description: "A deep dive into IBM's NorthPole, a brain-inspired AI accelerator. Understand its architecture, 10 core axioms, and how it achieves groundbreaking energy efficiency for neural inference."
  image: cover.png
- aliases:
- - /blog/northpole
- - /blog/northpole-ibm-neural-inference-frontier-energy-space-time
  draft: false
  date: 2023-12-26
  showTableOfContents: true
@@ -59,6 +56,7 @@ When you want to skip zero-computations, you need to introduce a structured appr
  fig
  src="dally-sparsity.png"
  caption="Sparse neural networks support in hardware [[William J. Dally]](https://www.computer.org/csdl/proceedings-article/hcs/2023/10254716/1QKTnGyUPbG)."
+ alt="Sparse neural networks support in hardware"
  >}}

  ## Axiom 2 - Getting inspired by biological neurons
@@ -116,6 +114,7 @@ fig
  src="simd-mac.png"
  width=760px
  caption="The single-instruction-multiple-data MAC unit of NorthPole."
+ alt="The single-instruction-multiple-data MAC unit of NorthPole."
  >}}

  Above it is shown a visual description of how this parallelism is exploited. The total word width is always 8 bit, but more values can be glued together to be processed in parallel in the MAC, which produces more outputs at once for the INT4 and INT2 precisions. This is why in the "Silicon implementation" section of the paper it is written:
@@ -139,6 +138,7 @@ fig
  src="temporal-vs-spatial.png"
  width=760px
  caption="Spatial (left) and temporal (right) architectures."
+ alt="Spatial (left) and temporal (right) architectures."
  >}}

  Eyeriss [[Chen et al.](https://dspace.mit.edu/bitstream/handle/1721.1/101151/eyeriss_isscc_2016.pdf)] proposed this approach and taxonomy in 2016. Field programmable gate arrays (FPGAs) have been doing this since the beginning, with distributed SRAM near the logic or the special purpose macros available on the silicon. I do not know if it is brain-inspired but it makes sense from a silicon perspective if you want to maximize efficiency.
@@ -154,6 +154,7 @@ Take-home message: PEs communicate using dedicated busses, in what is called a n
  fig
  src="northpole-arch.png"
  caption="A snippet of NorthPole architecture [[Modha et al.](https://www.science.org/doi/10.1126/science.adh1174)]"
+ caption="A snippet of NorthPole architecture"
  width=760px
  >}}
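
The hunks above edit the attributes of `fig` shortcode blocks by hand. As a rough sketch that is not part of this commit, a small check along these lines could flag blocks that lack an `alt` attribute or repeat an attribute name. It assumes each block opens with `{{<` (the opening delimiter is not visible in the hunks) and closes with `>}}`, and that attributes are written as `key=value` with either quoted or bare values. The function name `check_fig_shortcodes` and both regular expressions are illustrative only.

```python
import re

# Assumption: a figure block looks like `{{< fig key=value ... >}}`,
# possibly spread over several lines.
FIG_BLOCK = re.compile(r"\{\{<\s*fig\b(.*?)>\}\}", re.DOTALL)

# One attribute: a name, `=`, then a double-quoted or bare value.
# Quoted values are consumed whole, so an `=` inside a URL is not
# mistaken for another attribute.
ATTRIBUTE = re.compile(r'([A-Za-z_][\w-]*)\s*=\s*("(?:[^"\\]|\\.)*"|\S+)')


def check_fig_shortcodes(markdown: str) -> list[str]:
    """Return a list of human-readable problems found in fig blocks."""
    problems = []
    for index, block in enumerate(FIG_BLOCK.finditer(markdown), start=1):
        names = [m.group(1) for m in ATTRIBUTE.finditer(block.group(1))]
        if "alt" not in names:
            problems.append(f"fig #{index}: missing alt")
        for name in sorted(set(names)):
            if names.count(name) > 1:
                problems.append(f"fig #{index}: duplicate {name}")
    return problems
```

Calling `check_fig_shortcodes(text)` on the contents of `index.md` returns an empty list when every block has exactly one `alt` and no repeated attributes, so it could be run as a pre-commit step before a change like this one.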
