You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
add new files for conv1dtranspose resource
clean up so that conv code is reached. Still need to get the actual implementation matching keras
implement conv1dtranspose super inefficiently (gets correct answer though)
try to fix indices to make code work
make the c code work for conv1dtranspose
reduce weight dimensions to properly reflect transposed kernel size
clean up so that transpose filter width is passes around from config
fix code such that simple transpose layer gets synthesized
move variables out of loops, optimize slightly and add in alternative method of computation to compute by kernel (that option is not optimized as of now)
add in conv1d transpose linebuffer format code. seems to work, unsure of if it is optimized yet
trying to fix stream behavior
get transpose compilation working mostly as expected. weird jump in latency from reuse 1 to 2 still exists
initial conv2dtranspose addition. Output is permuted as of now.
output in correct order. using large array to buffer output though
fix up conv1dtranspose a bit to pad correctly. fix up stream instructions for both 1d and 2d transposes
fix allowed reuse factors for transpose layers
update to new conv methods for io_parallel. Still some issues with multiple filters as well as some padding issues
clean up error with multiple filters and larger kernels
optimize conv transpose resource to get it working reasonably well. may still have slight optimization left
fix output to conv1d transpose resource
add conv2dtranspose io_parallel implementation. Can still be optimized
small changeup to data storage in conv1d parallel
fix zero padding pass addition for transpose stream layers
move transposing of weight matrix to resource_strategy for transpose layers
change how stream loads in weights to be like parallel for conv transposes. unroll all stride steps completely
fix output of 1d transpose parallel to be faster
change 1d transpose weight input to be 2-dimensional (passed from python code)
change 2d transpose weight input to be 3-dimensional (passed from python code)
small changes to transposes
Revert "fix nondefault project name handling (#626)". The commit breaks the Vivado Accelerator workflow, and the fix is unclear to me right now.
This reverts commit e8f048a.
steps towards getting integer inputs to work
0 commit comments