
Wrong Decoder RNN Architecture #13

@Ar-Kareem

Description


In the decoder, a plain LSTM is used:

```python
class DecoderRNN(nn.Module):
    def __init__(self):
        super(DecoderRNN, self).__init__()
        # to init hidden and cell from z:
        self.fc_hc = nn.Linear(hp.Nz, 2*hp.dec_hidden_size)
        # unidirectional lstm:
        self.lstm = nn.LSTM(hp.Nz+5, hp.dec_hidden_size, dropout=hp.dropout)
```

However, the original paper's description of the architecture on page 6 states:

For the decoder RNN, we use HyperLSTM, as this type of RNN cell excels at sequence generation tasks

This refers to a very different LSTM variant, one that generates its own weights anew for every element of the sequence. The model is defined in the HyperNetworks paper (Ha et al.), with implementation details in its Appendix Sections 2.2 and 2.3.
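To make the difference concrete, here is a minimal sketch of the HyperLSTM idea: a small auxiliary LSTM watches the input and the main hidden state, and its state is projected into per-timestep multiplicative gains on the main cell's gate pre-activations. This is a simplified illustration, not the repo's code or the paper's full recipe (the paper also modulates biases and uses layer norm); all names here (`HyperLSTMSketch`, `hyper_size`, etc.) are illustrative.

```python
import torch
import torch.nn as nn

class HyperLSTMSketch(nn.Module):
    """Simplified HyperLSTM: an auxiliary LSTMCell produces a per-timestep
    embedding that rescales the main cell's gate pre-activations, so the
    effective weights change at every step of the sequence."""
    def __init__(self, input_size, hidden_size, hyper_size=32):
        super().__init__()
        self.hidden_size = hidden_size
        # main cell weights, 4 gates (i, f, g, o) stacked
        self.w_x = nn.Linear(input_size, 4 * hidden_size)
        self.w_h = nn.Linear(hidden_size, 4 * hidden_size, bias=False)
        # auxiliary "hyper" cell that watches [x_t, h_{t-1}]
        self.hyper = nn.LSTMCell(input_size + hidden_size, hyper_size)
        # projects the hyper state to multiplicative gains on the gates
        self.gain = nn.Linear(hyper_size, 4 * hidden_size)

    def forward(self, x):
        # x: (seq_len, batch, input_size)
        batch = x.size(1)
        h = x.new_zeros(batch, self.hidden_size)
        c = x.new_zeros(batch, self.hidden_size)
        hh = x.new_zeros(batch, self.hyper.hidden_size)
        hc = x.new_zeros(batch, self.hyper.hidden_size)
        outputs = []
        for x_t in x:
            # hyper cell step, then per-timestep weight modulation
            hh, hc = self.hyper(torch.cat([x_t, h], dim=1), (hh, hc))
            gates = self.gain(hh) * (self.w_x(x_t) + self.w_h(h))
            i, f, g, o = gates.chunk(4, dim=1)
            c = torch.sigmoid(f) * c + torch.sigmoid(i) * torch.tanh(g)
            h = torch.sigmoid(o) * torch.tanh(c)
            outputs.append(h)
        return torch.stack(outputs), (h, c)
```

A plain `nn.LSTM` applies the same weight matrices at every timestep; the sketch above shows the key departure, which is what the paper's "excels at sequence generation" claim is about.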
