-
Notifications
You must be signed in to change notification settings - Fork 8
/
Copy pathtodo.txt
34 lines (33 loc) · 1.04 KB
/
todo.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
todo:
- check for correlation between utterances and utility?
- maybe break the problem down, and test each part standalone somehow?
- (change number of utteracne tokens from 10 to 11)
done:
- add reward
- add backprop
- add logging + draw graph
- load/save model
- invalid proposals should cause reward 0 ...
- if no termination, then reward 0
- add baselining control variate
- fix gru => lstm
- remove termination token
- check N for forward batching
- add forward batching
- add reward for batching
- add backward batching
- add utterance back into net
- print example utterances/conversations
- add in game length to logs somehow
- add entropy regularization
- (cudaify)
- enable cuda for utterances
- add entropy regularization for utterances
- add logging of proportion of stochastic draws matching argmax
- add option to remove stochasticity, pick argmax, for testing
- add train/test split
- add test to training loop
- added `--disable-proposal`
- added non-prosocial
- probably should rerun with comms turned off
- probably should do with no-proposal too