Self-attentive encoder in Transformer (Vaswani et al., 2017) stacking m identical layers.
Sign In or Create an Account