Table 2: Hyperparameters used in the experiments.

Name                 Value
Word Embedding       GloVe (PTB) / fastText (UD)
BERT                 BERT-Base
CNN window size      3
CNN filters          30
BiLSTM layers        2
BiLSTM units         300 dimensions
Optimization         Adam
Learning rate        0.001
Rescaling factor τ   64
Dropout ratio        {0.1, 0.2, 0.3}
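
For readers who want to reuse these settings, the values in Table 2 can be collected into a single configuration object. The following is a minimal sketch only: the class name `Hyperparameters`, its field names, and the `word_embedding` helper are hypothetical and do not come from the authors' code. The table lists three dropout ratios without saying how one is chosen, so they are kept here as a tuple of listed values.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Hyperparameters:
    """Values copied from Table 2; all field names are illustrative assumptions."""

    bert_model: str = "BERT-Base"
    cnn_window_size: int = 3
    cnn_filters: int = 30
    bilstm_layers: int = 2
    bilstm_units: int = 300           # "300 dimensions" in Table 2
    optimizer: str = "Adam"
    learning_rate: float = 0.001
    rescaling_factor_tau: float = 64.0
    # Table 2 lists these three ratios; the selection procedure is not stated there.
    dropout_ratios: Tuple[float, ...] = (0.1, 0.2, 0.3)

    def word_embedding(self, dataset: str) -> str:
        """Return the word embedding Table 2 pairs with a dataset (PTB or UD)."""
        embeddings = {"PTB": "GloVe", "UD": "fastText"}
        try:
            return embeddings[dataset.upper()]
        except KeyError:
            raise ValueError(f"Table 2 lists no embedding for dataset {dataset!r}")


config = Hyperparameters()
print(config.word_embedding("UD"), config.learning_rate, config.dropout_ratios)
```

Freezing the dataclass keeps the recorded values from being changed by accident during an experiment run; a variant setting can be derived with `dataclasses.replace` instead.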