You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
# BERT for Vietnamese is trained on more 20 GB news dataset
6
+
7
+
Apply for task sentiment analysis on using [AIViVN's comments dataset](https://www.aivivn.com/contests/6)
8
+
9
+
The model achieved 0.90268 on the public leaderboard, (winner's score is 0.90087)
10
+
Bert4news is used for a toolkit Vietnames(segmentation and Named Entity Recognition) at ViNLPtoolkit(https://github.com/bino282/ViNLP)
11
+
12
+
***************New Mar 11 , 2020 ***************
13
+
14
+
**[BERT](https://github.com/google-research/bert)** (from Google Research and the Toyota Technological Institute at Chicago) released with the paper [BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding](https://arxiv.org/abs/1810.04805).
15
+
16
+
We use word sentencepiece, use basic bert tokenization and same config with bert base with lowercase = False.
0 commit comments