oliverguhr/german-sentiment-bert
German Sentiment Classification with Bert
This model was trained for sentiment classification of German language texts. To achieve the best results all model inputs needs to be preprocessed with the same procedure, that was applied during the training. To simplify the usage of the model,
we provide a Python package that bundles the code need for the preprocessing and inferencing.
The model uses the Googles Bert architecture and was trained on 1.834 million German-language samples. The training data contains texts from various domains like Twitter, Facebook and movie, app and hotel reviews.
You can find more information about the dataset and the training process in the paper.
Using the Python package
To get started install the package from pypi:
pip install germansentiment
from germansentiment import SentimentModel
model = SentimentModel()
texts = [
"Mit keinem guten Ergebniss","Das ist gar nicht mal so gut",
"Total awesome!","nicht so schlecht wie erwartet",
"Der Test verlief positiv.","Sie f?hrt ein grünes Auto."]
result = model.predict_sentiment(texts)
print(result)
The code above will output following list:
["negative","negative","positive","positive","neutral", "neutral"]
Output class probabilities
from germansentiment import SentimentModel
model = SentimentModel()
classes, probabilities = model.predict_sentiment(["das ist super"], output_probabilities = True)
print(classes, probabilities)
['positive'] [[['positive', 0.9761366844177246], ['negative', 0.023540444672107697], ['neutral', 0.00032294404809363186]]]
Model and Data
If you are interested in code and data that was used to train this model please have a look at this repository and our paper. Here is a table of the F1 scores that this model achieves on different datasets. Since we trained this model with a newer version of the transformer library, the results are slightly better than reported in the paper.
Dataset | F1 micro Score |
---|---|
holidaycheck | 0.9568 |
scare | 0.9418 |
filmstarts | 0.9021 |
germeval | 0.7536 |
PotTS | 0.6780 |
emotions | 0.9649 |
sb10k | 0.7376 |
Leipzig Wikipedia Corpus 2016 | 0.9967 |
all | 0.9639 |
數據評估
本站OpenI提供的oliverguhr/german-sentiment-bert都來源于網絡,不保證外部鏈接的準確性和完整性,同時,對于該外部鏈接的指向,不由OpenI實際控制,在2023年 5月 26日 下午6:06收錄時,該網頁上的內容,都屬于合規合法,后期網頁的內容如出現違規,可以直接聯系網站管理員進行刪除,OpenI不承擔任何責任。