SA Lora usuage and table is added

2023-12-19 18:19:59 +03:00 · 2023-10-02 14:14:56 +03:00
parent 171e15195d
commit dc62e9a290
1 changed files with 61 additions and 2 deletions
--- a/README.md
+++ b/README.md
@@ -2,7 +2,8 @@
 1. [Introduction](#introduction)
 2. [Main results](#results)
 3. [Using TurkishBERTweet with `transformers`](#transformers)
-    - [Models](#trainedModels)
+    - [Model](#trainedModels)
+    - [Lora Adapter]($loraAdapter)    
    - [Example usage](#usage2)
        - [Twitter Preprocessor](#preprocess)
        - [Feature Extraction](#feature_extraction)
@@ -17,11 +18,15 @@


 <!-- https://huggingface.co/VRLLab/TurkishBERTweet -->
-# <a name="trainedModels"></a> Models
+# <a name="trainedModels"></a> Model
 Model | #params | Arch. | Max length | Pre-training data
 ---|---|---|---|---
 `VRLLab/TurkishBERTweet` | 163M | base | 128 | 894M Turkish Tweets (uncased)

+# <a name="loraAdapter"></a> Lora Adapters
+Model | train f1 | dev f1 | test f1 | Dataset Size
+---|---|---|---|---
+`VRLLab/TurkishBERTweet-Lora-SA` | 0.799 | 0.687 | 0.692 | 42476 Turkish Tweets (uncased)
 # <a name="usage2"></a> Example usage


@@ -61,6 +66,60 @@ with torch.no_grad():
 ```


+## <a name="sa_lora"></a> Sentiment Classification
+
+```python
+from peft import (
+    PeftModel,
+    PeftConfig,
+)
+
+from transformers import (
+    AutoModelForSequenceClassification,
+    AutoTokenizer)
+from Preprocessor import preprocess
+ 
+
+pretrained_model_path =  "VRLLab/TurkishBERTweet"
+peft_model = "VRLLab/TurkishBERTweet-Lora-SA"
+peft_config = PeftConfig.from_pretrained(peft_model)
+
+# loading Tokenizer
+padding_side = "right"
+tokenizer = AutoTokenizer.from_pretrained(
+    pretrained_model_path, padding_side=padding_side
+)
+if getattr(tokenizer, "pad_token_id") is None:
+    tokenizer.pad_token_id = tokenizer.eos_token_id
+
+id2label = {0: "negative", 2: "positive", 1: "neutral"}
+best_model = AutoModelForSequenceClassification.from_pretrained(
+    peft_config.base_model_name_or_path, return_dict=True, num_labels=3, id2label=id2label
+)
+best_model = PeftModel.from_pretrained(best_model, peft_model)
+
+sample_texts = [
+    "Viral lab da insanlar hep birlikte çalışıyorlar. hepbirlikte çalışan insanlar birbirlerine yakın oluyorlar.",     
+    "americanin diplatlari turkiyeye gelmesin 😤",
+    "Mark Zuckerberg ve Elon Musk'un boks müsabakası süper olacak! 🥷",
+    "Adam dun ne yediğini unuttu"
+    ]
+
+
+preprocessed_texts = [preprocess(s) for s in sample_texts]
+
+for s in preprocessed_texts:
+    ids = tokenizer.encode_plus(s, return_tensors="pt")
+    label_id = best_model(**ids).logits.argmax(-1).item()
+    print(id2label[label_id],":", s)
+```
+
+```output
+positive : viral lab da insanlar hep birlikte çalışıyorlar. hepbirlikte çalışan insanlar birbirlerine yakın oluyorlar.
+negative : americanin diplatlari turkiyeye gelmesin <emoji> burundan_buharla_yüzleşmek </emoji>
+positive : mark zuckerberg ve elon musk'un boks müsabakası süper olacak! <emoji> kadın_muhafız_koyu_ten_tonu </emoji>
+neutral : adam dun ne yediğini unuttu
+```

 # <a name="citation"></a> Citation
 ```bibtex