How to use jackhhao/jailbreak-classifier with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-classification", model="jackhhao/jailbreak-classifier")

# Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("jackhhao/jailbreak-classifier")
model = AutoModelForSequenceClassification.from_pretrained("jackhhao/jailbreak-classifier")

Classifies prompts as jailbreaks or benign. This is a fine-tuned checkpoint of bert-base-uncased, trained on the jailbreak-classification dataset.
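As a rough illustration of how the classifier's output might be consumed, the sketch below turns pipeline results into one boolean per prompt. The helper names `flag_jailbreaks` and `load_classifier`, the `threshold` value, and the label string "jailbreak" are assumptions made for this example and are not taken from the model card; check `model.config.id2label` for the actual label names before relying on it.

```python
from typing import Callable, Dict, List


def flag_jailbreaks(
    classify: Callable[[List[str]], List[Dict]],
    prompts: List[str],
    jailbreak_label: str = "jailbreak",  # assumption: verify against model.config.id2label
    threshold: float = 0.5,              # assumption: illustrative cut-off, not from the card
) -> List[bool]:
    """Return True for each prompt whose top label is the jailbreak label
    with a score at or above the threshold."""
    # A text-classification pipeline called on a list of strings returns
    # one {"label": ..., "score": ...} dict per input.
    results = classify(prompts)
    return [
        r["label"].lower() == jailbreak_label.lower() and r["score"] >= threshold
        for r in results
    ]


def load_classifier():
    """Build the text-classification pipeline for this checkpoint."""
    # Imported lazily so flag_jailbreaks can be exercised without transformers installed.
    from transformers import pipeline

    return pipeline("text-classification", model="jackhhao/jailbreak-classifier")
```

With the model downloaded, `flag_jailbreaks(load_classifier(), ["some prompt", "another prompt"])` would return a list of two booleans. Passing the classifier in as an argument keeps the filtering logic easy to test with a stub.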
Fine-tuning hyper-parameters: