Bekerja dengan Hugging Face
Jacob H. Marquez
Lead Data Engineer
from transformers import pipeline
my_pipeline = pipeline(
"text-classification",
model="distilbert-base-uncased-finetuned-sst-2-english"))
print(my_pipeline("Wi-Fi is slower than a snail today!"))
[{'label': 'NEGATIVE', 'score': 0.99}]
$$
$$

$$
from transformers import AutoModelForSequenceClassification# Unduh model klasifikasi teks terlatih awal model = AutoModelForSequenceClassification.from_pretrained( "distilbert-base-uncased-finetuned-sst-2-english" )
$$
from transformers import AutoTokenizer# Ambil tokenizer yang dipasangkan dengan model tokenizer = AutoTokenizer.from_pretrained( "distilbert-base-uncased-finetuned-sst-2-english" )
$$
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")# Tokenisasi teks input tokens = tokenizer.tokenize("AI: Helping robots think and humans overthink:)") print(tokens)
['ai', ':', 'helping', 'robots', 'think', 'and',
'humans', 'over', '##thi', '##nk', ':', ')']
Model kita (distilbert-base-uncased):
['ai', ':', 'helping', 'robots', 'think', 'and', 'humans', 'over', '##thi',
'##nk', ':', ')']
Tokenizer BERT-Base-Cased:
['AI', ':', 'Help', '##ing', 'robots', 'think', 'and', 'humans', 'over',
'##thin', '##k', ':', ')']
from transformers import AutoModelForSequenceClassification, AutoTokenizer, pipeline# Unduh model dan tokenizer my_model = AutoModelForSequenceClassification.from_pretrained( "distilbert-base-uncased-finetuned-sst-2-english") my_tokenizer = AutoTokenizer.from_pretrained( "distilbert-base-uncased-finetuned-sst-2-english")# Buat pipeline kustom my_pipeline = pipeline( task="sentiment-analysis", model=my_model, tokenizer=my_tokenizer)
$$
🔧 Gunakan untuk kontrol dan kustomisasi lebih lanjut
📝 Prapemrosesan Teks: Bersihkan dan tokenisasi untuk kasus spesifik
$$

Bekerja dengan Hugging Face