Working with Hugging Face
Jacob H. Marquez
Lead Data Engineer
$$
$$
$$
Extractive:
$$ ✅ Selects key sentences from the text
$$ ✅ Efficient, needs fewer resources
$$ ❌ Lacks flexibility; may be less cohesive
$$
Abstractive:
$$ ✅ Generates new, rephrased text
$$ ✅ Clearer and more readable
$$ ❌ Requires more resources and processing
$$
$$
$$
$$
$$
$$
$$ $$
from transformers import pipeline
# Load the extractive summarization pipeline
summarizer = pipeline("summarization", model="nyamuda/extractive-summarization")
text = "This is my really large text about Data Science..."
summary_text = summarizer(text)
print(summary_text[0]['summary_text'])
"data science is a field that combines mathematics, statistics...."
from transformers import pipeline # Load the abstractive summarization pipeline summarizer = pipeline("summarization", model="sshleifer/distilbart-cnn-12-6")
text = "This is my really large text about Data Science..." summary_text = summarizer(text) print(summary_text[0]['summary_text'])
"The global data science platform market is projected
is projected to reach $140.9 billion by 2025..."
min_new_tokens
& max_new_tokens
: Control summary lengthsummarizer = pipeline(task="summarization", min_new_tokens=10, max_new_tokens=150)
Working with Hugging Face