Introduction to LLMs in Python
Jasmin Ludolf
Senior Data Science Content Developer, DataCamp
generator = pipeline(task="text-generation", model="distilgpt2")
prompt = "The Gion neighborhood in Kyoto is famous for"
output = generator(prompt, max_length=100, pad_token_id=generator.tokenizer.eos_token_id)
eos_token_id
: end-of-sequence token IDpad_token_id
: fills in extra space up to max_length
generator.tokenizer.eos_token_id
marks the end of meaningful text, learned through trainingmax_length
or pad_token_id
truncation = True
generator = pipeline(task="text-generation", model="distilgpt2") prompt = "The Gion neighborhood in Kyoto is famous for" output = generator(prompt, max_length=100, pad_token_id=generator.tokenizer.eos_token_id)
print(output[0]["generated_text"])
The Gion neighborhood in Kyoto is famous for its many colorful green forests, such as the
Red Hill, the Red River and the Red River. The Gion neighborhood is home to the world's
tallest trees.
generator = pipeline(task="text-generation", model="distilgpt2")
review = "This book was great. I enjoyed the plot twist in Chapter 10." response = "Dear reader, thank you for your review." prompt = f"Book review:\n{review}\n\nBook shop response to the review:\n{response}"
output = generator(prompt, max_length=100, pad_token_id=generator.tokenizer.eos_token_id) print(output[0]["generated_text"])
Dear reader, thank you for your review. We'd like to thank you for your reading!
translator = pipeline(task="translation_en_to_es", model="Helsinki-NLP/opus-mt-en-es")
text = "Walking amid Gion's Machiya wooden houses was a mesmerizing experience."
output = translator(text, clean_up_tokenization_spaces=True)
print(output[0]["translation_text"])
Caminar entre las casas de madera Machiya de Gion fue una experiencia fascinante.
Introduction to LLMs in Python