Working with Hugging Face
Jacob H. Marquez
Lead Data Engineer
pip install datasets
from datasets import load_dataset
data = load_dataset("IVN-RIN/BioBERT_Italian")
Split parameter
data = load_dataset("IVN-RIN/BioBERT_Italian", split="train")
# Filter for pattern " bella "
filtered = data.filter(lambda row: " bella " in row['text'])
print(filtered)
Dataset({
features: ['text'],
num_rows: 1122
})
# Select the first two rows
sliced = filtered.select(range(2))
print(sliced)
Dataset({features: ['text'], num_rows: 2})
# Extract the 'text' for the first row
print(sliced[0]['text'])
Concentrazioni atmosferiche di PCDD/PCDF...