Efficient AI Model Training with PyTorch
Dennis Lee
Data Engineer
$$
$$
$$
$$
$$
$$
$$
$$
$$
$$
$$
AutoModelForSequenceClassification
from transformers import AutoModelForSequenceClassification model = AutoModelForSequenceClassification.from_pretrained(model_name)
print(model.config)
DistilBertConfig {
"architectures": ["DistilBertForMaskedLM"],
"dim": 768,
"dropout": 0.1,
"hidden_dim": 3072,
...
Accelerator
detects which devices are available on our computeraccelerator.prepare()
torch.nn.Module
) on the first available GPUfrom accelerate import Accelerator accelerator = Accelerator() model = accelerator.prepare(model)
print(accelerator.device)
cpu
Efficient AI Model Training with PyTorch