Optimizing models for scalability
Deploying AI models efficiently is crucial for real-world applications where inference speed, model size, and computational efficiency matter. In this exercise, you will save and load a model for deployment, using TorchScript export to complete the workflow. The dataset used is a variation of the MNIST dataset.
By completing this exercise, you will have prepared a model optimized for deployment while applying advanced techniques learned in this lesson.
The X_test and y_test datasets, as well as torch.jit, have been preloaded for you.
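Before filling in the blanks, it can help to see the full trace-save-load round trip on a toy network. The sketch below is a minimal, self-contained illustration: TinyNet, its layer sizes, and the file name tiny_model.pt are hypothetical stand-ins, not the preloaded exercise model.

import torch
import torch.nn as nn

# Hypothetical stand-in for the preloaded exercise model
class TinyNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(28 * 28, 10)

    def forward(self, x):
        return self.fc(x.flatten(1))

model = TinyNet().eval()
example_input = torch.randn(1, 1, 28, 28)

# trace() runs the model once and records the executed operations
traced = torch.jit.trace(model, example_input)

# Save the TorchScript archive, then load it back;
# loading does not require the original Python class definition
torch.jit.save(traced, "tiny_model.pt")
restored = torch.jit.load("tiny_model.pt")

# The restored module reproduces the original outputs
assert torch.allclose(model(example_input), restored(example_input))

Note that torch.jit.trace records only the operations executed for the example input, so it suits models without data-dependent control flow; torch.jit.script compiles the Python source instead and preserves branching.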
Exercise instructions
- Export the model to TorchScript using the trace function.
- Save the scripted model to disk.
- Load the saved model.
Hands-on interactive exercise
Have a go at this exercise by completing this sample code.
# Export model to TorchScript
scripted_model = torch.jit.____(model, torch.tensor(X_test[:1], dtype=torch.float32).unsqueeze(1))
# Save model to TorchScript
torch.jit.____(scripted_model, 'model.pt')
# Load the saved model
loaded_model = torch.jit.____('____.pt')
# Validate inference on test dataset
test_loader = DataLoader(TensorDataset(torch.tensor(X_test, dtype=torch.float32).unsqueeze(1), ____), batch_size=64)
accuracy = evaluate_model(loaded_model, test_loader)
print(f"Optimized model accuracy: {accuracy:.2%}")