Comparing quantized model performance

Performance gains are not only a matter of accuracy. Quantized models often run inference faster, a key advantage in deployment scenarios. You will measure how long both the original model and the quantized model take to process the test set.

The measure_time() function has been predefined. It sets the model to evaluation mode, runs a forward pass over every batch in the dataloader, and returns the elapsed time.

Both model (the original model) and model_quantized (the quantized version) are already loaded, along with test_loader.
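
For readers who want to see what such a helper might look like, here is a minimal sketch based only on the description above. The course's actual measure_time() is not shown, so the signature (an explicit dataloader argument) and the assumption that each batch is an (inputs, labels) pair are illustrative guesses, not the predefined implementation.

```python
import time


def measure_time(model, dataloader):
    """Return the seconds taken to run `model` on every batch of `dataloader`.

    Hypothetical sketch: the course's predefined helper may differ.
    """
    # Evaluation mode disables training-only behaviour such as dropout.
    model.eval()

    start = time.perf_counter()
    # Assumption: each batch is an (inputs, labels) pair; labels are ignored.
    for inputs, _labels in dataloader:
        model(inputs)  # forward pass only; the output is discarded
    return time.perf_counter() - start
```

In real PyTorch code the loop would typically sit inside a torch.no_grad() block so that gradient bookkeeping does not skew the timing. time.perf_counter() is used because it is a monotonic, high-resolution clock suited to measuring intervals. A single pass can be noisy, so averaging several runs gives a steadier comparison.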

This exercise is part of the course

Scalable AI Models with PyTorch Lightning

Exercise instructions

Compute the inference time for the original model and the quantized model.
Print both times rounded to two decimal places.
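
Rounding to two decimals can be done either with round() or with a format specifier inside the f-string. The short sketch below contrasts the two using made-up timings (the values are placeholders, not measured results); format_seconds is a hypothetical helper name.

```python
def format_seconds(elapsed: float) -> str:
    """Format a duration with exactly two decimal places."""
    return f"{elapsed:.2f}s"


# Placeholder timings for illustration only, not real measurements.
original_time = 1.5
quant_time = 0.8349

# The format specifier always shows two decimals.
print(f"Original Model Time: {format_seconds(original_time)}")  # Original Model Time: 1.50s

# round() returns a float, so trailing zeros are not padded.
print(f"Quantized Model Time: {round(quant_time, 2)}s")  # Quantized Model Time: 0.83s
print(f"Original Model Time: {round(original_time, 2)}s")  # Original Model Time: 1.5s
```

Either approach satisfies "rounded to two decimals"; the format specifier is the better choice when the printed columns should line up.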

Hands-on interactive exercise

Try solving this exercise by completing the sample code.

# Measure inference time of the original model
original_time = measure_time(____)

# Measure inference time of the quantized model
quant_time = measure_time(____)

# Print results
print(f"Original Model Time: {____}s")
print(f"Quantized Model Time: {____}s")

In this chapter, we'll explore how PyTorch Lightning simplifies the development and deployment of scalable AI models. Starting with foundational concepts, we'll go through the core structure of a PyTorch Lightning project, including essential components like the LightningModule and Trainer, to set a strong foundation for more advanced AI solutions.

Exercise 1: Introduction to PyTorch Lightning
Exercise 2: Introducing the LightningModule
Exercise 3: Running the Lightning Trainer
Exercise 4: Defining models with LightningModule
Exercise 5: Usage of the LightningModule
Exercise 6: Mastering the init method
Exercise 7: Perfecting the forward method
Exercise 8: Implementing training logic
Exercise 9: Implementing the training step
Exercise 10: Configuring the optimizer
Exercise 11: Training and evaluating

In this chapter, we'll dive deeper into PyTorch Lightning to manage data efficiently and refine model training. We'll learn how to create modular, reusable data workflows with LightningDataModule, evaluate models accurately through validation and testing, and enhance training with Lightning Callbacks to automate model improvement and avoid overfitting.

Exercise 1: Managing data with LightningDataModule
Exercise 2: Splitting data with LightningDataModule
Exercise 3: Creating a train DataLoader
Exercise 4: Incorporating validation and testing
Exercise 5: Implementing the validation step
Exercise 6: Evaluate model accuracy using Torchmetrics
Exercise 7: Enhancing training with Lightning callbacks
Exercise 8: Classifying Lightning callbacks
Exercise 9: Optimizing model training with Lightning

Learn to prepare deep learning models for real-world deployment by making them leaner and faster. This chapter introduces techniques such as dynamic quantization, pruning, and TorchScript conversion, helping you reduce model size and latency without sacrificing accuracy.

Exercise 1: Applying dynamic quantization
Exercise 2: Apply dynamic quantization
Exercise 3: Comparing quantized model performance (current exercise)
Exercise 4: Implementing model pruning techniques
Exercise 5: Apply pruning to a linear layer
Exercise 6: Finalize pruning by removing the mask
Exercise 7: Exporting models with TorchScript
Exercise 8: Choosing the right conversion method
Exercise 9: Optimizing models for scalability
Exercise 10: Recap: Scalable AI Models with PyTorch Lightning