Congratulations!

1. Congratulations!

Congratulations! You made it to the end of the course on multi-modal models with Hugging Face!

2. Chapter 1

In Chapter 1, you learned how to access models with different modalities via the Hugging Face Hub and its handy API.

3. Chapter 1

You learned to preprocess data for text, image, and audio models, and built pipelines for image caption generation and text-guided music generation.

4. Chapter 2

In Chapter 2, you learned to perform computer vision tasks, including image classification, segmentation for image background removal, and object detection with bounding boxes.

5. Chapter 2

You also explored speech recognition and audio denoising, fine-tuning your models for next-level performance!

6. Chapter 3

In Chapter 3, you started combining modalities for multi-modal classification, including image-text similarity, article sentiment analysis, and video emotion analysis.

7. Chapter 4

Finally, in Chapter 4, you got to grips with multi-modal generation, having conversations with images and documents, using prompts to edit images, and even creating videos!

8. Bye and thanks!

Thank you for following this course to the end.

9. Bye and thanks!

Best of luck on the rest of your journey!