Is your deployed AI system successful?

1. Is your deployed AI system successful?

How can we assess the success of an AI solution before and after its deployment? Does our solution contribute to achieving the intended business goal, and does it deliver a positive return on investment?

2. When to measure success?

The success of an AI initiative or product must be assessed and closely monitored.

3. When to measure success?

Not only during its development, but also after its deployment in production, in parallel with a continuous performance monitoring process.

4. Measuring performance offline - accuracy

During development, particularly for Machine Learning or Deep Learning models, we want to use some metrics that help determine how well they perform before releasing them into production. Take a classification model as an example, where accuracy is normally the central metric to look at. If we have a dataset of pre-labeled penguin observations from three different species,

5. Measuring performance offline - accuracy

we only use a portion of the labeled data to train our model, in this case, a classifier that tries to learn from the data features to distinguish between penguin species.

6. Measuring performance offline - accuracy

We then validate our model by taking the remaining examples we left aside and passing them, without their labels, to our trained model, which tries to predict the right penguin species for as many validation examples as possible.
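As a rough sketch of this train-and-validate flow, assuming the publicly available penguins dataset bundled with seaborn and a scikit-learn decision tree (illustrative choices, not necessarily the setup used in the course), it could look like this:

```python
import seaborn as sns
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Load labeled penguin observations (three species) and drop incomplete rows
penguins = sns.load_dataset("penguins").dropna()
features = ["bill_length_mm", "bill_depth_mm", "flipper_length_mm", "body_mass_g"]
X = penguins[features]
y = penguins["species"]

# Keep a portion of the labeled data aside for validation
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.3, random_state=42)

# Train a classifier on the training portion only
clf = DecisionTreeClassifier(random_state=42)
clf.fit(X_train, y_train)

# Pass the held-out examples (without their labels) to the trained model
predictions = clf.predict(X_val)
```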

7. Measuring performance offline - accuracy

Some predictions will be correct, some will not.

8. Measuring performance offline - accuracy

In a nutshell, this is the essence of accuracy metrics: measuring the performance of an ML (or, more generally, an AI) solution against new data, based on the proportion of times it gives the right output.
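To make this concrete, here is a tiny illustration with made-up labels, using scikit-learn's accuracy_score; the commented line shows how the same call would apply to the validation split from the earlier sketch:

```python
from sklearn.metrics import accuracy_score

# Made-up true species and model predictions for four validation examples
y_true = ["Adelie", "Gentoo", "Chinstrap", "Adelie"]
y_pred = ["Adelie", "Gentoo", "Adelie", "Adelie"]

# 3 correct predictions out of 4 gives an accuracy of 0.75
print(accuracy_score(y_true, y_pred))

# On the validation set from the earlier sketch, this would be:
# accuracy_score(y_val, predictions)
```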

9. Beyond accuracy - error and other metrics

Depending on the problem and type of solution, there are other important metrics to monitor. In regression models, for instance, performance is described by the error between numerical predictions and actual outputs. In search and recommendation engines, the ordered ranking of results must be assessed in terms of user relevance or diversity, and so on. If our model performance is not as expected, we may have to improve it by fine-tuning it or by improving the quality of the training data, until it performs satisfactorily.
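As a small illustrative example of such error metrics, here is how mean absolute error and root mean squared error could be computed with scikit-learn on made-up regression outputs:

```python
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error

# Made-up actual values and numerical predictions from a regression model
y_actual = np.array([250.0, 300.0, 180.0, 420.0])
y_predicted = np.array([240.0, 330.0, 200.0, 400.0])

# Average absolute difference between predictions and actual outputs
mae = mean_absolute_error(y_actual, y_predicted)

# Root mean squared error penalizes large individual errors more strongly
rmse = np.sqrt(mean_squared_error(y_actual, y_predicted))

print(f"MAE: {mae:.1f}, RMSE: {rmse:.1f}")  # MAE: 20.0, RMSE: 21.2
```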

10. Measuring success in production

But the story doesn't end there. We need to keep observing the model's performance, both in terms of the performance metrics discussed earlier and in terms of its contribution to business goals. A deployed model's performance must be closely monitored for multiple reasons. One of the most obvious is model degradation, which happens when the assessed metric starts to deteriorate over time, for example because the nature of the data consumed changes, signaling the need to re-train our model. The concept of a KPI (or Key Performance Indicator) is commonly used to quantify the business success of an AI system. A KPI is a measurable indicator of the performance and progress of specific objectives in an organization.
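As a minimal sketch of such a monitoring check, assuming we periodically recompute accuracy on recently labeled production data and compare it against the accuracy observed at deployment time (all names, labels, and thresholds below are illustrative):

```python
from sklearn.metrics import accuracy_score

# Accuracy measured on the validation set at deployment time (illustrative value)
baseline_accuracy = 0.95

# Maximum drop we tolerate before flagging degradation (illustrative threshold)
max_allowed_drop = 0.05

def check_for_degradation(y_recent_true, y_recent_pred):
    """Compare accuracy on recently labeled production data with the baseline."""
    current_accuracy = accuracy_score(y_recent_true, y_recent_pred)
    if baseline_accuracy - current_accuracy > max_allowed_drop:
        print(f"Possible degradation (accuracy {current_accuracy:.2f}): consider re-training.")
    else:
        print(f"Model looks healthy (accuracy {current_accuracy:.2f}).")

# Hypothetical labels collected from production over the last period
check_for_degradation(
    ["Adelie", "Gentoo", "Gentoo", "Chinstrap"],
    ["Adelie", "Adelie", "Gentoo", "Adelie"],
)
```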

11. Risks: what could possibly go wrong?

Finally, it is realistic to assume that many types of risks can get in the way during our journey to a successful AI solution. Here are some examples of risks, some of which we will discuss later: Data bias leading to discriminatory outcomes. Lack of transparency to understand AI decisions. Ethical concerns like responsible data use. System reliability and robustness against errors. And possible vulnerabilities to cyber threats. One of the ways to identify risks is by developing a Proof-of-Concept before the final AI product or solution. A PoC is a pilot version of the solution to demonstrate its feasibility and potential value.

12. Let's practice!

Time to practice.