Close the tag, please!

In the meantime, you are working on one of your other projects. The company is going to develop a new product. It will help developers automatically check the code they are writing. You need to write a short script for checking that every HTML tag that is open has its proper closure.

You have an example of a string containing HTML tags:

<title>The Data Science Company</title>

You learn that an opening HTML tag is always at the beginning of the string. It appears inside <>. A closing tag also appears inside <>, but it is preceded by /.

You also remember that capturing groups can be referenced using numbers, e.g \4.

The list html_tags, containing three strings with HTML tags, and there module are loaded in your session. You can use print() to view the data in the IPython Shell.

This exercise is part of the course

Regular Expressions in Python

Exercise instructions

Complete the regex in order to match closed HTML tags. Find if there is a match in each string of the list html_tags. Assign the result to match_tag.
If a match is found, print the first group captured and saved in match_tag.
If no match is found, complete the regex to match only the text inside the HTML tag. Assign it to notmatch_tag.
Print the first group captured by the regex and save it in notmatch_tag.

Hands-on interactive exercise

Have a go at this exercise by completing this sample code.

for string in html_tags:
    # Complete the regex and find if it matches a closed HTML tags
    match_tag =  re.match(____"<____>.*?", ____)
 
    if match_tag:
        # If it matches print the first group capture
        print("Your tag {} is closed".format(match_tag.____(____))) 
    else:
        # If it doesn't match capture only the tag 
        notmatch_tag = re.match(____"<____>", ____)
        # Print the first group capture
        print("Close your {} tag!".format(notmatch_tag.____(____)))

Edit and Run Code

This exercise is part of the course

Regular Expressions in Python

BeginnerSkill Level

4.8+

Start Course for Free

Start your journey into the regular expression world! From slicing and concatenating, adjusting the case, removing spaces, to finding and replacing strings. You will learn how to master basic operation for string manipulation using a movie review dataset.

Exercise 1: Introduction to string manipulation Exercise 2: First day!Exercise 3: Artificial reviews Exercise 4: Palindromes Exercise 5: String operations Exercise 6: Normalizing reviews Exercise 7: Time to join!Exercise 8: Split lines or split the line?Exercise 9: Finding and replacing Exercise 10: Finding a substring Exercise 11: Where's the word?Exercise 12: Replacing negations

Following your journey, you will learn the main approaches that can be used to format or interpolate strings in python using a dataset containing information scraped from the web. You will explore the advantages and disadvantages of using positional formatting, embedding expressing inside string constants, and using the Template class.

Exercise 1: Positional formatting Exercise 2: Put it in order!Exercise 3: Calling by its name Exercise 4: What day is today?Exercise 5: Formatted string literal Exercise 6: Literally formatting Exercise 7: Make this function Exercise 8: On time Exercise 9: Template method Exercise 10: Preparing a report Exercise 11: Identifying prices Exercise 12: Playing safe

Time to discover the fundamental concepts of regular expressions! In this key chapter, you will learn to understand the basic concepts of regular expression syntax. Using a real dataset with tweets meant for sentiment analysis, you will learn how to apply pattern matching using normal and special characters, and greedy and lazy quantifiers.

Exercise 1: Introduction to regular expressions Exercise 2: Are they bots?Exercise 3: Find the numbers Exercise 4: Match and split Exercise 5: Repetitions Exercise 6: Everything clean Exercise 7: Some time ago Exercise 8: Getting tokens Exercise 9: Regex metacharacters Exercise 10: Finding files Exercise 11: Give me your email Exercise 12: Invalid password Exercise 13: Greedy vs. non-greedy matching Exercise 14: Understanding the difference Exercise 15: Greedy matching Exercise 16: Lazy approach

In the last step of your journey, you will learn more complex methods of pattern matching using parentheses to group strings together or to match the same text as matched previously. Also, you will get an idea of how you can look around expressions.

Exercise 1: Capturing groups Exercise 2: Try another name Exercise 3: Flying home Exercise 4: Alternation and non-capturing groups Exercise 5: Love it!Exercise 6: Ugh! Not for me!Exercise 7: Backreferences Exercise 8: Parsing PDF files Exercise 9: Close the tag, please!

Current Exercise

Exercise 10: Reeepeated characters Exercise 11: Lookaround Exercise 12: Surrounding words Exercise 13: Filtering phone numbers Exercise 14: Finishing line