Repeated characters

Back to your sentiment analysis! Your next task is to replace the elongated words that appear in the tweets. We define an elongated word as a word that contains a character repeated two or more times in a row, e.g. "Awesoooome".

Replacing these words is very important because a classifier will treat them as a different term from the source words, which lowers the frequency counted for the source words.
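To see why the frequency drops, here is a minimal sketch that counts tokens before and after normalization. The token list is made up for illustration, and the rule of collapsing any run of three or more identical characters to a single character is an assumption of this sketch, not a method given in the exercise.

```python
import re
from collections import Counter

# Hypothetical tokens, invented for this illustration only
tokens = ["awesome", "awesoooome", "awesome", "awesooome", "good"]

# Without normalization, each elongated spelling is counted as its own term
raw_counts = Counter(tokens)

# Assumed normalization rule: collapse a run of 3+ identical characters to one
normalized = [re.sub(r"(\w)\1{2,}", r"\1", token) for token in tokens]
normalized_counts = Counter(normalized)

print(raw_counts["awesome"])         # count for the plain spelling only
print(normalized_counts["awesome"])  # count once elongated forms are merged
```

Because the rule requires at least three identical characters in a row, ordinary double letters such as the "oo" in "good" are left alone.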

To find them, you will use capturing groups and refer back to them by number, e.g. \4.
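As a small illustration of numbered backreferences, the sketch below uses a single group, so the reference is \1; a reference such as \4 would point to the fourth group in the same way. The sample sentence is made up.

```python
import re

# Group 1 captures a word; \1 requires that exact text to appear again
pattern = r"(\w+) \1"

# Hypothetical sample text
match = re.search(pattern, "I am so so happy")

repeated = match.group(1)  # text captured by group 1
whole = match.group(0)     # the entire matched span
print(repeated, "|", whole)
```

A backreference matches the text that the group captured, not the group's pattern, which is what makes it suitable for detecting repetition.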

Suppose you want to match Awesoooome. You first need to capture Awes. Then match o and refer back to that same character, and finally match me.
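The walk-through above can be sketched roughly as follows. The three-group layout is one reading of the description, chosen for illustration; it is not a pattern shown in the exercise.

```python
import re

# (Awes) captures the prefix, (o) captures one "o",
# \2+ demands one or more further copies of group 2, (me) captures the suffix
pattern = r"(Awes)(o)\2+(me)"

match = re.search(pattern, "Awesoooome")

print(match.group(0))  # whole match
print(match.groups())  # the three captured groups as a tuple
```

This literal pattern only matches this one word; a general solution replaces the fixed letters with character classes such as \w.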

The list sentiment_analysis, containing the text of three tweets, and the re module are loaded in your session. You can use print() to view the data in the IPython Shell.

This exercise is part of the course

Regular Expressions in Python

Exercise instructions

Complete the regular expression to match an elongated word as described.
Search the elements in the sentiment_analysis list to find out whether they contain elongated words. Assign the result to match_elongated.
Assign the captured group number zero to the variable elongated_word.
Print the result contained in the variable elongated_word.
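Group number zero can be confusing at first, so here is a minimal sketch of how group numbering behaves on a match object. The sample text and the deliberately simple pattern are assumptions for illustration and are not the answer to the exercise.

```python
import re

# Hypothetical sample text; the pattern finds one character followed by itself
match = re.search(r"(\w)\1", "That was sooo good")

whole_match = match.group(0)  # group zero: everything the pattern matched
first_group = match.group(1)  # group one: only what the parentheses captured
default = match.group()       # calling group() with no argument means group(0)

print(whole_match, first_group, default)
```

Group zero always exists on a successful match, even when the pattern contains no parentheses at all.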

Hands-on interactive exercise

Try this exercise by completing this sample code.

# Complete the regex to match an elongated word
regex_elongated = r"____(____)____\w*"

for tweet in sentiment_analysis:
    # Find if there is a match in each tweet
    match_elongated = re.____(____, ____)

    if match_elongated:
        # Assign the captured group zero
        elongated_word = match_elongated.____(____)

        # Complete the format method to print the word
        print("Elongated word found: {____}".format(word=____))
    else:
        print("No elongated word found")
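One plausible way to fill in the blanks is sketched below. The three tweets are hypothetical stand-ins, since the real sentiment_analysis list is preloaded by the exercise and not shown here, and the results list is added only so the outcome can be inspected. Treat this as a sketch under those assumptions rather than the official solution.

```python
import re

# Hypothetical stand-in for the preloaded sentiment_analysis list
sentiment_analysis = [
    "This movie was Awesoooome, loved it",
    "Nothing special about this one",
    "I am sooo tired today",
]

# \w* leading characters, (\w) one captured character,
# \1 the same character again, \w* the rest of the word
regex_elongated = r"\w*(\w)\1\w*"

results = []  # hypothetical helper list, not part of the exercise template
for tweet in sentiment_analysis:
    # Find if there is a match in each tweet
    match_elongated = re.search(regex_elongated, tweet)

    if match_elongated:
        # Group zero is the whole matched word
        elongated_word = match_elongated.group(0)
        print("Elongated word found: {word}".format(word=elongated_word))
        results.append(elongated_word)
    else:
        print("No elongated word found")
        results.append(None)
```

Note that a single \1 also flags ordinary words with a double letter, such as "good". A stricter variant, for example \1{2,}, would require at least three identical characters in a row.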
