Aan de slagGa gratis aan de slag

Components with extensions

Extension attributes are especially powerful if they're combined with custom pipeline components. In this exercise, you'll write a pipeline component that finds country names and a custom extension attribute that returns a country's capital, if available.

The nlp object has already been created and the Span class is already imported. A phrase matcher with all countries is available as the variable matcher. A dictionary of countries mapped to their capital cities is available as the variable capitals.

Deze oefening maakt deel uit van de cursus

Advanced NLP with spaCy

Cursus bekijken

Praktische interactieve oefening

Probeer deze oefening eens door deze voorbeeldcode in te vullen.

def countries_component(doc):
    # Create an entity Span with the label 'GPE' for all matches
    doc.ents = [____(____, ____, ____, label=____)
                for match_id, start, end in matcher(doc)]
    return doc

# Add the component to the pipeline
____.____(____)
print(nlp.pipe_names)
Code bewerken en uitvoeren