Create the pipeline
You're finally ready to create a Pipeline!
Pipeline is a class in the pyspark.ml module that combines all the Estimators and Transformers that you've already created. This lets you reuse the same modeling process over and over again by wrapping it up in one simple object. Neat, right?
Bu egzersiz
Foundations of PySpark
kursunun bir parçasıdırEgzersiz talimatları
- Import
Pipelinefrompyspark.ml. - Call the
Pipeline()constructor with the keyword argumentstagesto create aPipelinecalledflights_pipe.stagesshould be a list holding all the stages you want your data to go through in the pipeline. Here this is just:[dest_indexer, dest_encoder, carr_indexer, carr_encoder, vec_assembler]
Uygulamalı interaktif egzersiz
Bu örnek kodu tamamlayarak bu egzersizi bitirin.
# Import Pipeline
from ____ import ____
# Make the pipeline
flights_pipe = Pipeline(stages=____)