Introduction to Spark SQL in Python

Exercise

Practice reading query plans 2

Three dataframes are available: part2_df, part3_df, and part4_df. The questions posed in this exercise can be answered by inspecting the explain() output of each dataframe.

Note that Spark tags each column name with a descriptor, delimited by a # symbol. For example, word#0, id#1L, part#2, and title#3. For the purpose of this exercise, these descriptors can be ignored.
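
To make a plan easier to read, the descriptors can be stripped from the text before inspecting it. The following is a minimal sketch, assuming the descriptors always look like a # followed by digits and an optional L, as in the examples above. The helper name strip_descriptors is hypothetical and is not part of Spark.

```python
import re

# Assumed descriptor shape, based only on the examples in the text:
# "#", one or more digits, then an optional "L" (e.g. "#0", "#1L").
_DESCRIPTOR = re.compile(r"#\d+L?\b")


def strip_descriptors(plan_text):
    """Return plan_text with column descriptors such as '#1L' removed."""
    return _DESCRIPTOR.sub("", plan_text)


print(strip_descriptors("word#0, id#1L, part#2, title#3"))
```

The same function can be applied to a whole plan captured as a string, since it only touches text matching the descriptor pattern.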

Instructions 1/4

Question

  • What file was part2_df loaded from? Give only the filename and its extension, not the full path.
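
As a rough illustration of reducing a full path to a filename and extension, the sketch below pulls the scan location out of plan text and keeps only the last path component. The plan string is hypothetical and is not the output for part2_df. The "Location: ...[...]" layout is an assumption about how a file scan is printed and may differ between Spark versions. The helper name scan_filenames is also hypothetical.

```python
import posixpath
import re
from urllib.parse import urlparse

# Hypothetical plan text for illustration only; the path is made up.
SAMPLE_PLAN = (
    "== Physical Plan ==\n"
    "*(1) FileScan parquet [word#0,id#1L] Batched: true, Format: Parquet, "
    "Location: InMemoryFileIndex[file:/tmp/example/some_table.parquet], "
    "PartitionFilters: [], PushedFilters: []"
)


def scan_filenames(plan_text):
    """Return the filename (with extension) of each scanned location."""
    names = []
    # Assumed layout: "Location: <IndexName>[<uri>, <uri>, ...]".
    for bracketed in re.findall(r"Location:[^\[]*\[([^\]]*)\]", plan_text):
        for uri in bracketed.split(","):
            path = urlparse(uri.strip()).path  # drops the "file:" scheme
            names.append(posixpath.basename(path))
    return names


print(scan_filenames(SAMPLE_PLAN))
```

Reading the location by eye in the explain() output works just as well; the code only shows which part of the path the question is asking for.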

Possible answers