知名線上課程 Spark第二部曲: SQL輕鬆處理半結構化資料技巧(含教材) 講師:Erica Li 影音教學 中文發音 繁體中文版(DVD版)
課程內容:
認識 Spark SQL
Introduction to SparkSQL
Initialize SparkSQL with SQLContext
From SchemaRDD to DataFrame
Phases of Query Planning
Spark SQL 支援的資料來源
Data sources
資料匯入與匯出
Input - connected with local system
Input - connected with HDFS
Input - connected with Hive
Output
Dataframe
Dataframe operations(1)
Dataframe operations(2)
UDF(User Defined Functions)的使用
Create your first user defined functions
效能調校 (Performance Tuning)
Performance tuning in SparkSQL