Upgrading from PySpark 1.4 to 1.5ΒΆ
- Resolution of strings to columns in Python now supports using dots (.) to qualify the column or access nested values. For example - df['table.column.nestedField']. However, this means that if your column name contains any dots you must now escape them using backticks (e.g.,- table.`column.with.dots`.nested).
- DataFrame.withColumn method in PySpark supports adding a new column or replacing existing columns of the same name.