Download a quick PySpark cheatsheet as a PDF. The following topics are included:
- Spark
- Initializing Spark
- Loading Data
- Retrieving RDD Information
- Applying Functions
- Selecting Data
- Iterating
- Reshaping Data
- Mathematical Operations
- Sort
- Repartitioning
- Saving
- Stopping SparkContext
- Execution
Data Scientist with 3+ years of experience building data-intensive applications in diverse industries. Proficient in predictive modeling, computer vision, natural language processing, and data visualization, among other areas. Aside from being a data scientist, I am also a blogger and photographer.