PySpark RDD example

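from pyspark.sql import SparkSession
spark = SparkSession.builder.getOrCreate()  # reuses an existing session if there is one

# Create an RDD from a local Python list with parallelize
dataList = [("Java", 20000), ("Python", 100000), ("Scala", 3000)]
rdd = spark.sparkContext.parallelize(dataList)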
Source: sparkbyexamples.com
 
Tagged: #pyspark #rdd