data = [1, 2, 3, 4, 5]
distData = sc.parallelize(data)
Spark支持外部数据转换为并行化数据集合
>>> distFile = sc.textFile("data.txt")