data = [1, 2, 3, 4, 5]
distData = sc.parallelize(data)

Spark支持外部数据转换为并行化数据集合

>>> distFile = sc.textFile("data.txt")

results matching ""

    No results matching ""