seed = 10
sampleby = 'colA'
fraction=0.8
franctions = df.select(sampleby).distinct().withColumn('fraction', lit(fraction)).rdd.collec
pyspark 代码练习7 —— 分层抽样
最新推荐文章于 2023-03-30 22:32:49 发布
seed = 10
sampleby = 'colA'
fraction=0.8
franctions = df.select(sampleby).distinct().withColumn('fraction', lit(fraction)).rdd.collec