distinct: remove duplicates
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext...merged together
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;
import...System.out.println(results);
}
}
The result is [1, 2, 3, 4, 5, 1, 6, 7, 8, 9]
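The duplicate 1 in that result shows that merging keeps every element of both inputs and does not deduplicate. The original listing is truncated here, so the following is only a minimal sketch of that behaviour using plain Java collections, with no Spark dependency. The class name `UnionSketch`, the helper `union`, and the two input lists are hypothetical; the inputs were chosen so that the output matches the result quoted above.

```java
import java.util.ArrayList;
import java.util.List;

// Plain-Java illustration of "merge without deduplication"; this is not Spark code.
final class UnionSketch {

    // Keeps every element of both inputs, duplicates included.
    // In Spark the corresponding call would be rdd1.union(rdd2).
    static <T> List<T> union(List<T> left, List<T> right) {
        List<T> merged = new ArrayList<>(left);
        merged.addAll(right);
        return merged;
    }

    public static void main(String[] args) {
        // Hypothetical inputs (assumption: not taken from the truncated listing).
        List<Integer> first = List.of(1, 2, 3, 4, 5);
        List<Integer> second = List.of(1, 6, 7, 8, 9);
        System.out.println(union(first, second)); // [1, 2, 3, 4, 5, 1, 6, 7, 8, 9]
    }
}
```

If unique values are wanted, Spark's `distinct()` can be chained after `union`.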
intersection: take the intersection
import org.apache.spark.api.java.JavaRDD....subtract(RDD2) returns the elements that appear in RDD1 but do not appear in RDD2, without deduplication
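To make "without deduplication" concrete, here is a minimal sketch of the same rule written with plain Java collections rather than Spark. The names `SubtractSketch` and `subtract` and the sample inputs are hypothetical. A real `subtract` on an RDD involves a shuffle, so the order of its collected output may differ from the order this sketch produces.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Plain-Java illustration of subtract semantics; this is not Spark code.
final class SubtractSketch {

    // Keeps each element of `left` that does not occur in `right`.
    // Repeated elements of `left` survive as repeats.
    static <T> List<T> subtract(List<T> left, List<T> right) {
        Set<T> exclude = new HashSet<>(right);
        List<T> kept = new ArrayList<>();
        for (T item : left) {
            if (!exclude.contains(item)) {
                kept.add(item);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Hypothetical inputs: 1 appears twice on the left and never on the right.
        List<Integer> rdd1 = List.of(1, 1, 2, 3, 4);
        List<Integer> rdd2 = List.of(3, 4, 5);
        System.out.println(subtract(rdd1, rdd2)); // [1, 1, 2]
    }
}
```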
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext..."a","b","c"] and B is ["1","2","3"], then the Cartesian product is (1 a)(1 b)(1 c)(2 a)(2 b)(2 c)(3 a)(3 b)(3 c)
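The nine pairs listed above can be reproduced with two nested loops. This is a rough sketch in plain Java, not Spark code: `CartesianSketch` and `cartesian` are hypothetical names, and pairs are rendered as strings in the same `(x y)` notation. Since the listed pairs put B's elements first, the sketch assumes B is the left operand, which in Spark would correspond to calling `cartesian` on B with A as the argument (that call returns a `JavaPairRDD` of tuples rather than strings).

```java
import java.util.ArrayList;
import java.util.List;

// Plain-Java illustration of a Cartesian product; this is not Spark code.
final class CartesianSketch {

    // Pairs every element of `left` with every element of `right`,
    // so the result has left.size() * right.size() entries.
    static List<String> cartesian(List<String> left, List<String> right) {
        List<String> pairs = new ArrayList<>();
        for (String l : left) {
            for (String r : right) {
                pairs.add("(" + l + " " + r + ")");
            }
        }
        return pairs;
    }

    public static void main(String[] args) {
        List<String> a = List.of("a", "b", "c");
        List<String> b = List.of("1", "2", "3");
        // Assumption: B is the left operand, matching the pair order in the text.
        System.out.println(String.join("", cartesian(b, a)));
        // (1 a)(1 b)(1 c)(2 a)(2 b)(2 c)(3 a)(3 b)(3 c)
    }
}
```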
import org.apache.spark.api.java.JavaRDD