tag:blogger.com,1999:blog-2917373063223995124.post4511997623591511419..comments2022-12-03T16:53:54.378-08:00Comments on A River of Bytes: Learning to use Apache Spark and Kafka TogetherSpiro Michaylovhttp://www.blogger.com/profile/04679894175176300394noreply@blogger.comBlogger1125tag:blogger.com,1999:blog-2917373063223995124.post-8940011962470298322017-05-23T01:51:55.901-07:002017-05-23T01:51:55.901-07:00ControlledPartitioning: Here the topic has six par...ControlledPartitioning: Here the topic has six partitions but instead of writing to it using the configured partitioner, we assign all records to the same partition explicitly. Although the generated RDDs still have the same number of partitions as the topic, only one partition has all the data in it. This demonstrates how to exercise control over partitioning all the way from the original RDD, through the topic to the resulting RDDs. http://www.river-of-bytes.comAnonymoushttps://www.blogger.com/profile/14898858693042584445noreply@blogger.com