Data Balance Algorithm Based on Histogram in MapReduce
Data Balance Algorithm Based on Histogram in MapReduce is a scholarly work, published in 2018 in ''Journal of Northwestern Polytechnical University''. The main subjects of the publication include block, histogram, randomness, computer science, Skew, concept drift, cloud computing, clustered file system, and algorithm. MapReduce model is a typical distributed computing model, which is widely used in large-scale data processing, and its performance depends largely on the data distribution status.