TY - JOUR
AU - Tsai, C.-W.
AU - Lai, C.-F.
AU - Chao, H.-C.
AU - Vasilakos, A. V.
PY - 2015
DA - 2015//
TI - Big data analytics: a survey
JO - J Big Data.
VL - 2
UR - https://doi.org/10.1186/s40537-015-0030-3
DO - 10.1186/s40537-015-0030-3
ID - Tsai2015
ER -

TY - JOUR
AU - Sanse, K.
AU - Sharma, M.
PY - 2015
DA - 2015//
TI - Clustering methods for Big data analysis
JO - Int J Adv Res Comput Eng Technol.
VL - 4
ID - Sanse2015
ER -

TY - CHAP
AU - Zhao, Weizhong
AU - Ma, Huifang
AU - He, Qing
PY - 2009
DA - 2009//
TI - Parallel K-Means Clustering Based on MapReduce
BT - Lecture Notes in Computer Science
PB - Springer Berlin Heidelberg
CY - Berlin, Heidelberg
ID - Zhao2009
ER -

TY - STD
TI - Srivastava DK, Yadav R, Agrwal G. Map reduce programming model for parallel K-mediod algorithm on hadoop cluster. In: 2017 7th international conference on communication systems and network technologies (CSNT). 2017. p. 74–8.
ID - ref4
ER -

TY - STD
TI - Dai B-R, Lin I-C. Efficient map/reduce-based dbscan algorithm with optimized data partition. In: 2012 IEEE Fifth international conference on cloud computing. 2012. p. 59–66.
ID - ref5
ER -

TY - JOUR
AU - He, Y.
AU - Tan, H.
AU - Luo, W.
AU - Feng, S.
AU - Fan, J.
PY - 2014
DA - 2014//
TI - MR-DBSCAN: a scalable MapReduce-based DBSCAN algorithm for heavily skewed data
JO - Front Comput Sci.
VL - 8
UR - https://doi.org/10.1007/s11704-013-3158-3
DO - 10.1007/s11704-013-3158-3
ID - He2014
ER -

TY - STD
TI - Verma A, Cherkasova L, Campbell RH. Two sides of a coin: Optimizing the schedule of mapreduce jobs to minimize their makespan and improve cluster performance. In: 2012 IEEE 20th international symposium on modeling, analysis and simulation of computer and telecommunication systems. 2012. p. 11–8.
ID - ref7
ER -

TY - STD
TI - Ramakrishnan SR, Swart G, Urmanov A. Balancing reducer skew in MapReduce workloads using progressive sampling. In: Proceedings of the Third ACM symposium on cloud computing. 2012. p. 16.
ID - ref8
ER -

TY - STD
TI - Fan L, Gao B, Zhang F, Liu Z. OS4M: Achieving Global Load Balance of MapReduce workload by scheduling at the operation level. arXiv Prepr arXiv14063901. 2014.
ID - ref9
ER -

TY - STD
TI - Xia H. Load balancing greedy algorithm for reduce on Hadoop platform. In: 2018 IEEE 3rd international conference on big data analysis (ICBDA). 2018. p. 212–6.
ID - ref10
ER -

TY - JOUR
AU - Xia, Dawen
AU - Wang, Binfeng
AU - Li, Yantao
AU - Rong, Zhuobo
AU - Zhang, Zili
PY - 2015
DA - 2015//
TI - An Efficient MapReduce-Based Parallel Clustering Algorithm for Distributed Traffic Subarea Division
JO - Discrete Dynamics in Nature and Society
VL - 2015
UR - https://doi.org/10.1155/2015/793010
DO - 10.1155/2015/793010
ID - Xia2015
ER -

TY - JOUR
AU - Ke, H.
AU - Li, P.
AU - Guo, S.
AU - Guo, M.
PY - 2015
DA - 2015//
TI - On traffic-aware partition and aggregation in mapreduce for big data applications
JO - IEEE Trans Parallel Distrib Syst
VL - 27
UR - https://doi.org/10.1109/TPDS.2015.2419671
DO - 10.1109/TPDS.2015.2419671
ID - Ke2015
ER -

TY - JOUR
AU - Reddy, Y. D.
AU - Sajin, A. P.
PY - 2016
DA - 2016//
TI - An efficient traffic-aware partition and aggregation for big data applications using map-reduce
JO - Indian J Sci Technol.
VL - 9
UR - https://doi.org/10.17485/ijst/2016/v9i10/88981
DO - 10.17485/ijst/2016/v9i10/88981
ID - Reddy2016
ER -

TY - STD
TI - Venkatesh G, Arunesh K. Map Reduce for big data processing based on traffic aware partition and aggregation. Cluster Comput. 2018. p. 1–7.
ID - ref14
ER -

TY - JOUR
AU - HajKacem, M. A.
AU - N’cir, C.-E.
AU - Essoussi, N.
PY - 2019
DA - 2019//
TI - One-pass MapReduce-based clustering method for mixed large scale data
JO - J Intell Inf Syst.
VL - 52
UR - https://doi.org/10.1007/s10844-017-0472-5
DO - 10.1007/s10844-017-0472-5
ID - HajKacem2019
ER -

TY - STD
TI - Ilango SS, Vimal S, Kaliappan M, Subbulakshmi P. Optimization using artificial bee colony based clustering approach for big data. Cluster Comput. 2018. p. 1–9.
ID - ref16
ER -

TY - JOUR
AU - Fan, T.
PY - 2018
DA - 2018//
TI - Research and implementation of user clustering based on MapReduce in multimedia big data
JO - Multimed Tools Appl.
VL - 77
UR - https://doi.org/10.1007/s11042-017-4825-4
DO - 10.1007/s11042-017-4825-4
ID - Fan2018
ER -

TY - JOUR
AU - Jane, E. M.
AU - Raj, E.
PY - 2018
DA - 2018//
TI - SBKMMA: sorting based K means and median based clustering algorithm using multi machine technique for big data
JO - Int J Comput.
VL - 28
ID - Jane2018
ER -

TY - JOUR
AU - Kaur, A.
AU - Datta, A.
PY - 2015
DA - 2015//
TI - A novel algorithm for fast and scalable subspace clustering of high-dimensional data
JO - J Big Data.
VL - 2
UR - https://doi.org/10.1186/s40537-015-0027-y
DO - 10.1186/s40537-015-0027-y
ID - Kaur2015
ER -

TY - CHAP
AU - Kanimozhi, K. V.
AU - Venkatesan, M.
PY - 2017
DA - 2017//
TI - A Novel Map-Reduce Based Augmented Clustering Algorithm for Big Text Datasets
BT - Advances in Intelligent Systems and Computing
PB - Springer Singapore
CY - Singapore
ID - Kanimozhi2017
ER -

TY - CHAP
AU - Zerabi, Soumeya
AU - Meshoul, Souham
AU - Khantoul, Bilel
PY - 2018
DA - 2018//
TI - Parallel Clustering Validation Based on MapReduce
BT - Advances in Computing Systems and Applications
PB - Springer International Publishing
CY - Cham
ID - Zerabi2018
ER -

TY - JOUR
AU - Hosseini, B.
AU - Kiani, K.
PY - 2018
DA - 2018//
TI - FWCMR: a scalable and robust fuzzy weighted clustering based on MapReduce with application to microarray gene expression
JO - Expert Syst Appl
VL - 91
UR - https://doi.org/10.1016/j.eswa.2017.08.051
DO - 10.1016/j.eswa.2017.08.051
ID - Hosseini2018
ER -

TY - JOUR
AU - Reddy, K. H. K.
AU - Pandey, V.
AU - Roy, D. S.
PY - 2019
DA - 2019//
TI - A novel entropy-based dynamic data placement strategy for data intensive applications in Hadoop clusters
JO - Int J Big Data Intell.
VL - 6
UR - https://doi.org/10.1504/IJBDI.2019.097395
DO - 10.1504/IJBDI.2019.097395
ID - Reddy2019
ER -

TY - STD
TI - Beck G, Duong T, Lebbah M, Azzag H, Cérin C. A Distributed and approximated nearest neighbors algorithm for an efficient large scale mean shift clustering. arXiv Prepr arXiv190203833. 2019.
ID - ref24
ER -

TY - JOUR
AU - Gates, A. J.
AU - Ahn, Y.-Y.
PY - 2017
DA - 2017//
TI - The impact of random models on clustering similarity
JO - J Mach Learn Res.
VL - 18
ID - Gates2017
ER -

TY - JOUR
AU - Heidari, S.
AU - Alborzi, M.
AU - Radfar, R.
AU - Afsharkazemi, M. A.
AU - Ghatari, A. R.
PY - 2019
DA - 2019//
TI - Big data clustering with varied density based on MapReduce
JO - J Big Data.
VL - 6
UR - https://doi.org/10.1186/s40537-019-0236-x
DO - 10.1186/s40537-019-0236-x
ID - Heidari2019
ER -

TY - STD
TI - Kenyon C, others. Best-Fit Bin-Packing with Random Order. In: SODA. 1996. p. 359–64.
ID - ref27
ER -

TY - STD
TI - Data set. https://archive.ics.uci.edu/ml/. Accessed 9 Feb 2018.
UR - https://archive.ics.uci.edu/ml/
ID - ref28
ER -

TY - STD
TI - Data set. ftp://ftp.ncdc.noaa.gov/pub/data/uscrn/products/subhourly01. Accessed 11 Feb 2019.
ID - ref29
ER -

TY - BOOK
AU - Sammut, C.
AU - Webb, G. I.
PY - 2011
DA - 2011//
TI - Encyclopedia of machine learning
PB - Springer
CY - New York
ID - Sammut2011
ER -

TY - JOUR
AU - Rand, W. M.
PY - 1971
DA - 1971//
TI - Objective criteria for the evaluation of clustering methods
JO - J Am Stat Assoc
VL - 66
UR - https://doi.org/10.1080/01621459.1971.10482356
DO - 10.1080/01621459.1971.10482356
ID - Rand1971
ER -