Differentiate Agglomerative and Divisive Hierarchical Clustering?

4 years ago

Data Mining and Data Warehousing

Agglomerative Hierarchical clustering method works on the bottom-up approach.

In Agglomerative hierarchical method, each object creates its own clusters. The single Clusters are merged to make larger clusters and the process of merging continues until all the singular clusters are merged into one big cluster that consists of all the objects.

Divisive Hierarchical clustering method works on the top-down approach. In this method all the objects are arranged within a big singular cluster and the large cluster is continuously divided into smaller clusters until each cluster has a single object.

Hierarchical Agglomerative vs Divisive Clustering

Divisive clustering is more complex as compared to agglomerative clustering, as in case of divisive clustering we need a flat clustering method as “subroutine” to split each cluster until we have each data having its own singleton cluster.
Divisive clustering is more efficient if we do not generate a complete hierarchy all the way down to individual data leaves. Time complexity of a naive agglomerative clustering is O(n3) because we exhaustively scan the N x N matrix dist_mat for the lowest distance in each of N-1 iterations. Using priority queue data structure we can reduce this complexity to O(n2logn). By using some more optimizations it can be brought down to O(n2). Whereas for divisive clustering given a fixed number of top levels, using an efficient flat algorithm like K-Means, divisive algorithms are linear in the number of patterns and clusters.
Divisive algorithm is also more accurate. Agglomerative clustering makes decisions by considering the local patterns or neighbor points without initially taking into account the global distribution of data. These early decisions cannot be undone. whereas divisive clustering takes into consideration the global distribution of data when making top-level partitioning decisions.

0

Rajiv Shah

Rajiv Shah

Sep 25, 2021

More related questions

1. The Term “Business Intelligence” is also used as an alternative term for Data Mining. Justify appropriateness of term with comparison of the term “Artificial Intelligence”

2. Why Data Preprocessing is needed and which are the techniques used for data Preprocessing?

3. Explain Common Hadoop Shell Commands

4. What is Hadoop Architecture and Storage?

5. What is Big Data and Characteristics of Big Data V3s?

6. Explain Clustering, Spatial mining, Web mining, Text mining in brief

7. Explain usage of Data warehousing for information processing, analytical processing, and data Mining

8. What do you mean by data mart? What are the different types of data mart?

9. Explain meta data repository

10. Explain Data Warehouse Design Process in Detail

11. What is Data Warehouse? Explain it with Key Feature

12. What is Business Intelligence? Explain Business Intelligence in today’s perspective

13. How does the ANSI-SPARC architecture promote logical and physical data independence in databases?

14. What are the Major Issues and Challenges of Data Mining?

15. What is HOLAP?

16. What is Enterprise Warehouse?

17. What is Data Mart?

18. What are dependent and independent data marts?

19. What is Virtual Warehouse?

20. What is VLDB?

21. What are Research prototypes?

22. What is the difference between generic single-task tools and generic multi-task tools?

23. What are the areas in which data warehouses are used in present and in future?

24. What is DMQL?

25. What are the factors involved while choosing data mining system?

26. What is Text mining?

27. What is spatial data mining?

28. What are the DB Miner tool in data mining?

29. How data mining is used in health care analysis?

30. How data mining is used in banking industry?

31. Explain the types of data mining.

32. What does audio data mining mean?

33. What is OLAP?

34. What is OLTP?

35. Define Chameleon method?

36. What is CURE?

37. What is Hierarchical method?

38. What is CLARA and CLARANS?

39. What do u mean by partitioning method?

40. Define nominal, ordinal and ratio scaled variables?

41. Define Binary variables? And what are the two types of binary variables?

42. What are interval scaled variables?

43. What are the different types of data used for cluster analysis?

44. What are the requirements of cluster analysis?

45. What are the fields in which clustering techniques are used?

46. What is Clustering and Cluster Analysis?

47. What are the techniques to improve the efficiency of Apriori algorithm?

48. How to generate association rules from frequent item sets?

49. What is the purpose of Apriori Algorithm?

50. Describe the different classifications of Association rule mining.

51. Define support and confidence in Association rule mining.

52. How is a data warehouse different from a database?

53. Describe challenges to data mining regarding data mining methodology and user interaction issues.

54. Classifications of Data mining systems

55. What is Cluster Analysis?

56. What are the advanced database systems?

57. What is Descriptive model?

58. What is Predictive model?

59. What is the purpose of Data mining Technique?

60. What is Genetic algorithm?

61. What is meta learning?

62. Give few statistical techniques in data mining

63. What are the some of the data mining techniques?

64. What is the Architecture of a typical data mining?

65. What is the use of the knowledge base?

Questions Bank

View all Questions