What are the techniques to improve the efficiency of the Apriori algorithm?

4 years ago

Data Mining and Data Warehousing

Techniques to improve the efficiency of the Apriori algorithm

Hash based technique
Transaction Reduction
Partitioning
Sampling
Dynamic itemset counting
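
As an illustration of one technique from the list, transaction reduction can be sketched as follows. The idea is that a transaction containing no frequent K-itemset cannot contain any frequent (K+1)-itemset, so later scans can skip it. The function name `reduce_transactions` and the toy data are assumptions made for this example, not part of the original answer.

```python
def reduce_transactions(transactions, frequent_k_itemsets):
    """Keep only transactions that contain at least one frequent k-itemset.

    A transaction with no frequent k-itemset cannot contribute to the
    count of any (k+1)-itemset candidate, so it is dropped from later scans.
    """
    return [t for t in transactions
            if any(itemset <= t for itemset in frequent_k_itemsets)]


# Hypothetical toy database, for illustration only.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "eggs"},
]

# Assume these are the frequent 2-itemsets found in the previous pass.
frequent_2 = [frozenset({"bread", "milk"}), frozenset({"bread", "butter"})]

reduced = reduce_transactions(transactions, frequent_2)
print(len(transactions), "->", len(reduced))
```

The saving grows with each pass, because fewer transactions survive as K increases.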

Apriori Algorithm – Frequent Pattern Algorithms

Apriori is one of the earliest algorithms proposed for frequent itemset mining. It was introduced by R. Agrawal and R. Srikant in 1994, building on earlier work on association rule mining. The algorithm uses two steps, “join” and “prune”, to reduce the search space. It is an iterative, level-wise approach: the frequent K-itemsets found in one pass are used to generate the candidate (K+1)-itemsets for the next.

Apriori says:

An itemset I is not frequent if its support, written P(I), falls below the minimum support threshold:

If P(I) < minimum support threshold, then I is not frequent.
If an item A is added to I, the resulting itemset I+A cannot occur more often than I, so P(I+A) < minimum support threshold and I+A is not frequent either.
In general, if an itemset has support below the minimum support threshold, then all of its supersets also fall below it and can be ignored. This property is called the antimonotone property.
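
The antimonotone property can be checked on a small example. This is a minimal sketch: the `support` helper, the toy transactions, and the threshold of 0.5 are all assumptions chosen for illustration.

```python
def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


# Hypothetical toy database, for illustration only.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "eggs"},
]
min_support = 0.5  # assumed threshold

s_eggs = support({"eggs"}, transactions)               # in 1 of 4 transactions
s_eggs_milk = support({"eggs", "milk"}, transactions)  # superset of {eggs}

# Adding an item can never raise the support, so once {eggs} is below
# the threshold, every superset of {eggs} is below it as well.
print(s_eggs < min_support, s_eggs_milk <= s_eggs)
```

Because of this, Apriori never needs to count any candidate that contains `eggs` once `{eggs}` itself is found to be infrequent.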

The steps followed in the Apriori Algorithm of data mining are:

Join Step: This step generates candidate (K+1)-itemsets by joining the set of frequent K-itemsets with itself.
Prune Step: This step removes any candidate that has an infrequent K-item subset, since by the antimonotone property it cannot be frequent. The database is then scanned to count the remaining candidates, and any candidate that does not meet the minimum support is regarded as infrequent and removed. This step is performed to reduce the size of the candidate itemsets.
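
The join and prune steps above can be sketched in a few lines of Python. This is an illustrative sketch rather than a reference implementation: the function name `apriori`, the parameter `min_count` (an absolute support count, assumed here instead of a fraction), and the toy data are all hypothetical.

```python
from itertools import combinations


def apriori(transactions, min_count):
    """Return {itemset: count} for every itemset seen in >= min_count transactions."""
    transactions = [frozenset(t) for t in transactions]

    # Pass 1: count single items and keep the frequent ones.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    frequent = {s: c for s, c in counts.items() if c >= min_count}

    result = dict(frequent)
    k = 1
    while frequent:
        prev = set(frequent)
        # Join step: combine frequent k-itemsets into (k+1)-item candidates.
        candidates = {a | b for a in prev for b in prev if len(a | b) == k + 1}
        # Prune step: drop candidates that have an infrequent k-item subset.
        candidates = {c for c in candidates
                      if all(frozenset(s) in prev for s in combinations(c, k))}
        # Scan the database to count the surviving candidates.
        counts = {c: sum(c <= t for t in transactions) for c in candidates}
        frequent = {s: c for s, c in counts.items() if c >= min_count}
        result.update(frequent)
        k += 1
    return result


# Hypothetical toy database, for illustration only.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "eggs"},
]
result = apriori(transactions, min_count=2)
for itemset, count in sorted(result.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), count)
```

In this run the candidate {bread, butter, milk} is pruned without a database scan, because its subset {butter, milk} is not frequent.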

Rajiv Shah

Sep 24, 2021
