Hello Chris, On Sat, Sep 14, 2019 at 05:27:31PM -0300, Chris Chinedozie wrote: > I have a dataset and I divided it into 2 datasets RANDOMLY A and B, where A > is 70% of the main dataset and B is 30%... Then I applied Apriori algorithm > on the both A and B separately, generating its rules..
My best guess is that you don't want to do that. Association rules hugely rely on support for building the itemset. And then confidence for the orientation of the rule. Both are very simple metrics. Yet if you divide your dataset, this will lead to very different itemsets due to the algorithm. > I want to compare rules from dataset A to rules of dataset B Can you precise « compare »? > example: > A has the following rules > [(sex=0,age=1),(sex=1,age=1,money=0),(sex=0,age=2,money=2)] > > B has the following rules > [(sex=0,age=1,money=1),(sex=0,age=1,money=0),(sex=1,age=1,money=0)] Aren't these itemsets rather than rules? > A(sex=0,age=1) => B(sex=0,age=1,money=1) returns TRUE > A(sex=1,age=1,money=0) => B(sex=1,age=1,money=0)) returns TRUE > A(sex=0,age=2,money=2) => B(sex=1,age=1,money=0) returns FALSE I don't see rules, nor sense. Can you clarify? Thanks! -- Benoît Vaillant
signature.asc
Description: PGP signature
______________________________________________ R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.