Hi
I have a dataset of twin pairs with many comparisons of interest. One of these is comparing between twins with active disease (in a flare) and twins with inactive disease (in remission).
I have about 4 twin pairs where one has active disease, and the other doesn't. I also have about 10 pairs where both are in remission and about 5 pairs where both have active disease.
I would like to include all the data ideally, since I'll have much less power if I only use the pairs that have one representative with each disease status. I know that duplicateCorrelation
has been recommended for use with incompletely paired designs like this one. But I'm wondering if this type of design is too far beyond what it's designed for, since one might expect that the correlations between twins where both have the same status might be very different than the correlations between twins where they have different status. Or is the method robust enough to handle this kind of situation?
Thank you
Thank you for the guidance! I wish I could share more details of the data