We cannot expect the partition made by WRR to behave like a random partition. We should therefore explain what is the rationale for comparing its P2 score to a random partition. I will discuss separately two possibilities. The first is that the significant phenomenon described in the paper was the result of some optimization in choosing the data and the second is the original research hypothesis of WRR.