Cleaning duplicate data is a major problem that persists even though many works have been done to solve it, due to the exponential growth of data amount treated
Given a set of requirements structured as design problems, we can apply design patterns to solve each problem individually. Much of the published literature on