Cleaning noisy data ‘almost 70%’ of machine learning labour