Imagine you’re analyzing sensor data. Suddenly one sensor shows -999°C. That’s an outlier — a single data point that can completely ruin your analysis.

🧩 What is factorization?

Matrix factorization means decomposing data $X$ into two non-negative components: $$ X \approx WH $$

Where $W$ contains “features” and $H$ shows how much of each is needed.

💡 The problem

Classical methods like NMF are sensitive to noise and outliers. When data is messy, analysis breaks down.

✨ The solution: Target Polish

“Polish” (verb) means to improve, refine. The authors propose correcting the data $X$ before factorization.

How does it work?

  1. Compute initial factorization: $\hat{X} = WH$
  2. Compare $X$ to $\hat{X}$
  3. If values deviate too much, correct them: $$ X’ = \text{clip}(X, \hat{X} - \delta, \hat{X} + \delta) $$
  4. Repeat the process.

📊 Does it work?

Yes! This method is:

  • robust to noise,
  • effective on both matrices and tensors,
  • easy to implement.

🧩 Summary

Target Polish is a method for “robust” machine learning — where we gently clean data instead of blindly trusting it.