<k1, v1>
k1:
Line number (or other identifier for the data point) v1:
Point (coordinates)
<k2, v2>
k2:
Nearest center (centroid index) v2:
Data point (coordinates)
<k2, List<v2>>
<k3, v3>
k3:
New center point (updated centroid) v3:
Points that belong to this centroid