
Simple macro model with three ingredients:
(1) data is task-specific: coding data is not the same as self-driving data; different tasks might also be differentially verifiable so accumulating data is harder/easier
(2) data accumulates endogenously: if the economy finds it worthwhile to do lots of coding we get lots of coding data
(3) data exhibits spillovers: data on one task might augment the productivity of another e.g., via transfer learning or via overlapping primitive skills (pic: some spillover patterns)