We examine the historical development and underlying principles of foundation models realized in language and vision, and propose how physics-infused machine learning interaction potentials could ...