Memory-Augmented Vision-Language Agents for Persistent and Semantically Consistent Object Captioning | ScienceToStartup | ScienceToStartup