What are the practical challenges of implementing interactive vision-language systems in real-world scenarios?

Answer not yet generated.