The Challenge#
Voice assistants lack visual understanding. Users need AI systems that can identify objects and answer questions about what they see.
The Solution#
Integrated OpenAI API and Google API for real-time object recognition using webcam, enabling multimodal AI interactions.
Key Achievement#
Enabled natural language queries about visual content, demonstrating the potential of AI-powered augmented reality applications.
Technologies Used#
Artificial Intelligence (AI), OpenAI API, Google API, Python, Computer Vision
