↓ Skip to main content

73 words·1 min

AI/ML OpenAI Computer Vision Python

LLM Voice Assistant

The Challenge
#

Voice assistants lack visual understanding. Users need AI systems that can identify objects and answer questions about what they see.

The Solution
#

Integrated OpenAI API and Google API for real-time object recognition using webcam, enabling multimodal AI interactions.

Key Achievement
#

Enabled natural language queries about visual content, demonstrating the potential of AI-powered augmented reality applications.

Technologies Used
#

Artificial Intelligence (AI), OpenAI API, Google API, Python, Computer Vision