A Computer Vision System for Auditory Scene Description for the Visually Impaired
This device is designed as a companion for visually impaired individuals searching for specific items in their environment. Imagine asking your device to find your "glasses" and then being guided directly to them with intuitive audio cues.
Here's how it works:
Speech Recognition:
The user operates the device through speech recognition. There are no complicated menus to navigate: the user presses a button and gives a simple voice command. The user names the item they are looking for and can add as much detail as they want for better accuracy. For instance, they could say "cup" or "tall, stainless steel black travel mug." Because the device needs only a button press and a spoken request, the user can operate it independently.
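As a rough illustration of how a spoken request might be turned into a search target, the sketch below splits a transcript into an object label and a list of descriptors. It assumes the speech has already been transcribed to text. The `LABEL_SYNONYMS` vocabulary and the `parse_request` function are hypothetical names made up for this example and are not taken from the device itself.

```python
import re

# Hypothetical vocabulary: spoken nouns mapped to detector class labels.
# The real device's label set is not described in the text above.
LABEL_SYNONYMS = {
    "cup": "cup",
    "mug": "cup",
    "glasses": "glasses",
    "spectacles": "glasses",
    "keys": "keys",
}


def parse_request(transcript):
    """Split a transcribed request into (label, descriptors).

    The last known noun in the phrase is used as the label, since the
    head noun of an English noun phrase usually comes last. Every other
    word is kept as a descriptor that could refine the search.
    Returns (None, descriptors) when no known noun is found.
    """
    words = re.findall(r"[a-z]+", transcript.lower())
    label = None
    descriptors = []
    for word in words:
        if word in LABEL_SYNONYMS:
            label = LABEL_SYNONYMS[word]
        else:
            descriptors.append(word)
    return label, descriptors
```

With this sketch, `parse_request("tall, stainless steel black travel mug")` returns `("cup", ["tall", "stainless", "steel", "black", "travel"])`, while `parse_request("cup")` returns `("cup", [])`. A real system would likely need a richer language model than a lookup table; this version only shows the shape of the step.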
Camera Activation and Object Detection:
Once the user has named the desired item, the device activates its built-in camera and object detection software. The user then sweeps the camera around the room, much like using a metal detector. Reliable detection matters here, since users, caregivers, and organizations all need to be able to trust the device. The device emits a beeping sound when the desired item enters the camera's view. Depending on preference, this sound can be replaced with a vibration. The feedback changes with the item's location: a faster and louder beep indicates that the item is closer to the center of the frame, which helps the user judge its horizontal and vertical position relative to where the camera is pointing.
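The relationship between the item's position in the frame and the beep can be sketched as follows. This is a minimal illustration under stated assumptions, not the device's actual implementation: it assumes the detector returns a bounding box as `(x_min, y_min, x_max, y_max)` in pixels, and the function names, interval range, and volume range are all made-up values chosen for the example.

```python
import math


def center_offset(box, frame_w, frame_h):
    """Return how far the box center is from the frame center.

    The result is normalized so that 0.0 means the item is exactly at
    the center of the frame and 1.0 means it is at a corner.
    """
    x_min, y_min, x_max, y_max = box
    cx = (x_min + x_max) / 2
    cy = (y_min + y_max) / 2
    # Offsets along each axis, scaled to the range -1.0 .. 1.0.
    dx = (cx - frame_w / 2) / (frame_w / 2)
    dy = (cy - frame_h / 2) / (frame_h / 2)
    return min(1.0, math.hypot(dx, dy) / math.sqrt(2))


def beep_feedback(offset, min_interval=0.1, max_interval=1.0):
    """Map a normalized offset to (seconds between beeps, volume).

    A smaller offset gives a shorter interval (faster beeps) and a
    higher volume. The numeric ranges are assumptions for illustration.
    """
    closeness = 1.0 - offset
    interval = max_interval - closeness * (max_interval - min_interval)
    volume = 0.3 + 0.7 * closeness  # volume on a 0.0 .. 1.0 scale
    return interval, volume
```

For a 640 by 480 frame, a box centered in the frame gives an offset of 0 and therefore the fastest, loudest beep, while a box in a corner gives an offset of 1 and the slowest, quietest beep. The same two numbers could drive a vibration motor instead of a speaker.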
Success Notification:
Finally, once the user has located the item with the help of the beeping guidance, the sound becomes a continuous tone. This signals that the object is directly in front of them, removing any uncertainty.
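One way to decide when to switch from beeping to the continuous tone is sketched below. The text does not say how the device makes this decision, so everything here is an assumption: the function name `feedback_mode`, the centering threshold, and the idea of requiring several consecutive centered frames to avoid flickering between modes.

```python
def feedback_mode(offsets, threshold=0.1, required_frames=3):
    """Choose the audio mode from a history of per-frame offsets.

    `offsets` holds one value per camera frame: a normalized distance
    from the frame center (0.0 = centered), or None when the item was
    not detected in that frame. The threshold and frame count are
    illustrative values, not measured ones.
    """
    recent = offsets[-required_frames:]
    centered = len(recent) == required_frames and all(
        o is not None and o <= threshold for o in recent
    )
    if centered:
        return "continuous"  # item held near the center: success tone
    if offsets and offsets[-1] is not None:
        return "beeping"     # item visible but not yet steadily centered
    return "silent"          # item not in view
```

Requiring a few consecutive centered frames is a design choice: a single centered frame during a fast sweep would otherwise trigger the success tone too early.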
This device gives visually impaired individuals greater independence and confidence when searching for everyday items. It combines speech recognition, object detection, and audio feedback into a single tool for finding things in their surroundings. Importantly, this is just the beginning. In the future, the model can be extended to recognize objects found in public places, such as park benches and garbage cans, which would make the device useful outside the home as well.