Developed a small web application using three different technologies/API

Client Name
US Based Client

 

  • Technology

    Google Cloud vision API, awe.js, and Catchoom

  • Domain

    Augmented Reality/CAD

Client Overview

The client aimed to bridge the virtual and physical worlds by providing real-time augmented information over physical objects through live video feeds. This approach would allow the client to create a personalized application experience for every new user. For instance, one customer could benefit from dynamic pricing, where pricing details, based on the number of sales, are augmented on each item in real time. This innovative solution was designed to capture and engage a larger market by offering customizable, interactive content for users across various industries.

Project Overview

By providing information from the virtual world, augmenting over the physical world’s object at runtime, in the live feed, is their way to approach the bigger market. This way the client can customize an application for every new user. For e.g., one client can have dynamic pricing depending upon the number of sales and this information is augmented on each article at runtime.

Challenges:

  • API Integration: Seamlessly connecting Google Cloud Vision API, Catchoom, and Awe.js while ensuring smooth interaction across systems.
  • Real-Time Processing: Ensuring fast, real-time image capture, query, and augmentation without delays or lag.
  • Diverse Image Handling: Managing various image qualities (lighting, angles) while maintaining consistent object and logo recognition.

Solutions:

To address these challenges, the team developed a web application using Google Cloud Vision API, Catchoom, and Awe.js. Though each technology followed a similar mechanism, their roles varied in how they processed the data. The solution was built as follows:

  • Web Application Setup: The team created a small-scale web application. When users pointed their mobile web browsers to a specific URL, a webpage opened, granting access to the device’s camera using the getUserMedia function in HTML5.
  • Live Image Capture: The mobile browser captured live images from the camera feed. These images were sent to the server for querying, which used the Google Cloud Vision API and Catchoom for logo and object detection.
  • Real-Time Augmentation: After the server processed the image query, the data was returned to the user’s mobile device. Using Awe.js, the web app augmented the live feed by overlaying relevant information—such as logos, dynamic pricing, or other object-specific data—directly on top of the detected articles in real time.

Benefits:

The augmented application provided several key advantages:

  • Seamless Integration: Combining Google Cloud Vision API, Catchoom, and Awe.js enabled real-time object recognition and data augmentation, creating a cohesive and interactive user interface.
  • Real-Time Information: The live feed allowed users to access up-to-date, augmented information on objects instantly, enhancing decision-making and creating a more immersive experience.

"ProtoTech helped us pioneer the concept of AR for retail products.
The work of ProtoTech was fundamental to better evaluate the many existing technologies, both based on image and pattern recognition. After testing different results, we decided to go with browser-based AR, being one of the first to adopt what is now becoming a standard for the sector. ProtoTech Team’s demonstrated to be quick in understanding our requirements, and to push their expertise in 3D formats into new directions, developing with new platforms and frameworks in record time"

Giorgio Mazzucchell

Need Help with Your Ongoing Projects?

Have questions? Connect with us today! Explore how our innovative technology can optimize your projects and drive success. Together, we’ll take your business to the next level!

Edit Template
Back