Daniel Madalitso Phiri

Sep 18, 2024 • WeAreDevelopers LIVE

Vision for Websites: Training Your Frontend to See

Build web apps that see. Learn how to implement powerful visual search with vector embeddings in just a few lines of code.

#1about 1 minute

Defining vision as the ability to deduce and understand

The concept of vision for websites is redefined from simply seeing to the ability to deduce, understand, and act on information.

#2about 4 minutes

Demo of a multimodal e-commerce search application

A live demonstration showcases an e-commerce store where users can search for products using both text queries and by uploading images.

#3about 2 minutes

What is multimodality in artificial intelligence?

Multimodality enables search queries to use multiple media types like text, images, and audio to capture more context and improve user interaction.

#4about 2 minutes

Why multimodal AI creates richer user experiences

Multimodal interfaces provide more natural and context-aware interactions, moving beyond simple keyword searches to a more intuitive experience.

#5about 4 minutes

Differentiating generative AI from embedding models

Embedding models encapsulate information into numerical representations (vectors), unlike generative models which create new data.

#6about 4 minutes

How vector search works by measuring distance

Vector search operates by converting a query into an embedding and finding the closest, most semantically similar items in a multidimensional space.

#7about 2 minutes

Creating a unified space for multimodal search

Different data types like text, images, and audio are processed by specific encoders and plotted into a single, unified vector space for cross-modal queries.

#8about 9 minutes

Implementing text-based image search with Weaviate

A code walkthrough demonstrates how to build a text-to-image search feature using a Next.js frontend and a Weaviate backend with a `nearText` query.

#9about 4 minutes

Implementing visual search with an image query

The code for an image-to-image search is explained, showing how a base64 image is sent to the backend to perform a `nearImage` vector search.

#10about 2 minutes

Expanding vision to other creative applications

Beyond e-commerce, multimodal vision can be applied to creative use cases like movie recommenders, educational tools, and map navigation.

envelio
Köln, Germany

Remote

Senior

Python

JavaScript

Hubert Burda Media
München, Germany

€80-95K

Intermediate

Senior

JavaScript

Node.js

Hubert Burda Media
München, Germany

€65-80K

Intermediate

PHP

JavaScript

Defining vision as the ability to deduce and understand

Demo of a multimodal e-commerce search application

What is multimodality in artificial intelligence?

Why multimodal AI creates richer user experiences

Differentiating generative AI from embedding models

How vector search works by measuring distance

Creating a unified space for multimodal search

Implementing text-based image search with Weaviate

Implementing visual search with an image query

Expanding vision to other creative applications

Senior Fullstack Engineer (all genders)

Principal Full-Stack Engineer

Fullstack Developer

Matching moments

A tour of creative code demos and useful developer tools

WeAreDevelopers LIVE – PHP Is Alive and Kicking and More

Exploring modern tools for web interaction and analysis

WeAreDevelopers LIVE - the weekly developer show with Chris Heilmann and Daniel Cranney

The future of on-device AI in web development

Generative AI power on the web: making web apps smarter with WebGPU and WebNN

Will AI replace developers? An AI-built demo

From Syntax to Singularity: AI’s Impact on Developer Roles

Presenting live web scraping demos at a developer conference

Tech with Tim at WeAreDevelopers World Congress 2024

The future of web development is faster and simpler

The Eternal Sunshine of the Zero Build Pipeline

Discussing modern web development news and trends

WeAreDevelopers LIVE - GraalVM in action, Static Analysis insights and more

Exploring the future of AI in FinTech

OpenAI for FinTech: Building a Stock Market Advisor Chatbot

Featured Partners

Related Videos

Build UIs that learn - Discover the powerful combination of UI and AI

WeAreDevelopers LIVE - the weekly developer show with Chris Heilmann and Daniel Cranney

WAD Live 22/01/2025: Exploring AI, Web Development, and Accessibility in Tech with Stefan Judis

Web APIs you might not know about

Virtual Reality – The path to create your world

Modern Web Development with Nuxt3

Explore new web features before everyone else

WeAreDevelopers LIVE – AI vs the Web & AI in Browsers

Related Articles

From learning to earning

Design-Oriented Frontend Developer (m/f/d) Next.js

Frontend JavaScript Developer- Ai Training

Frontend JavaScript Developer- Ai Training

Frontend JavaScript Developer- Ai Training

Frontend JavaScript Developer- Ai Training

Frontend JavaScript Developer- Ai Training

Full-Stack Engineer

Frontend JavaScript Developer- Ai Training

Frontend Developer / Web Developer (Vue.js)