multimodal

Here are 35 public repositories matching this topic...

Mintplex-Labs / anything-llm

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

mcp no-code ai-agents multimodal rag vector-database llm localai local-llm ollama llm-webui lmstudio agent-framework-javascript deepseek llama3 custom-ai-agents mcp-servers deepseek-r1 qwen3

Updated Jun 18, 2025
JavaScript

lxe / llavavision

Star

A simple "Be My Eyes" web app with a llama.cpp/llava backend

machine-learning ai computer-vision artificial-intelligence webapp llama multimodal llm llamacpp local-llm

Updated Nov 28, 2023
JavaScript

fanglu0411 / sgs

Star

SGS, is a user-friendly, collaborative and versatile browser for visualizing single-cell and spatial multiomics data.

visualization single-cell genome-browser sgs multimodal scrna scatac zarr anndata mudata spatial-omics sceqtl scmethylc schic

Updated Mar 4, 2025
JavaScript

aymenfurter / smartrag

Star

Deep Research through Multi-Agents, using GraphRAG

azure openai multi-agent-systems autogen multimodal voice-mode llm graphrag gpt-4o deep-research

Updated Nov 10, 2024
JavaScript

rustic-ai / rustic-ui-components

Star

React component library for crafting user-friendly and engaging conversational experiences

chat ai reactjs mui reactjs-components conversational-ai multimodal

Updated Jun 17, 2025
JavaScript

aj-archipelago / cortex

Star

Simplify and accelerate AI-powered application development with structured interfaces to models and powerful prompt execution environments.

graphql router ai rest-api entities gemini openai llama claude multimodal vertex-ai llm chatgpt

Updated Jun 20, 2025
JavaScript

sutdcv / SUTD-TrafficQA

Star

[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events

paper annotations dataset vqa cvpr video-qa vqa-dataset traffic-events multimodal multimodal-deep-learning cvpr2021 video-reasoning

Updated Aug 19, 2024
JavaScript

phetsims / paper-land

Star

Build and explore multimodal web interactives with pieces of paper!

javascript open-source community paper ar augmented codesign multimodal

Updated Mar 13, 2025
JavaScript

taco-group / DecAlign

Star

A novel cross-modal decoupling and alignment framework for multimodal representation learning.

alignment decoupling multimodal-learning multimodal-sentiment-analysis multimodal multimodal-deep-learning

Updated Mar 19, 2025
JavaScript

aws-samples / improve-employee-productivity-using-genai

Star

Employee Productivity GenAI Assistant Example is an innovative code sample and architecture pattern designed to enhance writing tasks efficiency using AWS serverless technologies and Amazon Bedrock's generative AI models.

aws aws-lambda aws-s3 aws-apigateway aws-serverless aws-dynamodb aws-sam multimodal servereless aws-cloud9 generative-ai anthropic-claude genai aws-bedrock bedrock-claude-llm

Updated Apr 29, 2025
JavaScript

pixeltable / pixelbot

Star

Multimodal Infinite Memory AI Agent

agent ai chatbot context-aware chatbot-framework multimodal llm agentic-ai

Updated May 6, 2025
JavaScript

rimmi21-zz / Alexa-APL-Fact-Skill

Star

Sample skill which demonstrates the new Alexa Presentation Language (APL). The multi modal skill functionality is same as Alexa Fact Skill template it will select a fact at random and tell it to the user when the multi modal skill is invoked and is compatible with devices having display.

Updated Jun 26, 2019
JavaScript

aws-samples / semantic-image-search-for-articles

Star

How you can add semantic search to your applications. This sample shows how you can use a multimodal model to find images which are semantically similar to some text. New blog coming out soon.

search aws semantic vector multimodal vector-search generative-ai

Updated Dec 1, 2024
JavaScript

palubad / MMTS-GEE

Star

Google Earth Engine tool to generate multi-modal and multi-temporal datasets, including spatially and temporally aligned Sentinel-1 SAR data, Sentinel-2 multispectral data, weather and DEM-based data. A supplementary material for Paluba et al. 2024: "Identification of Optimal Sentinel-1 SAR Polarimetric Parameters for Forest Monitoring in Czechia

machine-learning time-series dataset digital-elevation-model remote-sensing google-earth-engine earth-observation gee time-series-analysis synthetic-aperture-radar multitemporal-remote-sensing sentinel-2 earth-engine multimodal sentinel-1 era5-land multitemporal-data copernicus-dem

Updated Jun 8, 2025
JavaScript

benursu / Afrosquared-ForkOnTheRoad

Star

Amazon Alexa Skill - "Alexa, ask Fork On The Road"

nodejs alexa webgl multimodal

Updated Mar 24, 2019
JavaScript

lab-rasool / MINDS

Star

🧠 | Multimodal Integration of Oncology Data System

data machine-learning deep-learning cancer nih oncology multimodal gdc-portal

Updated May 31, 2025
JavaScript

msai-cereal / ai_fitness_trainer_v2

Star

Web-Based Exercise Posture Evaluation and AI Voice Feedback System

computer-vision fitness-app buildship multimodal openai-api yolov8

Updated Nov 30, 2024
JavaScript

josemariagarcia95 / hera-system

Star

Three-level multimodal emotion recognition framework to detect emotions combining different inputs with different formats.

detector affective-computing detect-emotions affective multimodal pad-form ensembler

Updated Dec 9, 2022
JavaScript

cdaein / supercut

Sponsor

Star

Create a supercut montage video with Gemini LLM

montage supercut gemini-api multimodal llm

Updated Jul 26, 2024
JavaScript

gsbingo17 / spanner-media-search-demo

Star

This repository contains the implementation of a media search application using Google Cloud Spanner and Vertex AI for generating and searching embeddings.

embeddings spanner multimodal vector-database

Updated Oct 31, 2024
JavaScript

Improve this page

Add a description, image, and links to the multimodal topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multimodal topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

multimodal

Here are 35 public repositories matching this topic...

Mintplex-Labs / anything-llm

lxe / llavavision

fanglu0411 / sgs

aymenfurter / smartrag

rustic-ai / rustic-ui-components

aj-archipelago / cortex

sutdcv / SUTD-TrafficQA

phetsims / paper-land

taco-group / DecAlign

aws-samples / improve-employee-productivity-using-genai

pixeltable / pixelbot

rimmi21-zz / Alexa-APL-Fact-Skill

aws-samples / semantic-image-search-for-articles

palubad / MMTS-GEE

benursu / Afrosquared-ForkOnTheRoad

lab-rasool / MINDS

msai-cereal / ai_fitness_trainer_v2

josemariagarcia95 / hera-system

cdaein / supercut

gsbingo17 / spanner-media-search-demo

Improve this page

Add this topic to your repo