Local vision-language model Expansion Ready Cost-effective

Next-gen Touchscreens
With Edge AI

Intelligent touchscreens integrated with locally hosted VLM to achieve low-latency, high-privacy AIoT experience. Expansion Ready to fit your needs.

Built-in Camera
Steve recognized
🔊 Welcome home, Steve!
🔊 I see an active issue with garage door sensor.
🎤 "Resolve the garage door sensor issue"
Checking connection...
Calibrating sensor...
Updating firmware...
🔊 All done! Sensor fully functional.
Dealer X
AI Touchscreen
AI agent with generated avatar
Request to VLM
Response from VLM
Local VLM Box
Local VLM Touch System Architecture Diagram

Current HMI solutions fall short

🧠

Need Modernization

Existing HMI solutions haven't kept pace with the latest innovations in multimodal LLMs. Users expect intelligent, conversational interfaces — not static button grids.

💰

Cloud & Library Costs

Adding LLM or AI features means ongoing cloud API costs or expensive library licenses. Per-query pricing makes scaling unpredictable and eats into margins.

🔒

Privacy Concerns

Facial recognition and voice commands processed in the cloud mean your biometric data leaves your premises. For many applications, this is simply not acceptable.

⏱️

No Real-Time Priority

Non real-time operating systems can't prioritize critical events. When an alarm triggers, Android and Linux panels offer "best effort" — not guaranteed response times.

📦

Supply Chain Issues

Finding the right hardware is a challenge. Long lead times, minimum order quantities, and component shortages make it difficult to get what you need, when you need it.

🔌

Limited I/O Options

Modern applications require integrating new sensors — radar, multiple cameras, environmental monitors. Most HMI panels lack the I/O flexibility to accommodate them.

Our solution addresses all of these challenges.

Complete system, delivered as one

Hardware and software designed together, optimized for local AI and real-time performance. Cost-effective design leveraging open-source platforms.

AI Touchscreen Display

AI Touchscreen Display

Thin client architecture — powerful enough for AI vision, light enough to stay responsive. Acts as an intelligent interface to your hub, panel, or automation system.

High Performance MCU

Dual RISC-V processor running at up to 400 MHz, designed for HMI applications and edge computing with rich I/O capabilities.

📡
Comprehensive Connectivity

2.4 and 5 GHz dual-band Wi-Fi 6, Bluetooth 5 (LE), and 802.15.4 for Thread, Zigbee, Matter, HomeKit, and MQTT support.

👤
On-Device AI Vision

Facial recognition for secure operations, processed entirely on the device. No cloud, no external servers, no privacy compromise.

🎥
Video Capabilities

Hardware video encoding for WebRTC streaming. Display IP camera feeds or use built-in camera for video intercom applications.

🔌
Expansion Ready

Available I/O for adding 60 / 77 GHz radar for 3D person sensing, dual cameras, co-processors for Z-Wave, proprietary wireless, or RS-485 bus protocols.

⏱️
RTOS Foundation

Real-time operating system delivers consistent, predictable performance — faster and more reliable than Linux or Android-based alternatives.

Local VLM Box

An edge AI platform for developing and deploying intelligent agents — completely offline with enterprise-grade capabilities.

🧠
Offline AI Processing

40 TOPS AI acceleration with Ara-2 Runtime SDK. Run 8B VLM models locally with OpenAI-compliant REST API — no internet required.

🤖
Agentic AI Framework

Built on Google ADK orchestrator for multi-agent workflows. Seamlessly integrate AI agents via Agent-to-Agent (A2A) protocol for complex task automation.

🔧
MCP Tool Integration

Extend capabilities with Model Context Protocol (MCP) tools. Connect to Home Assistant, time series databases, and custom backends with plug-and-play ease.

🐳
Containerized Deployment

Deploy at scale via Docker containers. Gateway server architecture enables easy monitoring and management of ecosystem components.

🎯
Flexible LLM Support

Bring any supported LLM to work within the orchestrator. Fine-tune with your RAG data for domain-specific responses.

🛡️
Safe AI Guardrails

Built-in safety mechanisms ensure appropriate, reliable responses for consumer and enterprise deployments.

📱
Ready-to-Use Client Apps

Pre-built client agent / meet apps for Android, iOS, Desktop, Web, and Embedded Linux — get started instantly on any platform.

Local VLM Box - APC880E Edge AI Platform

Multiple Display Sizes

From compact interfaces to full tablet experiences — same platform, same software stack.

4" Compact Toys, Thermostats
7" Standard Home Panels, Intercoms
10" Large Factory Floor, Smart Hubs

Built for Your Brand

We deliver complete hardware and software solutions tailored to your requirements.

  • Custom branding and logos on hardware
  • Light or dark body color options
  • GUI light and dark mode themes
  • Port your existing interface to our platform
  • VLM fine-tuning with your domain knowledge
  • Custom expansion modules and sensors
🏠

Local First

All core functionality runs on your premises. No cloud accounts, no subscriptions, no data leaving your network.

📦

One-Time Purchase

Touchscreen and VLM are one-time costs. Engineering hours available for customization, porting, and expansion.

🔗

Thin Client Architecture

No heavy processes on the display. Clean messaging interface with your hub, panel, or automation system.

🤖

AI Agent Ready

Connect to local AI agents like Qwen3 VL or cloud-based services. MCP support for extensible AI workflows.

Where Local VLM Touch fits

🏠

Alarm & Automation Touchscreens

Intelligent display paired with your hub or panel. Facial recognition for secure disarm, natural language control, and seamless integration with existing automation systems.

🌐

IoT Solutions

Rich connectivity options — Thread, Zigbee, Matter, HomeKit, MQTT — make this the ideal interface for IoT deployments. Display sensor data, control devices, monitor systems.

🏭

Industrial Automation

Real-time operating system delivers consistent performance for machine interfaces. Expansion options for RS-485 bus protocols and industrial sensors. Rugged and reliable.

🤖

LLM Robots

Expressive face display with local AI vision and VLM connectivity. Perfect for social robots, assistants, and interactive installations requiring natural interaction.

🧒

Kids Learning Companions

Safe, offline AI with built-in guardrails. No inappropriate content, no data collection, no cloud dependency. Parents can trust what their children interact with.

Coming soon

More information about our team and mission will be available here.

Frequently Asked Questions

Common questions about our platform and technology.

No, ARM64 apps cannot run directly on the ESP32-P4. The ESP32-P4 uses a RISC-V 32-bit architecture and runs FreeRTOS (a real-time operating system), not Linux. ARM64 binaries are incompatible — it's a completely different instruction set and operating environment.

However, the VLM Box runs on ARM64 Linux (i.MX8M Plus with Cortex-A53), so existing ARM64 Linux applications can potentially run there with minimal modifications. This architecture gives you the best of both worlds: a responsive RTOS-based touchscreen for HMI, and a Linux-based edge computer for complex processing tasks.

The ESP32-P4 can perform basic face detection using Espressif's ESP-WHO framework. However, for secure face recognition with liveness detection (anti-spoofing), the processing requirements exceed what the ESP32-P4 can handle in real-time.

For advanced biometric applications, we use a two-step approach: the ESP32-P4 handles initial face detection, then streams the camera frames along with Time-of-Flight (ToF) sensor data to the i.MX8M Plus edge computer. The i.MX8M Plus runs the liveness detection algorithm using ToF depth data to distinguish real faces from photos, videos, or masks — all processed locally without cloud dependency.

The ESP32-P4 is purpose-built for Human-Machine Interface (HMI) applications. Its RTOS foundation delivers consistent, predictable response times — critical for touch interfaces. It handles display rendering, touch input, camera preview, audio I/O, and network communication with sub-millisecond latency.

More powerful Linux-based processors introduce unpredictable latency, higher power consumption, longer boot times, and greater complexity. By separating the UI (ESP32-P4) from heavy compute (VLM Box), we optimize each component for its specific role.

No, all core functionality runs offline. The touchscreen and VLM Box communicate over your local network. AI processing, face recognition, voice commands, and VLM inference all happen on-premises — no cloud accounts, no subscriptions, no data leaving your network.

Internet connectivity is optional and only needed if you want features like remote access, OTA updates, or integration with external services.

The touchscreen (ESP32-P4) is the user interface — it handles display, touch, camera, microphone, speaker, and real-time interactions. It runs an RTOS for instant responsiveness and can perform basic on-device AI like face detection.

The VLM Box is an edge AI computer with a 40 TOPS NPU. It handles compute-intensive tasks: running 7B Vision Language Models, advanced face recognition, natural language understanding, and complex reasoning — all locally. Think of it as a local AI server that your touchscreen can query.

Yes. The VLM can be fine-tuned with your own RAG (Retrieval-Augmented Generation) data to provide domain-specific responses. You can customize the knowledge base, personality, and response style for your specific application — whether that's a smart home assistant, industrial support system, or customer service kiosk.

The touchscreen UI is also fully customizable with your branding, color schemes, and interface layouts.

Join Our Waitlist

Thank you for your interest in Local VLM Touch! Join our waitlist to receive updates on our development progress and be among the first to access our platform.

📬
Quarterly Updates Stay informed on development milestones and new features
🚀
Early Access Be first in line when we launch
💬
Direct Communication Questions? Reach us at localvlmtouch@gmail.com

Sign up for updates

We respect your privacy. No spam, ever.

Let's discuss your project

Tell us about your application. We'll respond within 24 hours to schedule a conversation.

Free initial consultation
NDA available on request
Custom solutions for your needs

Your information is kept private and never shared.