Joseph Crawford

  • Home
  • Family
  • Reviews
  • Resume
  • Work
  • GitHub
  • LinkedIn
AI

PHP Can See Now: Vision Capabilities for AI Agents Are Here

byJoseph Crawford•May 25, 2026•0
PHP Can See Now: Vision Capabilities for AI Agents Are Here

PHP Can See Now: Vision Capabilities for AI Agents Are Here

If you build in PHP, you've probably noticed the AI agent ecosystem passing you by. Python has LangChain, CrewAI, and a dozen other frameworks. TypeScript has Vercel's AI SDK and an ever-growing toolkit of options. PHP? PHP had Neuron AI—a solid start, but until recently, one critical capability was missing: your agents couldn't see.

That changed this month. Neuron AI shipped vision support, and it's more than a feature checkbox. It's a real shift for the PHP ecosystem, because it removes one of the last reasons to reach for a different language when you need image understanding in your application.

Let's break down what this means, how it works, and what you can build with it.

The Problem: PHP's Blind Spot

PHP powers roughly 77% of websites with a known server-side language. That's WordPress, Laravel, Symfony, and a long tail of custom applications. When a developer on one of those stacks wants to add AI—say, a chatbot, a content moderation pipeline, or a document processing agent—the natural move is to stay in PHP. The codebase is there. The team knows it. The infrastructure is already running.

But vision capabilities—image analysis, object detection, OCR, visual question answering—have been the domain of Python-first frameworks. Want your Laravel app to analyze an uploaded receipt? You'd typically spin up a Python microservice, pipe the image over HTTP, and glue the result back into your PHP workflow. It works, but it adds operational complexity, deployment overhead, and a hard dependency on a second language stack just for one feature.

Neuron AI's vision support closes that gap. You can now pass images directly to your PHP agents and get structured understanding back—all without leaving your existing codebase.

How It Works

The API is straightforward. You attach images to a UserMessage and send it through your agent's chat() method, just like you would a text prompt. Images can be provided either as URLs or base64-encoded content.

Using an Image URL

php
use NeuronAI\Chat\Messages\UserMessage;
use NeuronAI\Chat\Messages\Image;
$message = (new UserMessage("Describe this image"))
->addImage(new Image(
image: 'https://example.com/photo.jpg',
type: Image::TYPE_URL
));
$response = MyAgent::make()->chat($message);

Using Base64-Encoded Images

php
use NeuronAI\Chat\Messages\UserMessage;
use NeuronAI\Chat\Messages\Image;
$content = base64_encode(file_get_contents('/path/to/image.jpg'));
$message = (new UserMessage("What do you see in this image?"))
->addImage(new Image(
image: $content,
type: Image::TYPE_BASE64,
mediaType: 'image/jpeg'
));
$response = MyAgent::make()->chat($message);

That's it. No microservices, no API gateway, no language-switching. The image goes in alongside your text prompt, and the underlying LLM processes both together.

One thing to note: Ollama currently only supports base64-encoded images. If you're using Ollama as your provider, make sure you encode locally rather than passing URLs.

What You Can Actually Build With This

The announcement highlights several application categories, and they're worth walking through because they map directly to problems PHP developers face every day.

E-Commerce: Product Intelligence

If you're running a WooCommerce or Laravel-based store, vision agents can now:

  • Auto-generate product attributes from images—extract colors, materials, and styles without manual data entry
  • Power visual search where users upload a photo and find similar products in your catalog
  • Run quality checks on product images before they go live, flagging low-resolution shots, incorrect aspect ratios, or mismatched items

Content Management: Media Automation

CMS platforms process thousands of images. Vision agents can:

  • Generate alt text automatically for accessibility compliance and SEO
  • Moderate content by identifying inappropriate or off-brand images before they reach the public site
  • Describe visual content for screen readers, going beyond basic alt attributes to provide real detail

Document Processing: Data Extraction

This is where it gets particularly practical. PHP handles forms, invoices, and receipts on a massive scale. Vision agents can now:

  • Extract structured data from uploaded forms—names, dates, amounts, addresses
  • Scan receipts and invoices for automated expense tracking and bookkeeping
  • Classify documents by visual layout and content type, routing them to the right workflow without manual triage

Education and Beyond

The announcement also mentions educational applications like grading visual assignments and creating interactive tools that respond to student-drawn diagrams. That's niche, but it illustrates the range—anywhere a human needs to "look at something and decide," a vision-capable agent can now assist.

The Bigger Picture: Framework-Agnostic AI

Here's what makes this more than just a feature release: Neuron AI is deliberately framework-agnostic.

Most AI tooling in PHP either targets Laravel specifically or lives as a thin wrapper around a Python service. Neuron's approach is different. It's a standalone PHP package that integrates with Laravel, Symfony, WordPress, or plain vanilla PHP—no framework-specific dependencies, no coupling to a specific ORM or event system.

This matters for a few reasons:

Your AI layer survives framework upgrades. When Laravel ships a major version change, your agent code doesn't break. The same Neuron agent works across Laravel 10, 11, 12, and beyond, because it's not wired into Laravel's internals.

Your skills transfer across projects. Learn to build a vision agent in Neuron once, and that knowledge follows you whether your next project uses Symfony, WordPress, or something else entirely.

The community isn't siloed. Laravel developers, Symfony developers, and WordPress developers all contribute to and benefit from the same framework. That's a bigger ecosystem than any single PHP framework's community.

Neuron AI in Context

Neuron AI is still young—the project just hit v3, introducing a workflow-first architecture with multi-agent streaming and human-in-the-loop tool approval. It's got around 1,900 GitHub stars and an active community. It's not LangChain-scale, but it doesn't need to be. It needs to be the right tool for PHP developers who want AI capabilities without leaving their stack.

The vision release follows the v3 architectural overhaul, which replaced Neuron's original class-based design with a proper workflow engine. That's the foundation that makes multi-modal input—text plus images—feel natural rather than bolted on.

Getting Started

If you want to try this yourself:

bash
composer require inspector-apm/neuron-ai

Then set up an agent with your preferred LLM provider, attach images to your messages, and you're processing visual input from PHP. The documentation covers agent configuration, tool definitions, and RAG pipelines.

The framework supports multiple LLM providers, so you're not locked into a single model. Swap between OpenAI, Anthropic, Ollama, or any compatible provider without rewriting your agent logic.

Why This Matters

The PHP ecosystem has historically been an AI consumer, not an AI builder. You could call an OpenAI API from PHP, but building actual agentic systems—things that reason, use tools, manage state, and now see—meant switching languages.

Neuron AI's vision support doesn't just add a feature. It removes a barrier. PHP developers who were considering Python microservices just for image understanding can now do it natively. Teams that standardized on Laravel or Symfony don't need to hire a Python specialist to add visual intelligence to their applications.

That's the kind of shift that compounds. As more PHP developers build with these capabilities, the ecosystem generates patterns, libraries, and shared knowledge that make the next project easier. The flywheel spins.

PHP can see now. And that changes what PHP can build.


If you're working with PHP and AI agents, I'd love to hear what you're building. The Neuron AI GitHub Discussions are active, and the project is moving fast.

Tags:AIPHPDeveloper Tools

Comments

Previous ArticleBuilding Long-Running AI Agents That Actually Survive: A Practical Look at Google's ADK
Next ArticleTypeScript 7.0 Beta: The Compiler Rewritten in Go and What It Means for Your Projects

Popular

TypeScript 7.0 Beta: The Compiler Rewritten in Go and What It Means for Your Projects

TypeScript 7.0 Beta: The Compiler Rewritten in Go and What It Means for Your Projects

June 12, 2026
PHP Can See Now: Vision Capabilities for AI Agents Are Here

PHP Can See Now: Vision Capabilities for AI Agents Are Here

May 25, 2026
Building Long-Running AI Agents That Actually Survive: A Practical Look at Google's ADK

Building Long-Running AI Agents That Actually Survive: A Practical Look at Google's ADK

May 14, 2026
GPT-5.5 First Impressions: What OpenAI's Latest Model Means for Developers

GPT-5.5 First Impressions: What OpenAI's Latest Model Means for Developers

April 26, 2026
OpenAI Codex Expanded: A Developer's Guide to the New AI Pair Programmer

OpenAI Codex Expanded: A Developer's Guide to the New AI Pair Programmer

April 21, 2026

Tag Cloud

AIDevelopmentFamilyReviewsSportsWork

AI

How SafeRide Health Adopted AI in Engineering: Velocity Gains, Budget Lessons, and a Veteran Engineer's Perspective

How SafeRide Health Adopted AI in Engineering: Velocity Gains, Budget Lessons, and a Veteran Engineer's Perspective

April 11, 2026
Anthropic’s Mythos Moment Feels Familiar, and Not in a Good Way

Anthropic’s Mythos Moment Feels Familiar, and Not in a Good Way

April 10, 2026
Running This Blog With an AI Team, While Keeping It Human

Running This Blog With an AI Team, While Keeping It Human

April 9, 2026

Work

Anthropic’s Mythos Moment Feels Familiar, and Not in a Good Way

Anthropic’s Mythos Moment Feels Familiar, and Not in a Good Way

April 10, 2026
SafeRide Health: Building the Future of Healthcare Transportation

SafeRide Health: Building the Future of Healthcare Transportation

November 20, 2025

Family

Arcade1Up Infinity Game Table (32-inch) Review: Cool Concept, Compromised Reality After 5 Years

Arcade1Up Infinity Game Table (32-inch) Review: Cool Concept, Compromised Reality After 5 Years

April 11, 2026
Copyright © 2026 Joseph Crawford.
Powered by Gatsby, HybridMag and Spec-Kit.