Teaching LLMs to See: Training a Phi-4 × FastViTHD Vision–Language Model (VLM) | Seattle .

Members-Only

Recent Talks & Demos are for members only

Exclusive feed

You must be an AI Tinkerers active member to view these talks and demos.

June 27, 2025 · Seattle

Phi-4 + FastViT-HD VLM

This talk explains how to combine a text-only Phi-4 LLM with FastViT-HD image encoder to build and fine-tune an efficient open-source Vision-Language Model.

Overview
Links
Tech stack