
Local LLMs: Where to Actually Start, with Vincent Deeney (A Chat with Cross-Functional Experts)

If you’re an engineer or quality professional wondering how to actually start using AI beyond the chatbots and CAD plugins your company already provides, this episode is your starting point.
Vincent Deeney (technologist and AI enthusiast) joins Dianna to break down how to run large language models (LLMs) on your own hardware, why that matters for data privacy, and how to get real work done with modest equipment. No programming required.
This interview is part of our series, “A Chat with Cross Functional Experts”.
About Vincent
Vincent Deeney is a director who has spent the past five years helping organizations navigate complex software decisions. Before that, he built a nearly two-decade career specializing in data quality, master data management, and governance. He holds a Master’s in Organization Leadership and, perhaps most tellingly, an undergraduate degree in Philosophy. He’s someone who asks the right questions before reaching for a solution. Lately he’s been exploring local LLMs on his own time.
What Vincent and Dianna Talk About
They walked through the practical side of running local LLMs.
We covered why data privacy and token costs push people toward local setups, what hardware you actually need (from your existing laptop to a dedicated AI machine like the DGX Spark), and which free tools — Ollama, LM Studio, OpenCode, Hermes Agent — make it possible to get started without writing code. We dug into how local models compare to frontier models like Claude, and Vincent shared his approach to using both together: plan with the frontier model, execute with the local one.
We also talked about managing context windows, using sub-agents, and connecting local models to tools you’re already using. The episode wraps with a practical challenge to get you started this week.


Ask a question or send along a comment.
Please login to view and use the contact form.