HOUSTON AI DEVELOPER

Houston AI Developer — Custom Models for Texas Businesses

I am a Houston AI developer building self-hosted LLM systems, RAG pipelines, and custom AI automation for Houston-area enterprises. 25 years in computer science, U.S. Army veteran, single-principal delivery. The Houston buyer pool — energy, healthcare, Microsoft-shop enterprises — has compliance and cost economics that often rule out the SaaS-API path; this page exists for those buyers.

Houston AI demand — what I see in the metro

The Houston AI buyer pool is unusual. The Texas Medical Center generates more healthcare AI demand than any city outside Boston, but most of that demand is gated by HIPAA and risk-management constraints that rule out third-party LLM APIs. The Energy Corridor and Houston Ship Channel generate operations-AI demand at scale — predictive maintenance, claim review, contractor-document extraction — but with data classification rules that require self-hosted models. Microsoft-shop enterprises across the metro want AI features inside the .NET / Azure AD perimeter they already operate, not bolted onto an OpenAI account that lives outside their compliance posture. All three converge on the same shape: custom AI built by someone who will run it inside the buyer\'s perimeter.

I work as that engineer. The model is fine-tuned on the buyer\'s corpus. The inference runs on the buyer\'s hardware (or the buyer\'s Azure tenant). The eval harness, monitoring, and deploy runbook all belong to the buyer. There is no third-party API in the production path. There is no per-token fee compounding for the lifetime of the application.

The trade is up-front capex for zero-marginal-cost inference at scale. For Houston buyers whose monthly OpenAI / Anthropic / Bedrock bill has crossed $20K, the economics tilt before the engagement ends. For buyers running compliance-gated workloads, the trade is non-economic — the third-party path is simply not allowed.

What I build for Houston AI buyers

  • Self-hosted LLM systems. Llama 3 / Llama 4 / Mistral / Qwen fine-tuned on the buyer\'s corpus, served via llama.cpp, vLLM, or Triton inside the buyer\'s infrastructure.
  • RAG pipelines. Vector stores (pgvector, Qdrant), citation-first answers, versioned source documents, audit trails for healthcare and energy operations.
  • Document-extraction workflows. Energy contractor documents, healthcare claim forms, insurance correspondence — structured data out of unstructured inputs.
  • Internal copilots. Operations-team knowledge surfaces, customer-service triage, ticket categorization, on-call runbooks turned interactive.
  • Eval + monitoring tooling. The harness that proves the model is good enough on day one; the monitoring that proves it stays good in week 26.

Houston verticals I work in

  • Texas Medical Center healthcare-adjacent. Scheduling, claim review, audit logging, contractor portals, internal triage — not directly clinical work, which belongs to FDA-cleared specialists.
  • Energy operations. Exploration data review, contractor-document extraction, predictive maintenance pipelines, operations dashboards.
  • Microsoft-shop enterprises. AI features inside the Azure AD / Entra ID perimeter, Power BI-adjacent dashboards, Microsoft Graph integrations.
  • Mid-market accounting + B2B services. Customer-service AI, document-extraction, internal copilots for the operations team.

How a Houston AI engagement is scoped

Sprint zero is the eval harness — the golden set and the measurable success criterion before any code ships. Sprint one is the first model. Sprint two is the production deploy. Sprint three is the monitoring. Total engagement typically runs 3-6 months. In-person discovery in the Houston metro is available for serious buyers.

FREQUENTLY ASKED

Do you work with Texas Medical Center buyers?

Yes — for healthcare-adjacent (not directly clinical) workflows. Scheduling, claim review, audit logging, contractor portals, internal triage. I do not build FDA-regulated clinical decision support; that work belongs to specialists with the relevant certifications.

Can the model run inside our Azure tenant?

Yes — Azure ML, Azure OpenAI (Microsoft-tenant scoped), or self-hosted on Azure VMs running llama.cpp / vLLM. The deployment shape matches the buyer's existing Microsoft / Azure perimeter rather than introducing a third-party API.

What is your hourly rate?

Bespoke, scoped per engagement, written into SOW before contract. AI engagements are typically priced per-phase rather than hourly: eval harness; first working model; production deploy; monitoring.

Can you meet in person in The Woodlands, Sugar Land, or the Energy Corridor?

Yes — a 30-minute coffee meeting in the Houston metro is the recommended way to evaluate fit. Energy Corridor, The Woodlands, Sugar Land, Katy, Pearland, downtown Houston, and Galveston are routine. Beaumont and College Station are reachable with a few days notice.

Why self-hosted instead of OpenAI / Anthropic?

Three reasons, in order: compliance posture (healthcare and energy frequently rule out third-party APIs), monthly cost at scale (the math tilts above $20K/month in per-token fees), and data classification (some buyer data simply cannot leave the perimeter). For buyers whose problem does not have any of those constraints, a wrapper is the right answer and I will say so.

RELATED

Scope This Engagement

Single principal, plan first, working code on every checkpoint.

Start a Conversation