Why DIY ChatGPT Bots Fail in Home Services (and What to Do Instead)
A British Columbia tribunal held Air Canada liable in 2024 for its chatbot's hallucination. The same legal logic applies when your DIY ChatGPT bot quotes the wrong price to a homeowner.
Key takeaways
- Custom GPTs cap knowledge at 20 uploaded files and cannot be embedded on a public website
- Air Canada was held liable in February 2024 for its chatbot's false advice about bereavement fares
- 26% of calls to home service businesses go unanswered per Invoca, and under 3% of voicemails get a callback
An HVAC contractor on r/HVAC put it plainly in the ACHR News summary: "ChatGPT said it was the capacitor." It almost never is. That confidence is exactly the problem when you drop raw ChatGPT on your website or give your CSR a Custom GPT to draft customer replies.
This post lays out the specific ways DIY ChatGPT bots break down in home services, the legal exposure you inherit, and the setup that actually works.
The 20-file ceiling
Per OpenAI's own documentation summarized in CustomGPT.ai's integration guide, Custom GPTs max out at 20 knowledge files per GPT. A plumbing shop's SOP binder, price book, brand guide, tech training docs, and emergency scripts alone hit that cap.
You can combine files. PDFs grow huge. The GPT's retrieval quality degrades as the knowledge base gets messier.
A real home service shop has hundreds of SKUs in its price book, thousands of past invoices, current service area, live availability, and a schedule that changes hourly. None of that fits in 20 static uploads.
Custom GPTs cannot live on your website
Per the OpenAI developer community and guidance from Bartosz Mikulski, Custom GPTs run only inside chat.openai.com. They cannot be embedded on your website for anonymous visitors.
Your homeowner would have to own a ChatGPT account, log in, and navigate to your specific GPT. That is not happening when the furnace is out.
The only way to put AI on your public website is to rebuild the bot using the OpenAI or Anthropic API, a retrieval layer, and a customer-facing chat widget. That is a software project, not a Custom GPT.
Static knowledge goes stale immediately
The moment you upload your price book to a Custom GPT, it is frozen. You raised your hourly rate last week. The bot still quotes the old one.
Your tech's availability is in Jobber. The bot cannot see it.
A customer asked a question yesterday in Gmail. The bot has no memory.
This is the integration problem every DIY setup crashes into. A static upload is not a data fabric. It cannot power real quotes, real availability, or real follow-up.
Hallucinations quote the wrong price
ChatGPT hallucinates. Anyone who has used it for more than a week knows this. OpenAI's own terms of use, cited in the Originality.AI lawsuit summary, acknowledge the model cannot be trusted to generate accurate information.
In home services, a hallucinated answer is not a typo. It is a $5,000 quote mismatch on a furnace install or a promise your techs cannot keep.
Contractors on r/HVAC are openly complaining that customers arrive with ChatGPT diagnoses telling them "it is the capacitor" when it rarely is (per the ACHR News piece). The confidence with which bad information spreads is the core risk.
Your DIY bot does the same thing on your own website, with your own customers, on your own price book.
You are legally on the hook
Per the American Bar Association's summary of Moffatt v. Air Canada, the British Columbia Civil Resolution Tribunal ruled in February 2024 that Air Canada was liable for its chatbot's misrepresentation about bereavement fares.
Air Canada argued the chatbot was a separate legal entity. The tribunal rejected it outright. "As a service provider, Air Canada owed Moffatt a duty of care that was breached."
The lesson for any home service shop: if your DIY ChatGPT bot tells a customer your service area covers their address and it does not, or quotes a $300 diagnostic when it is $450, you are liable for the misrepresentation. OpenAI's terms do not transfer to your homeowner.
This is not theoretical. A $3M shop that gets sued once over a hallucinated quote spends more on defense than on three years of a real AI agent.
Your office manager still does the work
The sales pitch for Custom GPTs is "it saves time." In practice, your office manager ends up with a new tab open and copy-pastes answers between ChatGPT and your CRM.
Per HubSpot's 2024 State of Marketing Report, 64% of marketers now use AI for daily work, but the productivity gain comes from integrated tools, not from context-switching.
A Custom GPT is a context switch. It does not see your Jobber job cards. It does not know which customer called yesterday. It cannot book into your calendar.
Your office manager saves maybe 30 minutes a day drafting emails, then loses an hour reconciling what the bot said with what the CRM actually holds.
Speed-to-lead is where you actually lose money
Invoca's platform data shows 26% of calls to home service businesses go unanswered and under 3% of voicemails get a callback. The average missed call costs $1,200 in lost revenue.
The MIT Lead Response Management Study, hosted by HubSpot, found that the odds of qualifying a lead drop 21 times when contact is made after 30 minutes instead of 5.
A Custom GPT cannot fix this. It cannot see the missed call. It cannot send the SMS. It cannot book the follow-up.
A real agent wired into your phone system, your SMS number, and your calendar can.
Contractors who tried DIY and moved on
On the Owned and Operated podcast episode 100, John Wilson and Jack Carr, both operators of multi-location HVAC and plumbing businesses, described using AI internally for drafting estimates and marketing copy. John called AI "an enhancer rather than a complete solution" for their in-house work.
Their customer-facing deployment uses AI chatbots that schedule directly into ServiceTitan through integrated tools, not a raw Custom GPT.
That is the pattern. Owners use ChatGPT as a personal research tool. They do not put it on the customer-facing side without real integrations.
Hatch's case study on Brown Roofing reports conversion rising from the low 70s to 86% after switching to an integrated AI CSR. Their Apex Service Partners case study reports average first-reply times dropping to under 1 minute across speed-to-lead, estimate follow-up, and recurring-service campaigns.
None of that is achievable with a Custom GPT.
What actually works: integrated agents on a unified data layer
A working home-service AI setup has three pieces.
First, a data fabric that pulls from every tool you already use: Jobber, Housecall Pro, ServiceTitan, Workiz, GoHighLevel, Gmail, Google Calendar, Slack, QuickBooks, HubSpot. Not a static upload, a live feed.
Second, pre-built agents tuned for your workflows. Missed-call follow-up sees a Twilio missed call and texts the homeowner within seconds. Quote follow-up sees an estimate in Jobber and nudges two days later. AI chat on your website answers from your real price book, not a frozen PDF.
Third, a morning brief that rolls up overnight: new leads, urgent replies needed, overdue quotes, jobs at risk. Your phone buzzes at 6:45am with a two-paragraph summary.
This is what Sully is built for. Pre-built agents wired into the CRMs a $1M to $10M contractor actually uses, without the 20-file ceiling, the legal exposure, or the developer retainer.
The DIY rule of thumb
Use ChatGPT and Custom GPTs internally. Draft email copy, summarize a long document, brainstorm a price book item description. Your CSR checks the output before it goes out.
Do not put a DIY ChatGPT on your public website. Do not use it to auto-reply to homeowners. Do not let it quote prices or schedule jobs.
The moment a DIY bot talks directly to a customer without human review, you own every word it says and, per Moffatt v. Air Canada, you own every mistake.
The cheap path is not a Custom GPT. It is a platform that shipped the integrations, the retrieval, and the guardrails already.
Sources:
- American Bar Association on Moffatt v. Air Canada
- Invoca missed calls study
- MIT Lead Response Management Study via HubSpot
- ACHR News on ChatGPT in HVAC
- OpenAI developer community on Custom GPT limits
- Hatch AI Brown Roofing case study
- Hatch AI Apex Service Partners case study
- Owned and Operated podcast 100
- HubSpot State of AI report
See Sully in action
Sully is the pre-built AI for home service shops. Connect your CRM, email, and phone system in minutes and the agents run on your real data.