Solo specialist within a salon
Has their own personal Instagram. Customers DM the specialist directly, the bot logs into the salon CRM.
MCP server for an AI assistant
SaaS for booking automation via Instagram Direct
Pilot — SaaS for businesses that take bookings via Instagram Direct: beauty salons, barbershops, clinics, trainers, consultants, tutors, specialists. Owners connect their Instagram, the AI assistant intercepts DMs from customers — chats, advises, books appointments, creates customers in CRM, reschedules, cancels.
The platform supports networks with multiple locations and dozens of staff with their own service sets.
My role: solo full-stack + AI — from architecture to production. One developer for the entire stack: backend, MCP layer, integrations with CRM/Instagram/Google Calendar/Telegram, infrastructure, deployment.
Before Pilot I worked as a media buyer and saw the same problem at business clients: managers leak leads.
For small businesses this often doesn't pay off. The owner either pays for a mediocre service, or sits in Direct themselves instead of running the business.
Under the hood — an MCP server with 8 production tools that the LLM agent calls for real actions in CRM.
The system scales from a single specialist to a salon network — with the same codebase.
Has their own personal Instagram. Customers DM the specialist directly, the bot logs into the salon CRM.
Independent specialist working alone. No CRM needed — the bot books directly into Google Calendar, which syncs with Apple Calendar.
Integration with CRM + calendars of all staff. The bot knows everyone's schedule and skill set.
Multiple locations under one brand, their own CRM, their own staff, their own pricing per location. Even at scale, the assistant keeps in mind every specialist at every location, what services each provides, schedules and prices — and navigates fluidly thanks to smart context.
Customers send not only text — a reference photo, a voice "book me for tomorrow", a video of the last haircut. AI has to understand all of this together and respond meaningfully.
Solution → Async media processing pipeline on top of a vision-capable LLM — for the agent the input stream looks like a single multimodal feed.
A single backend serves dozens of independent businesses. One universal agent has to work for a barbershop, a nail salon, and a salon network alike — with their actual catalog of services, staff and prices.
Solution → Full state isolation per tenant + a tool-schema generator that builds a contextual MCP-tools schema per client from current CRM data.
Sometimes the owner manually opens Instagram and starts replying to a customer themselves. If AI keeps responding in parallel — disaster: two responses, confusion, lost customer.
Solution → "Human took over" detector — the bot temporarily pauses for that chat and resumes with updated context once the human steps out.
Real people don't write one long message. They send "Hi", then a photo, then "I want to book", then "for tomorrow" — within 15 seconds, in separate bubbles. Most bots reply to each one separately — the dialogue becomes schizophrenic.
Solution → Pilot waits until the customer stops typing, groups the whole burst (text + media + more text) into one logical turn and responds with one clear message — like a human would.
The product is currently in final testing — below is the value proposition vs. hiring a person.
| Metric | Human | Pilot |
|---|---|---|
| Monthly cost | $600-1800 for 1-3 managers per location | $120 per salon |
| Response time | 30 min – 2 hours | < 5 seconds |
| Parallelism | 1 conversation at a time | unlimited concurrent |
| Working hours | 8-12 hours, 5 days | 24/7, no days off |
| Script quality | fluctuates, depends on mood | stable, updated centrally |
| Burnout / turnover | high turnover, constant retraining | none |
| Customer memory | forgets | full visit and add-on history |
| Scaling | +1 location → +1 manager | +1 location → 0 extra cost |
This is one of my cases. The rest is on the home page.