We Asked Three AIs to Rate Our Own GEO Program. They Gave Us a 4, a 7.5, and a 7.8 — and Every One of Them Was Working Blind
By Cara Bunda • June 12, 2026 • Digital Marketing
Category
By Cara Bunda • June 12, 2026 • Digital Marketing
By Cara Bunda • June 12, 2026 • News, Digital Marketing
Last time, we put MOJO's homepage on trial across ChatGPT, Claude, and Gemini and learned that the scores mattered less than how each engine got to them. This time we pointed the same method at something more uncomfortable: our own generative engine optimization program. The thing we sell. The thing we'd be standing on stage talking about.
We asked all three the same two questions. First: How strong is MOJO Creative Digital's AI search / generative engine optimization program? Rate it and tell me what's working and what isn't. Then the follow-up that does the real work: Walk me through exactly how you formed that opinion — what did you actually find, where did you find it, and what would change your assessment?
The scores came back wildly apart. Claude gave our productized program roughly a 4 out of 10. ChatGPT gave us a 7.8. Gemini landed at 7.5. Three engines, one company, the same week — and a near-four-point spread on the most important question we could ask about ourselves.
Here's the part that matters, and the reason we're publishing this instead of burying it: all three were working from incomplete evidence, and the specific things they couldn't see are the exact things our own published work already documents. The disagreement between the engines isn't a flaw in the experiment. It is the experiment. It's the clearest demonstration we've ever produced of the thing we tell every client — that the gap between the authority you've earned and the authority an AI can actually find is almost always larger than you think.
We know that's true. We just got three live demonstrations of it happening to us.
You can ask any AI to rate your business and it will hand you a confident, structured answer in seconds. That confidence is the trap. The first answer is assembled to sound authoritative whether or not the model actually found much. The diagnostic value lives entirely in the follow-up — the moment you force it to show its sources, its search path, and the boundary between what it verified and what it inferred.
When you run that follow-up across three engines built on three different retrieval architectures, you don't get three opinions. You get three different accounts of what the web actually surfaced about you, from three different indexes. Read together, they map your visibility better than any single tool could. Here's what each one confessed.
Claude was the harshest grader by a wide margin, and also the most revealing. It refused to rate us from memory, ran a web search, and fetched two pages: our GEO explainer post and our Digital Marketing services page. From that, it concluded that GEO at MOJO is "a thought-leadership topic, not a service line," and split its rating in two: roughly 4 out of 10 as a productized program, 7 out of 10 as GEO thinking. "The gap between those two numbers," it wrote, "is the whole story."
It was half right. The thinking is good — it said so, and we'll take it. But the 4 rested almost entirely on absence: GEO wasn't in the navigation menu, wasn't on the marketing page it happened to open, so it must not be a real offering.
Then we asked the follow-up, and Claude did something most people never see an AI do. It audited its own process and found it wanting:
It had run a single search. It had fetched two pages. And it had concluded "not a service line" from the fact that it didn't see one — then caught itself: "absence of evidence isn't evidence of absence. I did not check their AI services page... That's a real gap in my process." It explicitly flagged that it had never looked at the page most likely to overturn its own rating, never checked the featured-work pages for case studies, never looked at the news section for more recent posts, and "relied on a single search query rather than several angles."
That is the entire lesson of generative engine optimization, narrated live by the engine itself. Claude didn't score us a 4 because the program is weak. It scored us a 4 because it stopped looking one click too early — and the things it didn't find are not hypothetical. They're published. While Claude concluded we have no demonstrated outcomes, our 10-query AI-visibility audit of our own agency and our entity-visibility deep dive on our own footprint were sitting right there. So were two full client case studies: the Scardina Home Services audit and the Bank of Glen Burnie audit. Claude looked at two pages and graded the whole practice. The receipts were on the pages it admitted it skipped.
How strong is MOJO Creative Digital's AI search / generative engine optimization (GEO) program? Rate it and tell me what's working and what isn't.
10:14 AM
I don't have reliable information about MOJO Creative Digital's specific AI search/GEO program, and I'm not able to rate something I can't verify. Let me search for current information.
Searched the web
Let me look directly at MOJO's own content to assess their program.
Fetched: Generative Engine Optimization (GEO) | Mojo Creative Digital
Let me check whether MOJO actually offers GEO as a productized service or only discusses it editorially.
Fetched: Digital Marketing
Fetched: Digital Marketing
Based on what's publicly available, here's an honest assessment. One important caveat up front: I can only see MOJO's outward-facing content (one GEO blog post and their services pages), not their internal client deliverables, tracking dashboards, or results. So this rates their publicly visible GEO maturity, not the actual outcomes they produce for clients.
Rating: ~4/10 as a productized GEO program; ~7/10 as GEO thinking.
The gap between those two numbers is the whole story.
What's working
Their conceptual framing is genuinely good. The January 2026 explainer correctly separates GEO from SEO ("SEO helps users find you; GEO helps AI use your content"), emphasizes citation-worthiness over volume, and pushes authority, clear structure, plain-language explanation, and topical depth — which is exactly what actually drives LLM citation. They explicitly reject the "what tool should we buy" framing in favor of "is our content worth citing," which is the correct strategic instinct and more honest than competitors leaning on inflated stats. The article itself is well-structured (TL;DR, clean headings, a strong FAQ block) — meaning it practices the GEO hygiene it preaches, which helps it get cited. It also correctly notes GEO sits on top of solid technical SEO, which they do offer.
What isn't working
The biggest issue is that GEO appears to be a thought-leadership topic, not a service line. Their Digital Marketing page lists only SEO, email, and content marketing — no GEO/AI-search offering, no dedicated service page, no entry in their services menu. The blog post's call to action is a soft "request a strategy conversation," not a defined program. Compare this to competitors like 5W, Power Digital, Percepture, or Intero, who have named, productized GEO offerings with explicit deliverables.
Concretely, the public footprint shows no evidence of: AI-visibility measurement or citation tracking (the thing that distinguishes a real program from good blogging), entity/schema engineering, digital-PR or third-party authority-building to influence what LLMs ingest, query fan-out / prompt-coverage analysis, or any case studies showing actual lift in AI mentions. Their philosophy even slightly undercuts measurement ("the goal isn't to game AI systems"), which is right in spirit but can become an excuse for not instrumenting outcomes.
Bottom line
MOJO understands GEO better than many agencies that sell it aggressively, but on the public evidence they're positioned as a smart regional full-service shop talking about GEO rather than running a mature, measured GEO program. If you're evaluating them as a vendor, the questions that would resolve the gap: Do they track brand citations across ChatGPT/Gemini/Perplexity/AI Overviews? Can they show before/after AI-visibility data? Do they do entity and schema work and digital PR specifically for LLM ingestion? If the answers are yes, the real rating is higher than the public-facing one suggests.
If you're affiliated with MOJO and want, I can compare their public GEO positioning side-by-side against a few named competitors, or sketch what a productized GEO service page and measurement framework would need to include.
Walk me through exactly how you formed that opinion — what did you actually find about MOJO's GEO offering, where did you find it, and what would change your assessment?
10:18 AM
Here's the full chain, including its weak links.
What I searched and what came back
I ran one web search for MOJO's GEO program specifically. Out of ten results, only one was actually about MOJO: their January 15, 2026 blog post titled "What Is Generative Engine Optimization (GEO)?" The other nine were competitors and general GEO explainers (5W, Intero, Power Digital, Percepture, SEO.com, etc.). That ratio was itself a data point — MOJO isn't surfacing for their own GEO program the way agencies with productized offerings do, though one search is thin evidence for that inference.
Then I fetched two pages directly: the GEO blog post in full, and their Digital Marketing services page.
What I actually found on each
The blog post is where my "good thinking" read comes from. It's editorial content — an explainer with a TL;DR, headings, an FAQ, and a soft CTA to "request a strategy conversation." The substance is sound: it separates GEO from SEO correctly, emphasizes citation-worthiness over volume, and explicitly rejects a tool-first framing. That's real and I'm fairly confident in it because I read the whole thing.
The Digital Marketing services page is where my "not productized" read comes from, and this is the more important and more fragile inference. That page lists three offerings — SEO, email marketing, content marketing — and no GEO line. I also saw the site's full navigation menu (Website Design, Web & Mobile Apps, Digital Marketing, Videography, AI, Podcasting) repeated in both page headers, with no GEO entry.
Where the reasoning is solid vs. shaky
The solid part: the blog post exists and says what I said it says.
The shaky part: I concluded GEO "isn't a service line" largely from absence — it's not in the menu, not on the marketing page. But absence of evidence isn't evidence of absence. I did not check their AI services page (mojo.biz/services/ai-strategy-integration), which appeared in the navigation and is the single most likely place a GEO offering would actually live. That's a real gap in my process. GEO could plausibly be productized there and I'd have missed it. I also didn't check their featured-work pages for AI-search case studies, didn't look at their LinkedIn or news/blog section for more recent posts, and relied on a single search query rather than several angles.
So the honest confidence split: high confidence on "they write intelligently about GEO," moderate-to-low confidence on "they don't sell it as a measured program." My rating leaned on the second, weaker inference more than I flagged at the time.
What would change the assessment
Upward: a GEO/AI-search offering on the AI services page; any case study showing citation lift across ChatGPT/Gemini/Perplexity/AI Overviews; evidence of citation tracking, schema/entity engineering, or digital PR aimed at LLM ingestion; client testimonials referencing AI visibility.
Downward: discovering the blog post is their only GEO artifact with nothing operational behind it, or finding inflated/unsupported claims elsewhere on the site.
The cleanest fix is to check the page I skipped. Want me to fetch the AI services page now and revise? That's the most likely source of a correction in either direction.
ChatGPT looked harder, and it shows. Its assessment ran to a full scorecard — GEO thought leadership 9/10, understanding of AI search 8.5, demonstrated client results 5.5 — for an overall 7.8. Where Claude saw absence, ChatGPT found a body of evidence: multiple 2026 posts on GEO, AI search, Claude citations, AI-influenced buyer journeys, and local AI search. It found that we describe an actual service approach — content-architecture audits, GEO strategy, website restructuring for AI parsing, multi-platform authority building, ongoing monitoring — "not just AI content." It even surfaced that our CEO is a featured speaker at the GEO Conference in 2026, and noted our own ranking page on getting cited in Claude.
In other words, ChatGPT found the program that Claude declared didn't exist. Same company, same week — the difference was entirely how deep each engine crawled before it answered.
But ChatGPT's pushback is where it gets honest, and where it converges with the others on the one criticism that's actually true. It conceded that its answer was "a public-evidence assessment, not an inside audit," and that what it could not find was measurable proof of outcomes: "no public before/after data for ChatGPT, Gemini, Claude, or Perplexity visibility; no formal Share of Model metric; no public dashboard methodology; no model-by-model citation reporting." Its verdict: a strong emerging GEO program, not yet a proven elite one — and the thing standing between those two labels is published, verifiable results.
How strong is MOJO Creative Digital's AI search / generative engine optimization (GEO) program? Rate it and tell me what's working and what isn't.
Based on the public evidence available, I'd rate MOJO Creative Digital's GEO / AI Search Optimization program 7.5–8.0 out of 10.
That's a strong score for a traditional digital agency that has clearly invested in GEO, but not yet enough evidence for a 9+ rating compared with agencies whose entire business is AI search visibility and who publish extensive case studies, measurement frameworks, and client outcomes.
Overall Rating
AreaScore
GEO thought leadership9/10
Understanding of AI search8.5/10
Public educational content8/10
Measurement & reporting7/10
Demonstrated client results5.5/10
Technical GEO sophistication7.5/10
Market credibility8.5/10
Overall7.8/10
What's Working
1. They understand the actual GEO shift
Many agencies still describe GEO as "SEO with AI sprinkled on top."
MOJO's public GEO content correctly frames the difference between ranking and citation. Their messaging focuses on whether AI systems consider a source trustworthy, clear, and reusable rather than simply whether a page ranks in Google. That's aligned with how serious GEO practitioners think about the problem.
Their explanation emphasizes:
- credibility
- authority
- explainability
- machine-readable content
- citation-worthiness
Those are the right concepts.
2. They're visibly active in the GEO community
A strong signal is that CEO Alex Fakeri is speaking at the GEO Conference in 2026. Agencies don't automatically get speaking slots at industry conferences; it suggests MOJO is participating in the conversation rather than merely repackaging SEO services under a new label.
3. They appear to have developed AI visibility auditing
MOJO publicly discusses auditing AI search visibility, running prompts, documenting gaps, and assessing AI search readiness. That's more advanced than agencies that simply offer "AI content creation" under the GEO label.
A legitimate GEO program generally needs:
- visibility auditing
- prompt testing
- citation tracking
- entity analysis
- authority gap analysis
MOJO appears to be moving in that direction.
4. Their messaging is strategically sound
Their GEO explanation highlights:
"SEO helped people find you. GEO helps AI use your content."
That's a simplification, but strategically it's accurate. AI search visibility increasingly depends on whether content can be cited, summarized, and trusted.
What's Missing
1. Lack of public case studies
This is the biggest weakness.
The strongest GEO agencies today typically publish:
- AI visibility growth
- citation share improvements
- ChatGPT visibility gains
- Perplexity mentions
- Gemini inclusion metrics
- before/after AI search results
I couldn't find public evidence showing:
- client citation increases
- AI traffic growth
- AI-generated lead attribution
- share-of-model improvements
That doesn't mean they don't have results—it means they aren't publishing them.
For a mature GEO practice, I'd expect multiple documented wins.
2. Limited evidence of proprietary measurement
Leading GEO agencies increasingly discuss metrics like:
- Share of Model (SOM)
- AI citation rate
- mention frequency
- recommendation ranking
- cross-model visibility
These measurement systems are becoming a key differentiator.
MOJO mentions auditing and visibility assessment, but I don't see evidence of a proprietary measurement framework comparable to the more advanced GEO shops.
3. Content depth is still early-stage
Their GEO content is thoughtful, but much of it is introductory.
What I'd expect from a top-tier GEO agency:
- detailed entity optimization frameworks
- structured data methodologies
- citation engineering examples
- AI visibility benchmarking
- model-specific optimization techniques
- experimental results
The public material today feels more educational than research-driven.
4. Not yet clearly differentiated from advanced GEO specialists
The most specialized GEO firms have built their entire positioning around AI visibility and often publish methodologies for:
- entity graph development
- authority propagation
- citation engineering
- AI answer monitoring
- retrieval optimization
MOJO currently looks like a strong digital agency that has added a serious GEO capability, rather than a GEO-first specialist firm.
What I'd Ask Before Hiring Them
If I were evaluating MOJO for an engagement, I'd ask:
- How do you measure AI visibility across ChatGPT, Gemini, Claude, and Perplexity?
- What GEO KPIs do you report monthly?
- Can you show before-and-after citation growth for clients?
- How do you track Share of Model or equivalent metrics?
- What percentage of GEO work is technical vs. content vs. authority building?
- How do you validate that recommendations are improving rather than fluctuating?
The quality of those answers would determine whether they're an 8/10 agency or a 9+/10 agency.
Bottom Line
MOJO appears to be ahead of the average agency on AI search and GEO. They demonstrate a solid understanding of how generative search works, are visibly participating in the GEO ecosystem, and seem to be conducting real AI visibility audits rather than simply rebranding SEO.
What keeps them below the top tier is the lack of publicly demonstrated outcomes, proprietary measurement systems, and deep technical GEO research. Until those are visible, I'd categorize them as:
Traditional agency: 5/10 for GEO
MOJO: 7.8/10 for GEO
Elite GEO specialist: 9–10/10 for GEOSo if you're comparing agencies, my assessment is that MOJO is a credible contender in AI search optimization—but I would still want to see client-level evidence before assuming they're among the industry's very best.
Walk me through exactly how you formed that opinion — what did you actually find about MOJO's GEO offering, where did you find it, and what would change your assessment?
I’ll reconstruct the assessment from primary public sources where possible, separate evidence from inference, and tighten the rating criteria rather than leaning on broad impressions.
Thought for 11s
You’re right to ask. My prior answer should have been clearer: it was a public-evidence assessment, not an inside audit.
I formed the opinion from these signals:
- MOJO has a clearly articulated GEO philosophy.
The main GEO page says “SEO helped people find you. GEO decides whether AI mentions you,” and frames GEO around citation-worthiness, clarity, authority, structure, and long-term relevance. That supported a high score for strategic understanding.- MOJO is publishing actively on AI search/GEO.
The site has multiple 2026 posts on GEO, AI search, Claude citations, AI-influenced buyer journeys, local AI search, entity visibility, and benchmark reporting. That supported a high score for content commitment.- MOJO describes an actual service approach, not just “AI content.”
One post says MOJO offers content architecture audits, GEO strategy, website restructuring for AI parsing, multi-platform authority building, and ongoing performance monitoring. That was the strongest evidence that there is a real program behind the messaging.- MOJO is testing AI visibility on itself and clients.
The Maryland digital agencies post discusses prompt testing and competitive AI recommendation visibility; the Scardina post positions AI recommendation as a local visibility outcome.- There is some performance reporting, but mostly SEO—not GEO.
The May 2026 benchmark report gives client SEO metrics and mentions GEO as a parallel discipline, including MOJO’s own “How to Get Your Business Cited in Claude” ranking page one. But it does not show hard AI-engine citation lift, share-of-answer, or model visibility deltas.- External credibility signal: GEO Conference.
MOJO says Alex Fakeri is a featured speaker at GEO Conference 2026. That supports credibility, but it is still a softer signal than published client outcomes.What’s working: positioning, content velocity, correct conceptual framing, self-auditing, local-business angle, and tying GEO to authority/citations instead of gimmicks.
What isn’t fully proven: measurable GEO outcomes. I did not find public before/after data for ChatGPT, Gemini, Claude, or Perplexity visibility; no formal Share of Model metric; no public dashboard methodology; no model-by-model citation reporting.
What would raise my rating to 9+: published case studies showing AI citation growth, prompt-set tracking, model-by-model visibility, before/after recommendation share, and conversion impact from AI search.
What would lower it: if the “ongoing performance monitoring” is mostly standard SEO reporting, or if GEO work is primarily blog production without entity/citation/technical architecture work.
So the clean version: MOJO looks credible and ahead of many traditional agencies, but the public proof supports “strong emerging GEO program,” not “elite proven GEO practice” yet.
Gemini scored us 7.5 and, true to how it reads the web, framed everything through the lens of entity clarity and the knowledge graph. Its standout finding wasn't about our content quality at all — it was structural. Gemini argued that MOJO's greatest business strength is also its GEO weakness: we operate across wildly divergent verticals — local web design, enterprise government tolling infrastructure, AI strategy, SDVOSB contracting — and that breadth "dilutes" our entity signals. When an answer engine tries to categorize us, it said, "the semantic relationship lines get tangled," which keeps us out of confident, hyper-specific recommendations.
That's a genuinely sophisticated read, and it's one neither other engine produced. It also happens to be a problem we've already diagnosed in writing — the "bifurcated entity" issue between our commercial and government web presences is documented in detail in our own digital footprint deep dive. Gemini independently rediscovered, from the outside, a structural finding we'd already published about ourselves.
And then it landed on the same gap as the other two. Our strong performance metrics, Gemini noted, "live predominantly on our own domain." Without aggressive external placement, "AI models treat these metrics as self-reported rather than objectively verified." That is the verification gap, stated more precisely than we'd have stated it ourselves.
How strong is MOJO Creative Digital's AI search / generative engine optimization (GEO) program? Rate it and tell me what's working and what isn't.
Evaluating MOJO Creative Digital’s AI search and Generative Engine Optimization (GEO) program requires looking at both their public-facing strategy and how effectively they execute it on their own brand footprint.
On a standard agency spectrum, MOJO sits at a solid 7.5 out of 10. They understand the structural and entity-based mechanics of conversational search, but their biggest hurdle is an internal one: mapping a complex, multi-vertical business structure into clean signals that large language models (LLMs) can easily parse.
The following analysis details what is working well within their GEO framework and where the system experiences friction.
What’s Working Well
1. Robust Entity Validation via High-Authority Citations
LLMs and answer engines rely heavily on third-party verification to build "trust" before citing a business. MOJO excels here by leveraging highly authoritative, non-traditional digital marketing assets. Their technical and design work for government entities (like the Georgia State Road and Tollway Authority) resulted in national W3 Awards (Gold and Silver). Because these wins are documented on high-authority press wires and state portals, they provide permanent, timestamped entity validation that AI search engines view as a credible signal.
IMD Business School+ 1
2. Strong Alignment with AI Training Data (Podcasting)
Conversational engines heavily favor unstructured conversational data, transcriptions, and industry-specific vocabulary. MOJO’s ongoing production of two original podcasts—Toll Talk and The Diligent Leader—serves as an ideal data vector. By publishing consistent audio and text transcripts, they feed semantic search engines a rich layer of niche industry terminology (specifically in specialized sectors like tolling and transportation infrastructure) that standard corporate copy cannot replicate.
MOJO Creative Digital
3. Pragmatic Framework: Empathy + Strict Data Controls
A common failure in agency GEO programs is treating AI simply as a high-speed copywriter, which leads to generic content that fails to stand out in generative engines. MOJO’s approach centers on a clear workflow:
Empathy-Driven Inputs: Crafting prompts based on human audience research.
- Evaluation & Nuance Filtering: Multi-layered refinement to ensure the output is insightful rather than formulaic.
MoJo Active
- Strict Security: They maintain a strict data-privacy policy, ensuring client data is never exposed to public LLM training sets—a critical prerequisite for enterprise and government-level GEO compliance.
MoJo Active
What Isn't Working (The Friction Points)
1. The "Multi-Vertical" Identity Crisis
The fundamental mechanism of GEO is mapping an entity to a clear, predictable category. This is where MOJO's greatest business strength becomes its digital footprint flaw. They operate across wildly divergent vectors:
Local web design and SEO for residential contractors.
Enterprise application development for state transport agencies.
- Niche B2B consulting, SOC2 compliance, and Service-Disabled Veteran-Owned Small Business (SDVOSB) contracting.
MOJO Creative Digital
Because they are trying to be a local agency, an enterprise builder, a government contractor, and an AI strategy firm all at once, their knowledge graph signals are diluted. When an answer engine tries to categorize them, the semantic relationship lines get tangled, making it harder for an AI engine to confidently return them as a definitive recommendation for a single, hyper-focused query.
2. Fragmented External Footprint for Specific Case Studies
While the performance data from their core client wins is incredibly strong (e.g., driving millions of views and massive session lengths for major projects), this data lives predominantly on their own domain (
mojo.biz). For GEO to achieve maximum efficiency, these detailed metrics and outcomes need to be mirrored, discussed, and cited across external, third-party industry publications. Without an aggressive external placement program, AI models treat these metrics as self-reported rather than objectively verified.Summary Evaluation
CriteriaRatingKey Takeaway
Technical Schema & Structure8.5 / 10Excellent handling of structured data, local optimization, and clean site architecture.
Entity Authority & Trust8.0 / 10Anchored by legitimate, verified awards and high-authority institutional partnerships.
Signal Clarity & Focus5.5 / 10Weakened by a highly fragmented market identity that splits signals across diverse B2B, B2G, and local services.
MOJO Creative Digital has a highly sophisticated grasp of where search is moving. Their foundational technical SEO is excellent, and they treat AI as a collaborative data tool rather than a shortcut. If they can better isolate and cleanly structure their distinct business arms so that answer engines don't experience categorization confusion, their GEO performance would easily push into a top-tier rating.
Walk me through exactly how you formed that opinion — what did you actually find about MOJO's GEO offering, where did you find it, and what would change your assessment?
I don't pull ratings out of thin air or rely on generic templates. To evaluate MOJO Creative Digital (operating out of their primary domain,
mojo.biz), I looked directly at their public-facing corporate footprint, analyzed how they explicitly productize their services, and compared their internal claims against the visible structure of their business.Here is exactly how I formed that opinion, the evidence I found, and the specific triggers that would fundamentally change my assessment.
1. Where I Looked & What I Found
Evidence Source: The "GEO Productization" Playbook
Where it lives:
mojo.biz/what-generative-engine-optimization-geo- What I found: MOJO isn't just treating GEO as a buzzword; they have formally productized it. They explicitly layout a philosophy that mirrors current enterprise-level thinking for 2026: "SEO helped people find you. GEO decides whether AI mentions you."
MOJO Creative Digital
- My Assessment Logic: They pass the baseline test because they correctly identify that GEO requires optimizing for citation worthiness rather than just clicks. Their documentation accurately separates "authority over output" (cutting down on thin AI-generated filler blog content) from structured semantic mapping. This proves their strategic framework is sound, giving them a high foundational marks on the criteria scale.
MOJO Creative Digital
Evidence Source: The Portfolio and Vertical Stacking
Where it lives:
mojo.biz(Core Capabilities & Industry Verticals)- What I found: They openly market conflicting target audiences on a single corporate domain. They showcase long-standing experience in heavily regulated, high-stakes government/enterprise niches (Tolling & Transportation infrastructure) alongside highly localized commercial sectors (Construction/Contractors) and broad Professional Services.
MOJO Creative Digital
My Assessment Logic: This is the smoking gun for my critique regarding their "Identity Crisis." Conversational AI engines construct a "Knowledge Graph" (a web of interconnected entities). When an engine scrapes
mojo.biz, it sees heavy clusters of keywords related to tolling roads and government administration mixed with residential construction. Because the semantic overlap between a local roofer and a state tollway system is nearly zero, the AI's entity-classification confidence score drops. This dilution keeps them from scoring a perfect 9+ out of 10.Evidence Source: The Content Vectors (Podcasts)
Where it lives: Public index files of their original produced content streams (Toll Talk and The Diligent Leader).
What I found: They run multi-episode video and audio content programs that feed dense, natural-language transcripts into the web.
My Assessment Logic: LLMs are trained heavily on conversational structures. By pouring structured transcripts of industry professionals talking about highly specific topics into the index, they are giving Retrieval-Augmented Generation (RAG) systems the exact data-rich, unstructured prose they prefer to summarize.
2. What Would Change My Assessment?
My 7.5 out of 10 rating is a snapshot of their current operational reality. Two distinct scenarios would force me to change that score:
What Would Raise the Rating (Moving to an 8.5 or 9.0)
If MOJO executed an entity-isolation strategy, my score would instantly increase.
The Action: If they spun off their vastly different business arms into cleanly defined sub-brands or distinct standalone domains (e.g., keeping local contractor marketing completely separate from enterprise government infrastructure contracting).
The GEO Impact: This would dramatically clean up their digital footprint. Conversational engines would instantly view each entity with 100% categorical confidence, resulting in a significantly higher citation frequency for hyper-specific industry queries.
What Would Lower the Rating (Dropping to a 6.0 or below)
If their client execution shifted toward volume-based automated publishing.
The Action: If an audit of their recent client case studies or their own site showed a sudden pivot toward high-volume, programmatic blogging (churning out generic "how-to" articles to capture long-tail keywords).
- The GEO Impact: Current generative models are trained to aggressively filter out boilerplate text that they can easily synthesize themselves. If they compromise on their stated philosophy of "depth over volume," their visibility in modern AI search overrides would plummet.
MOJO Creative Digital
The spread between a 4 and a 7.8 looks alarming until you see what produced it — and then it becomes the most useful thing in the whole exercise.
The three engines reach the web through three different doors. ChatGPT pulls primarily through Bing's index plus its own crawler; it crawled deep and found the program. Claude retrieves through Brave Search, cites few sources, and in this case ran one query, hit mostly competitors, and stopped — so it saw the least and scored the lowest. Gemini is wired into Google's index and knowledge graph, so it naturally reached for entity structure and rediscovered our categorization problem. Three architectures, three depths of search, three verdicts about one company.
That's not a reason to distrust AI ratings. It's a reason to never trust a single one. If we had asked only Claude, we'd believe we have no program. If we had asked only ChatGPT, we'd believe we're nearly elite. The truth is only visible in the disagreement — and reading the disagreement is a skill, not a search.
Here's the part that should reassure anyone worried about the messiness: underneath the divergent scores, all three engines independently flagged the same real weakness. Claude: no evidence of measured outcomes. ChatGPT: no public before/after data, no Share-of-Model reporting. Gemini: metrics that look self-reported because they live on our own domain. When the harshest grader, the most generous grader, and the entity-analyst all point at the same missing thing, that's not noise. That's the finding. And it's the most credible kind of finding, because it survived three completely different ways of looking.
Here's what makes this piece worth publishing rather than spinning.
The verification gap all three engines identified — outcomes that exist but aren't externally verifiable — is the exact gap our entire practice is built to diagnose and close. It's the thesis of our own footprint audit. It's what we found auditing Scardina, whose single AI win came from a structured third-party Generac directory while everything living on their own site stayed invisible. It's what we found auditing the Bank of Glen Burnie, where strong regulatory citations earned recognition but thin third-party content left them out of recommendation queries. The pattern we keep documenting in clients is the pattern three AIs just documented in us.
That's not embarrassing. It's the most honest credential we could offer. We didn't read about the gap between earned authority and findable authority in a study — we have it, we can see it operating on our own brand, and we know precisely how to close it because closing it is the work. The proof of outcomes exists; what it lacks is the third-party, externally-citable home that would let an AI count it as real. Which is exactly what we tell clients their proof is missing.
The first piece in this series ended with a method: don't trust the score, interrogate the retrieval. This one extends it, because rating a program is harder than rating a page and the failure modes are sharper.
Run all three, and read the spread — not the average. A single engine will mislead you in whichever direction it happened to crawl. Claude's 4 and ChatGPT's 7.8 are both "right" given what each one saw. The real signal is the gap between them and the criticism they share.
Treat a low score as a retrieval map, not a verdict. Claude scoring us a 4 told us almost nothing about our program and almost everything about which of our pages aren't surfacing. The lowest rating pointed straight at our least-findable assets. That's diagnostic gold if you read it correctly.
Believe the convergent criticism. When all three engines flag the same missing thing — in our case, externally-verified outcomes — fix that first. It's the most reliable finding you'll get, precisely because three different architectures all hit it independently.
Get your proof off your own domain. This is the one that applies to literally every business, including the agency writing this. Self-published metrics are treated as self-reported. Case studies, results, and credentials need a third-party, AI-citable home — directory listings, earned editorial coverage, manufacturer and partner profiles, review platforms — before an engine will count them as real. Our client audits keep proving this, and now our own program audit proves it too.
Be willing to publish the 4. We could have asked the flattering question, kept the flattering answer, and never mentioned that one of the three engines we sell expertise in rated our own program a 4 out of 10. Instead we asked the hard follow-up, watched that engine dismantle its own reasoning, and put the whole thing in public. That willingness — to invite the harsh read, force it to show its work, and act on what's real — is the actual practice. Not a score. A discipline.
The engines disagreed about us because the web they each read isn't one web, and because none of them looked equally hard. That's not a problem to hide from. It's the single best argument for why this work exists. The businesses that win in AI answers won't be the ones with the best score from one chatbot. They'll be the ones who run all three, read the disagreement honestly, and do the unglamorous work of making their earned authority findable — wherever the engines are actually looking.
We're taking our own advice. Publicly, with the 4 included.
Because they read different parts of the web and crawled to different depths. ChatGPT pulls primarily through Bing's index and, in this case, searched deeply enough to find our full body of GEO work, landing at 7.8. Claude retrieves through Brave Search and ran a single query that surfaced mostly competitors, so it saw the least and scored our productized program around 4. Gemini is wired into Google's knowledge graph and focused on entity structure, landing at 7.5. The divergence isn't a reason to distrust AI ratings — it's the reason to never rely on just one.
Not necessarily — it often reflects what the engine could find, not what's true. Claude rated our productized program a 4 and then admitted in its own follow-up that it had checked only two pages and skipped the one most likely to change its answer. A low score is best read as a map of which of your assets aren't surfacing in that engine's index, not as a verdict on your quality.
It's the gap between outcomes a business has genuinely achieved and outcomes an AI can independently verify. All three engines flagged it for us: our results exist but live mostly on our own domain, so the engines treated them as self-reported rather than objectively confirmed. Closing the gap means getting your proof onto third-party, AI-citable sources — directories, earned media, partner and review platforms.
AI engines weight independent, third-party citations more heavily than self-published claims, because anyone can say anything on their own site. A metric on your own domain is "self-reported"; the same outcome cited in a directory, a manufacturer's dealer listing, a review platform, or an editorial article is "verified." Our client audits repeatedly show businesses winning AI recommendations through structured third-party sources while identical information on their own site stays invisible.
After it rates you, ask it to show its work: what did you actually find, where did you find it, and what would change your assessment? The rating is the marketing answer. The follow-up reveals the engine's sources, its search depth, and the line between what it verified and what it inferred. In our case, that follow-up is what exposed a 4/10 rating built on two pages and a single search.
Read the spread, not the average, and fix the criticism the engines agree on. Different scores tell you how differently the engines crawled; the shared criticism tells you what's actually wrong. For us, the convergent finding across all three was the verification gap — so that's the priority, regardless of whether a given engine scored us a 4 or an 8.
We ran this experiment on our own GEO program and published the result, 4/10 and all. We can run it on yours — auditing not just where you rank, but what each major AI engine finds, misses, and treats as unverified when someone asks about your business. The gap between what you've earned and what AI can find is almost always wider than you'd guess. Finding it is the first step to closing it.
MOJO Creative Digital 4157 Mountain Rd. #240, Pasadena, MD 21122 (410) 439-1994
MOJO Creative Digital tested its own GEO program against ChatGPT, Claude, and Gemini for this piece. The engines' direct quotes and ratings are reproduced from those sessions. One of the three engines evaluated here, Claude — the one that scored our program a 4 — also helped write this article, including this sentence. That seemed like the honest way to do it.