Generative AI in Local Search: A Case Study About Pizza

Wow, things have moved quickly since we mused about the possibilities of generative AI in local search back in January, haven’t they?

Google finally joined the party with Bard, Microsoft unveiled AI-powered Bing Chat, and we’re already a few iterations deep into ChatGPT. And then, in May, came the explosive announcement of Google’s Search Generative Experience (SGE).

So, with increasing prevalence, integration within everyday search tools, and varying levels of public accessibility, we wanted to test how these different models respond to local search queries. Are they accurate? Useful?

You might already know that BrightLocal HQ is based in Brighton, UK. We also just so happen to have a food-obsessed content team—including two ex/sort-of food bloggers (yes, one of them is me, hi). So, what better way to be able to manually verify the accuracy of AI-generated search results than by analyzing those of search queries around our own local pizza restaurants?!

Methodology
Key Findings
Results
Analysis
Summary

Methodology

This case study centers around searching for local hospitality businesses in Brighton, specifically pizza restaurants, from the perspective of a typical consumer.

We determined five search queries, each with slightly differing intent, based on what a consumer might be looking for, but all with the common theme of local business discovery:

Where are the best pizza restaurants in Brighton?
What are the top-rated pizza restaurants in Brighton?
Most authentic pizza restaurants in Brighton
Best takeaway pizza in Brighton
Pizza delivery near me

These exact queries were entered into four publicly accessible (sometimes via a waitlist) generative AI tools and two traditional search engines as a control group:

Generative AI Tools

Google Bard
Search Generative Experience
Bing Chat
OpenAI’s ChatGPT (May 24 version)

Traditional Search Engines

Bing
Google

We’ve taken screenshots of every result provided to analyze the type of content and media displayed, whether sources are quoted, and how accurate the information is.

We did not refine our prompts, attempt to improve the results, or ask the AI bots for any further information about where their information is sourced.

Key Findings

Traditional search engines remain the most accurate for results containing business information.
SGE provides local business information (listings, reviews, and maps) 100% of the time, compared to 80% via traditional Google searches.
Bing provides local search results with directory links, maps, images, review ratings, and business listings 100% of the time.
Bing appears to be making leaps and bounds in matching intent behind local search queries—watch out, Google!
Bard provides some incorrect results, such as incorrect business names or businesses in other parts of the UK, 80% of the time.
Bard and ChatGPT do not generally provide citations to support their responses.
Bing Chat cites its sources for local search results 100% of the time.

Table: How often media formats and business information are presented in search results for local search queries

	Bard	Bing Chat	Bing Search	ChatGPT	Google Search	SGE
Website links	100%	80%	80%	0%	100%	40%
Directory links	100%	80%	100%	0%	100%	60%
Map	0%	60%	100%	0%	80%	100%
Images	100%	60%	100%	0%	80%	100%
Review ratings	20%	60%	100%	0%	80%	100%
Business listings	0%	60%	100%	0%	80%	100%
Sponsored content	0%	0%	60%	0%	20%	0%
Inaccuracies	80%	20%	0%	60%	0%	0%
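
To make the percentages easier to read, here is a minimal Python sketch that converts the "Inaccuracies" row of the table above back into query counts. It assumes each percentage is the share of the five test queries in which something appeared for a given tool, which would explain why every value is a multiple of 20%. The function and variable names are illustrative only and are not part of the original study.

```python
# Assumption: each table percentage is the share of the five test
# queries in which a feature (or an inaccuracy) appeared for a tool.
TOTAL_QUERIES = 5


def percent_to_query_count(percent: int, total: int = TOTAL_QUERIES) -> int:
    """Convert a table percentage back into a whole number of queries."""
    count, remainder = divmod(percent * total, 100)
    if remainder:
        raise ValueError(f"{percent}% is not a whole number of {total} queries")
    return count


# "Inaccuracies" row, copied from the table above.
inaccuracies = {
    "Bard": 80,
    "Bing Chat": 20,
    "Bing Search": 0,
    "ChatGPT": 60,
    "Google Search": 0,
    "SGE": 0,
}

inaccurate_query_counts = {
    tool: percent_to_query_count(percent)
    for tool, percent in inaccuracies.items()
}

for tool, count in inaccurate_query_counts.items():
    print(f"{tool}: inaccuracies in {count} of {TOTAL_QUERIES} queries")
```

Under that assumption, a single query accounts for 20 percentage points, so small differences between tools in the table reflect a difference of just one or two searches.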

Note: It is important to consider that this case study analyzes local search results using generative AI in its current state (as of publication in July 2023). As mentioned above, the technology is constantly developing.

Bard, ChatGPT, Bing Chat, and SGE all carry disclaimers noting that the tools may generate mistaken, inaccurate, or even offensive content.

Results

“Where are the best pizza restaurants in Brighton?”

Bard

Bard’s results to this query show quite a hodgepodge. There is a mix of independent pizza restaurants, known chains, shopping center food court brands, and… a London pizza restaurant, which definitely isn’t based in Brighton.

What’s more, it’s not clear how Bard is determining what makes this list of restaurants the ‘best’, although each result is attributed to a clickable source. There are no review ratings attached to them either, which is unusual considering Bard is a Google product.

SGE

SGE’s results look much as you would expect a typical Google search to look for this kind of local query. A selection of local business listings is displayed in a local pack-style format, complete with a map and review ratings.

The main difference here is that, rather than pulling a quote from a business review, SGE assigns each business a rather ‘samey’ description. Laid-back, inexpensive, and vegan options are descriptors you’d probably expect for any casual dining situation, so it doesn’t feel particularly helpful.

Bing Chat

Bing Chat also goes for a straightforward list approach, with short descriptions and clickable sources. It’s not clear where these descriptions have come from, and some of them are pretty questionable, such as stating that Wild Flour Pizza is “known for its ‘yummy’ pizzas”.

The sources are a mix of review sites, search engines, and restaurant websites. However, one of the sources is wrongly attributed, which highlights an issue with result accuracy.

As the results continue generating, we also get a map pack with Bing business listings and reviews pulled from Facebook. This looks much more like the kind of search results a user would be used to seeing, helping to reinforce trust in the model.

ChatGPT

It’s interesting that ChatGPT seems to be ‘careful’ right from the start, with a mini disclaimer to say it can’t provide any up-to-date information, which is also reinforced in the final paragraph.

All of the results are independent and generally well-loved Brighton restaurants. But there are some accuracy issues. The most bizarre is that result five, The Coal Shed, has never served anything close to a pizza on its steakhouse menu. Meanwhile, VIP is described here as “known for its New York-style pizza”, when it is most definitely Neapolitan. And, yes, it matters!

The ChatGPT results don’t provide any images, review ratings, sources, or any business information that might back up the list. It’s not really giving the typical user a reason to trust the results—something you’ll see recurring throughout this case study.

Traditional Search

As mentioned, SGE unsurprisingly produces the closest thing to typical search results, especially when compared directly to Google. So, I suppose the question here is: what is SGE really adding to the searcher’s experience?

“What are the top-rated pizza restaurants in Brighton?”

Bard

For this search query, Bard presents the same restaurants as before (including the London-based one, doh!). However, it does appear to recognize that in asking for the ‘top-rated’ pizza restaurants, the user expects to see some kind of review rating information, and highlights the Google ratings.

Still, it’s strange considering these aren’t actually the top-rated according to Google—and a quick search for the ratings of several other local pizza restaurants easily confirms this.

It also links each restaurant image to a source, including TripAdvisor, one of the brand’s websites, and a local business listing website, none of which reflect the Google review rating. Possibly just the source for the image, but odd logic either way.

SGE

The Search Generative Experience results for this query are laid out similarly to the last query, with five local pizza restaurants shown in a map pack-style format.

Just like Bard, though, these aren’t all actually the top Google-rated pizza restaurants in Brighton.

At first glance, the labels highlighting each restaurant’s pizza style seemed a cool and useful addition… until I realized they were incorrect. Fatto a Mano, for example, doesn’t make sourdough pizza, and I think any Italian would faint if you tried to describe the national chain Pizza Express as Neapolitan!