Generative Engines Are Breaking Web Analytics and Hurting Their Future

Generative Engines Are Breaking Web Analytics and Hurting Their Future

Search is moving from traditional search engines to generative engines, but traffic from many of these sites isn’t being tracked properly in analytics. It’s their fault, not yours.

I was looking at our LLM filter in Ahrefs Web Analytics and noticed some common generative engines missing from the list. They’re in our filters, but we aren’t seeing any data from them for sites.

Ahrefs Web Analytics filtered to LLM traffic

This invisible traffic problem comes from these systems stripping the referral value. I first noticed this problem with AI Mode in Google, but it’s a common problem for generative engines.

This is most likely a mistake on their part, but in some cases may be intentional. Some of these tools probably want more market share and just made a mistake, while others may not want you to be able to measure traffic from the systems. Google has said the clicks from AI Search are higher quality, but we have no way to verify that.

If you have a website that sends traffic to other sites, you should want it to be tracked properly. In the case of generative engines, I warned that these AI bots need to send that info in order to fulfill their social contract, where they provide traffic to websites, and websites allow these bots to crawl and their data to be used.

There’s a cost to bots crawling your websites and there’s a social contract between search engines and website owners, where search engines add value by sending referral traffic to websites. This is what keeps most websites from blocking search engines like Google, even as Google seems intent on taking more of that traffic for themselves. This social contract extends to generative engines.

I think many site owners want to let these bots learn about their brand, their business, and their products and offerings. But while many people are betting that these systems are the future, they currently run the risk of not adding enough value for website owners.

The first LLM to add more value to users by showing impressions and clicks to website owners will likely have a big advantage. Companies will report on the metrics from that LLM, which will likely increase adoption and prevent more websites from blocking their bot.

The same sentiment is true for attribution. If these generative engines want to win market share, they need to be present in reporting to companies. So far, many are not doing a great job.

noreferrer attribute on the link. This would prevent the referral value from being sent.

ChatGPT is not passing the referrer on in-content links

As expected, there is no referrer shown in the Chrome Dev Tools Console. It comes back empty.

document.referrer
''

In Ahrefs Web Analytics, this is recorded as Unknown, but in Google Analytics it would be classified as Direct. Google lumps traffic from unknown sources and internal website traffic together as Direct, whereas we separate them into Unknown and Internal.

The traffic is treated as Unknown

What’s interesting is that when I looked at the same type of link in a free account, it did not have the noreferrer attribute. It’s tracked properly.

The free account did send the referrer

For lists of links, they were also tracked properly. 

Lists of links were tracked properly

The linkes to Sources in the content and at the bottom of the response are also tracked properly, and they add a URL parameter “?utm_source=chatgpt.com” to the URLs as well. 

Sources at the end are tracked properly and add a parameter

Web Search

Most of the links in Web Search mode had the referrer. I did run into an interesting example when there are multiple references. The top one had a referrer, the other 2 did not.

mixed referrers in web search mode

DeepResearch

For DeepResearch mode, in-content links were attributed properly, but the sources at the end were marked with noreferrer.

HTTP Headers

If you look at the HTTP Headers, you’ll sometimes find a Referrer-Policy header to specify what and how much information gets passed in the referrer. You can use the Ahrefs SEO Toolbar to view this information by going to the HTTP headers tab.

referrer policy can be checked in the HTTP headers with the Ahrefs SEO Toolbar
For ChatGPT, they’ve set a referrer-policy value of “strict-origin-when-cross-origin”. In this case, the downgrade from HTTPS to HTTP would drop the referrer. Any links to pages using HTTP wouldn’t be attributed properly.

AI Mode is marked with noreferrer.

Google AI Mode doesn't pass the referrer

John Mueller from Google has since confirmed it’s a bug and that they will likely fix it.

John Mueller says AI Mode not passing the referrer is a bug
Louise Linehan mentioned that we may be underestimating AI traffic. She specifically mentioned how Copilot disappeared from our analytics tracking system. Since that time, the traffic has returned.

Copilot referrals just disappeared for a few months

What I suspect is that these links were marked as noreferrer during that time period. This shows how code changes can impact your global tracking.

Everything here seemed to be tracked properly now.

That’s not the case with Copilot in Windows. I found no cases where the referrer was passed.

LinkedIn or X.

Similar Posts

  • 10 Best Linux Distros for Hosting 2026,Jan (Top Picks)

    Is your website still running slowly even after you’ve an expensive hosting service? Working on a computer that crashes again and again can be very frustrating. The problem might be with your operating system, not the server. Because the outdated OS lacks new features, creates laggy performance and an unresponsive system. So the only solution…

  • 10+ Best Free Personal Portfolio WordPress Themes in 2026

    Portfolio websites are a key part of any creative professional’s branding. They’re a place to showcase your work and share your expertise with prospective clients. On a personal level, they’re also a nice way to look back on your past achievements. WordPress is the perfect tool for creating an online portfolio. The content management system…

  • 8 Autumn-Inspired CSS & JavaScript Effects

    Every season has a distinct vibe. People celebrate them by wearing seasonal colors and delighting in traditional flavors. Autumn appears to have taken over as a favorite time of year for many. Pumpkin spice, anyone? Yes, the fall season is being celebrated like never before – and for good reason. It’s when we embrace cooler…

  • New and Updated Tutorials for Drupal 11

    TL;DR; We’ve added a new tutorial, Upgrade to Drupal 11. And, we’ve updated all tutorials and code in our Drupal Module Developer Guide for compatibility with Drupal 11. Upgrade to Drupal 11 is new free tutorial in the course, Upgrade Drupal, and part of our Keep Drupal Up-To-Date guide. If your site is on Drupal…

  • Taiwan NSB Alerts Public on Data Risks from TikTok, Weibo, and RedNote Over China Ties

    Jul 05, 2025Ravie LakshmananNational Security / Privacy Taiwan’s National Security Bureau (NSB) has warned that China-developed applications like RedNote (aka Xiaohongshu), Weibo, TikTok, WeChat, and Baidu Cloud pose security risks due to excessive data collection and data transfer to China. The alert comes following an inspection of these apps carried out in coordination with the…