I read up on the May 2024 data leak. Below is my Google leak summary of what it means for your SEO work.
Everything I list here comes from the leaked documents:
Google leak summary: Content factors
- If content is short, Google checks whether it is original (short content is fine as long as it is original).
- Page titles should match your target keywords. Google checks this match, and Morningscore checks for it too.
- Outlinking (linking to other websites from your article) is apparently not important for SEO – I would still do it though.
- Google records the authors of articles, so add yourself as the author on your website’s articles.
- The date of an article is important (Google wants fresh content)
- Google scores how much a website focuses on one core topic and how far the site drifts from it.
- Mentioning keywords at the top of the content is important.
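Two of the points above, the title match and keywords near the top, are easy to check on your own pages. Here is a minimal Python sketch of how I would do it. The function names and the 100-word cutoff are my own assumptions, not something from the leak, and whatever Google or Morningscore actually runs is likely more involved.

```python
import re


def words(text: str) -> list[str]:
    """Lowercase a string and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def title_matches_keyword(title: str, keyword: str) -> bool:
    """True if every word of the keyword appears somewhere in the title."""
    title_words = set(words(title))
    keyword_words = words(keyword)
    return bool(keyword_words) and all(w in title_words for w in keyword_words)


def keyword_near_top(body: str, keyword: str, first_n_words: int = 100) -> bool:
    """True if the keyword phrase appears within the first N words of the body.

    The 100-word default is an arbitrary assumption for illustration.
    """
    phrase = " ".join(words(keyword))
    if not phrase:
        return False
    top = " ".join(words(body)[:first_n_words])
    # Pad with spaces so "seo" does not match inside a longer word.
    return f" {phrase} " in f" {top} "


if __name__ == "__main__":
    title = "Google Leak Summary: What It Means for SEO"
    body = "This Google leak summary covers content and link building findings."
    print(title_matches_keyword(title, "google leak summary"))
    print(keyword_near_top(body, "google leak summary"))
```

The title check only looks for the individual words, while the top-of-content check looks for the exact phrase. You can loosen or tighten either one depending on how strict you want your audit to be.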
Recent analysis by Hobo SEO shows how Google prioritizes content originality, authoritativeness, and freshness. We added author details to all articles after the leak. This has improved our topical focus and rankings.
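One common way to make an article's author and dates machine-readable is schema.org Article markup in JSON-LD. The leak does not say this is how Google captures authors, so treat the following as a sketch only. The helper name and all the sample values are placeholders I made up.

```python
import json


def article_json_ld(headline: str, author_name: str,
                    date_published: str, date_modified: str) -> str:
    """Build schema.org Article markup as a JSON-LD string.

    Dates are expected as ISO 8601 strings, e.g. "2024-06-01".
    """
    data = {
        "@context": "https://schema.org",
        "@type": "Article",
        "headline": headline,
        "author": {"@type": "Person", "name": author_name},
        "datePublished": date_published,
        "dateModified": date_modified,
    }
    return json.dumps(data, indent=2)


if __name__ == "__main__":
    # Placeholder values for illustration only.
    print(article_json_ld(
        headline="Google leak summary",
        author_name="Jane Doe",
        date_published="2024-06-01",
        date_modified="2026-03-15",
    ))
```

The output goes inside a `<script type="application/ld+json">` tag on the article page. Updating the modified date whenever you refresh the article keeps the markup in line with the freshness point above.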
Link building findings in the Google leak
- If a backlink's anchor text is shown in a large font size, it probably carries more weight (Google at least checks for it).
- Backlinks that send real traffic boost your domain authority and keyword rankings.
- Mentions of a brand are also a ranking signal. Google said this in 2019, but now there is proof.
- Google checks the homepages of backlink sources to see if it trusts them.
- Domain age was listed as a ranking factor. By 2026, this has been de-emphasized – relevance and authority matter more than age.
We now seek backlinks that actually drive visits. It makes a real difference for authority. Brand mentions are worth the effort too.
By 2026, many of these signals have been validated. Google’s updates since the leak have consistently targeted thin content, weak topical focus, and manipulative links – exactly what the leak pointed to. The impact is clearer now than it was in 2024.
Detailed expert breakdown of the leak:
https://www.linkedin.com/posts/shaun-anderson-hobo_the-google-content-warehouse-api-leak-of-activity-7411555028152315904-Y3AK
Comprehensive video discussion on the findings:
https://www.youtube.com/watch?v=tYObbKZS-zA
Update March 2026:
Combined analysis of all leaked ranking factors:
https://www.hobo-web.co.uk/the-google-content-warehouse-leak-2024
Google’s March 2026 core and spam updates are a clear validation of the leak. They directly target manipulative link schemes, low-quality content, and sites with no topical focus. If you ignored the 2024 signals, these updates are a good reason to revisit them.
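If you want to revisit topical focus on your own site, here is a rough way to spot pages that drift from your core topic. This is my own simplified sketch and not Google's method: it compares each page's word counts with the word counts of the whole site using cosine similarity. The function names and sample pages are made up for illustration.

```python
import math
import re
from collections import Counter


def term_vector(text: str) -> Counter:
    """Count how often each lowercase word appears in the text."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two word-count vectors (0.0 to 1.0)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def focus_report(pages: dict[str, str]) -> dict[str, float]:
    """Map each URL to its similarity with the site-wide word profile.

    Low values suggest a page that drifts from what the site is mostly about.
    """
    site = Counter()
    for text in pages.values():
        site.update(term_vector(text))
    return {url: cosine(term_vector(text), site) for url, text in pages.items()}


if __name__ == "__main__":
    # Made-up sample pages: two on-topic, one off-topic.
    pages = {
        "/seo-guide": "seo keywords backlinks ranking seo content",
        "/link-building": "backlinks seo ranking links authority",
        "/pancakes": "pancake recipe flour eggs milk",
    }
    for url, score in sorted(focus_report(pages).items(), key=lambda kv: kv[1]):
        print(f"{url}: {score:.2f}")
```

The pages with the lowest scores are listed first, so they are the ones to review. Raw word counts are crude (common words like "the" inflate every score), so on a real site you would at least want to filter stop words before trusting the numbers.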