SEO Tags in the wild
I decided to run my own research into the trends and usage of SEO tags across the Alexa Top 1000 websites. Since I did not have access to the Alexa list, I used the top 1000 URLs from HTTPArchive instead.
In this first part of the blog, I summarize my findings on the <link> tag.
One of the first things I checked was the use of the <link> tag. This interested me after I learnt about the foundational rel="canonical" tag and a less common but interesting use of hreflang for geo-targeting.
For the top websites, I assumed that "canonical" would be quite relevant and that hreflang would play an important role in geo-targeting. However, my research had a limitation: I fetched only the home page of each site, so I expected to see fewer canonical links than a deeper crawl would show.
I was correct in this assumption. Here's the distribution I saw across the top 25 uses of "link rel":
I have uploaded the raw data here: https://gist.github.com/akshayranganath/9e953a32c6ed8066f006301bc02bef1c.
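For readers who want to try a similar count, here is a minimal sketch in Python. It is not the script used for this research. It assumes the home pages have already been downloaded as HTML strings. The names `LinkRelCollector`, `count_link_rels` and `SAMPLE_PAGES` are hypothetical, and splitting multi-valued rel attributes into separate tokens is my own choice.

```python
from collections import Counter
from html.parser import HTMLParser


class LinkRelCollector(HTMLParser):
    """Collects the rel tokens of every <link> tag in one HTML document."""

    def __init__(self):
        super().__init__()
        self.rels = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        rel = dict(attrs).get("rel")
        if rel:
            # rel may hold several space-separated tokens, e.g. "shortcut icon".
            self.rels.extend(rel.lower().split())


def count_link_rels(pages):
    """Return a Counter of rel tokens across an iterable of HTML strings."""
    counts = Counter()
    for html in pages:
        collector = LinkRelCollector()
        collector.feed(html)
        counts.update(collector.rels)
    return counts


# Hypothetical sample pages standing in for downloaded home pages.
SAMPLE_PAGES = [
    '<html><head><link rel="canonical" href="https://example.com/">'
    '<link rel="alternate" hreflang="de" href="https://example.com/de/">'
    "</head></html>",
    '<html><head><link rel="alternate" hreflang="fr" href="https://example.org/fr/">'
    '<link rel="dns-prefetch" href="//cdn.example.org"></head></html>',
]

# The 25 most frequent rel values, mirroring the "top 25" cut used in this post.
top_rels = count_link_rels(SAMPLE_PAGES).most_common(25)
```

Counting tokens rather than whole attribute values keeps a value like "shortcut icon" from showing up as its own category, but either approach would be reasonable.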
Here’s the distribution again:
When I looked at the tags, I wasn't sure how some of them were used and had to do a bit of reading. Here's a summary of some of the tags:
- rel="search": As a user, you can configure alternate search engines like "imdb" in your browser. This tag hints that your site offers such a search capability. The concept is called OpenSearch and you can read about its use at Aaron Parecki's blog. And here's a sample OpenSearch file from Airbnb: https://www.airbnb.com/opensearch.xml.
- rel="image_src": As a site owner, you may have a preference on which image is used as an icon to represent the website. This tag can be used to specify that icon. For more, read this Stack Overflow discussion.
- rel="next": If you have a paginated website, you can use this tag to hint to search engine bots which page comes next. An associated tag is rel="prev", which points back to the previous page. More details at the Google Webmaster Tools blog.
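To make the tags above concrete, here is a small Python sketch that pulls the SEO-related <link> tags out of one page. The sample markup, the example.com URLs and the names `SeoLinkParser`, `extract_seo_links` and `SEO_RELS` are all made up for illustration. The set of rel values treated as SEO-related is my assumption, based only on the tags discussed in this post.

```python
from html.parser import HTMLParser

# rel values discussed in this post; the grouping is an assumption.
SEO_RELS = {"canonical", "alternate", "search", "image_src", "next", "prev"}


class SeoLinkParser(HTMLParser):
    """Records rel, href and hreflang for SEO-related <link> tags."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attributes = dict(attrs)
        for token in (attributes.get("rel") or "").lower().split():
            if token in SEO_RELS:
                self.links.append({
                    "rel": token,
                    "href": attributes.get("href"),
                    "hreflang": attributes.get("hreflang"),
                })


def extract_seo_links(html):
    """Return the SEO-related links found in one HTML string."""
    parser = SeoLinkParser()
    parser.feed(html)
    return parser.links


# Hypothetical page 2 of a paginated listing with a German alternate.
SAMPLE_PAGE = """
<html><head>
<link rel="canonical" href="https://example.com/list?page=2">
<link rel="prev" href="https://example.com/list?page=1">
<link rel="next" href="https://example.com/list?page=3">
<link rel="alternate" hreflang="de" href="https://example.com/de/list?page=2">
<link rel="search" type="application/opensearchdescription+xml" href="/opensearch.xml">
<link rel="stylesheet" href="/site.css">
</head></html>
"""

seo_links = extract_seo_links(SAMPLE_PAGE)
for link in seo_links:
    print(link["rel"], link["href"], link["hreflang"])
```

The stylesheet link is skipped because its rel value is not in `SEO_RELS`.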
In the top 1000 sites I sampled, <link> tags are heavily used for SEO purposes, but the majority of the usage is for geo-targeting. A lot of other nifty mechanisms are also in use, like specifying the pagination or the preferred icon image. However, these seem to be less popular than the basic uses of geo-targeting and canonicalization.
Apart from SEO, the other use of the <link> tag appears to be for performance optimization. Specifically, "resource hints" in the form of dns-prefetch, preconnect and preload are being used, and they feature in the top 25 uses of the <link rel> tag. To know more about this, have a look at Yoav Weiss's presentation from Velocity New York.
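As a rough illustration of what these resource hints look like in markup, the Python sketch below lists the dns-prefetch, preconnect and preload links on one page. The sample markup, the example.com hosts and the names `ResourceHintParser` and `find_resource_hints` are hypothetical. Other hint types are deliberately left out because this post does not discuss them.

```python
from html.parser import HTMLParser

# The three resource hints named in this post.
RESOURCE_HINTS = {"dns-prefetch", "preconnect", "preload"}


class ResourceHintParser(HTMLParser):
    """Records (rel, href, as) for each resource-hint <link> tag."""

    def __init__(self):
        super().__init__()
        self.hints = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attributes = dict(attrs)
        for token in (attributes.get("rel") or "").lower().split():
            if token in RESOURCE_HINTS:
                # "as" is only meaningful for preload; it is None otherwise.
                self.hints.append(
                    (token, attributes.get("href"), attributes.get("as"))
                )


def find_resource_hints(html):
    """Return the resource hints found in one HTML string."""
    parser = ResourceHintParser()
    parser.feed(html)
    return parser.hints


# Hypothetical head section using all three hints.
SAMPLE_HEAD = """
<head>
<link rel="dns-prefetch" href="//cdn.example.com">
<link rel="preconnect" href="https://fonts.example.com" crossorigin>
<link rel="preload" href="/app.js" as="script">
<link rel="canonical" href="https://example.com/">
</head>
"""

hints = find_resource_hints(SAMPLE_HEAD)
```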