{"id":172884,"date":"2024-03-13T12:14:56","date_gmt":"2024-03-13T17:14:56","guid":{"rendered":"https:\/\/ahrefs.com\/blog\/?p=172884"},"modified":"2025-06-23T18:18:33","modified_gmt":"2025-06-23T23:18:33","slug":"link-index-comparison","status":"publish","type":"post","link":"https:\/\/ahrefs.com\/blog\/link-index-comparison\/","title":{"rendered":"You Can\u2019t Compare Backlink Counts in SEO Tools: Here\u2019s Why"},"content":{"rendered":"\n<div class=\"intro-txt\">Google knows about 300T pages on the web. It\u2019s doubtful they crawl all of those, and at least according to some documents from their antitrust trial we learned they only indexed 400B. That\u2019s around .133% of the pages they know about, roughly 1 out of every 752&nbsp;pages.<\/div>\n\n\n\n<p>For Ahrefs, we choose to store about 340B pages in our index as of December 2023.<\/p>\n\n\n\n<p>At a certain point, the quality of the web becomes bad. There are lots of spam and junk pages that just add noise to the data without adding any value to the&nbsp;index.<\/p>\n\n\n\n<p>Large parts of the web are also duplicate content, <a href=\"https:\/\/twitter.com\/lilyraynyc\/status\/1509176261884747781\">~60% according to Google\u2019s Gary Illyes<\/a>. Most of this is technical duplication caused by different systems. However, if you don\u2019t account for this duplication, it can waste more resources and create more noise in the&nbsp;data.<\/p>\n\n\n\n<p>When building an index of the web, companies have to make many choices around crawling, parsing, and indexing data. While there\u2019s going to be a lot of overlap between indexes, there\u2019s also going to be some differences depending on each company\u2019s decisions.<\/p>\n\n\n\n<p>Comparing link indexes is hard because of all the different choices the various tools have made. I try my best to make some comparisons more fair, but even for a few sites I\u2019m telling you that I don\u2019t want to put in all of the work needed to make an accurate comparison, much less do it for an entire study. You\u2019ll see why I say this later when you read what it would take to compare the data accurately.<\/p>\n\n\n\n<p>However, I did run some tests on a sample of sites and I\u2019ll show you how to check the data yourself. I also pulled some fairly large 3rd party data samples for some additional validation.<\/p>\n\n\n\n<p>Let\u2019s dive&nbsp;in.<\/p>\n\n\n<div class=\"intro-tok\" id=\"intro_tok\" style=\"display:none;\"><div class=\"intro-title\">Contents<\/div><a href=\"#\" class=\"expand-dots\"><span><\/span><span><\/span><span><\/span><\/a><\/div>\n\n\n\n<div class=\"post-nav-link clearfix\" id=\"section1\"><a class=\"subhead-anchor\" data-tip=\"tooltip__copielink\" rel=\"#section1\"><svg width=\"19\" height=\"19\" viewBox=\"0 0 14 14\" style><g fill=\"none\" fill-rule=\"evenodd\"><path d=\"M0 0h14v14H0z\" \/><path d=\"M7.45 9.887l-1.62 1.621c-.92.92-2.418.92-3.338 0a2.364 2.364 0 0 1 0-3.339l1.62-1.62-1.273-1.272-1.62 1.62a4.161 4.161 0 1 0 5.885 5.884l1.62-1.62L7.45 9.886zM5.527 5.135L7.17 3.492c.92-.92 2.418-.92 3.339 0 .92.92.92 2.418 0 3.339L8.866 8.473l1.272 1.273 1.644-1.643A4.161 4.161 0 1 0 5.897 2.22L4.254 3.863l1.272 1.272zm-.66 3.998a.749.749 0 0 1 0-1.06l2.208-2.206a.749.749 0 1 1 1.06 1.06L5.928 9.133a.75.75 0 0 1-1.061 0z\" style \/><\/g><\/svg><\/a><div class=\"link-text\" data-anchor=\"Numbers often include different data\" data-section=\"numbers-often-include-different-data\">\n\n\n\n<h2 class=\"wp-block-heading\"><a id=\"post-172884-_e5fv4dcq6hvf\"><\/a>Numbers often include different data<\/h2>\n\n\n\n<\/div><\/div>\n\n\n\n<p>If you just looked at dashboard numbers for links and RDs in different tools you might see completely different things.<\/p>\n\n\n\n<p>For example, here\u2019s what we count in Ahrefs:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Live links<\/li>\n\n\n\n<li>Live RDs<\/li>\n\n\n\n<li>6 months of&nbsp;data<\/li>\n<\/ul>\n\n\n\n<p>In Semrush, here\u2019s what they&nbsp;count:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Live + dead&nbsp;links<\/li>\n\n\n\n<li>Live + dead&nbsp;RDs<\/li>\n\n\n\n<li>6 months of data + a bit&nbsp;more*<\/li>\n<\/ul>\n\n\n\n<p>*By a bit more, what I mean is that their data goes back 6 months and to the start of the previous month. So, for instance, if it\u2019s the 15th of the month, they would actually have about 6.5 months of data instead of 6 months of data. If it\u2019s the last week of the month, they may have close to 7 months of data instead of&nbsp;6.<\/p>\n\n\n\n<p>This may not seem like a lot, but it can increase the numbers shown by a lot, especially when you\u2019re still counting dead links and dead&nbsp;RDs.<\/p>\n\n\n\n<p>I don\u2019t think SEOs want to see a number that includes dead links. I don\u2019t see a good reason to count them, either, other than to have bigger and potentially misleading numbers.<\/p>\n\n\n\n<p>I only say this because I\u2019ve called Semrush out on making this type of biased comparison before on Twitter, but I stopped arguing when I realized that they really didn\u2019t want the comparison to be fair; they just wanted to win the comparison.<\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-rich is-provider-twitter wp-block-embed-twitter\">\n<div class=\"wp-block-embed__wrapper\">https:\/\/twitter.com\/patrickstox\/status\/1382700507882532869<\/div>\n<\/figure>\n\n\n\n<div class=\"post-nav-link clearfix\" id=\"section1\"><a class=\"subhead-anchor\" data-tip=\"tooltip__copielink\" rel=\"#section1\"><svg width=\"19\" height=\"19\" viewBox=\"0 0 14 14\" style><g fill=\"none\" fill-rule=\"evenodd\"><path d=\"M0 0h14v14H0z\" \/><path d=\"M7.45 9.887l-1.62 1.621c-.92.92-2.418.92-3.338 0a2.364 2.364 0 0 1 0-3.339l1.62-1.62-1.273-1.272-1.62 1.62a4.161 4.161 0 1 0 5.885 5.884l1.62-1.62L7.45 9.886zM5.527 5.135L7.17 3.492c.92-.92 2.418-.92 3.339 0 .92.92.92 2.418 0 3.339L8.866 8.473l1.272 1.273 1.644-1.643A4.161 4.161 0 1 0 5.897 2.22L4.254 3.863l1.272 1.272zm-.66 3.998a.749.749 0 0 1 0-1.06l2.208-2.206a.749.749 0 1 1 1.06 1.06L5.928 9.133a.75.75 0 0 1-1.061 0z\" style \/><\/g><\/svg><\/a><div class=\"link-text\" data-anchor=\"A more accurate, but still not accurate way to compare links\" data-section=\"a-more-accurate-but-still-not-accurate-way-to-compare-links\">\n\n\n\n<h2 class=\"wp-block-heading\"><a id=\"post-172884-_i95rqxms4e7m\"><\/a>A more accurate, but still not accurate way to compare links<\/h2>\n\n\n\n<\/div><\/div>\n\n\n\n<p>There are <em>some<\/em> ways you can compare the data to get somewhat similar time periods and only look at active links.<\/p>\n\n\n\n<p>If you filter the Semrush backlinks report for \u201cActive\u201d links, you\u2019ll have a somewhat more accurate number to compare against the Ahrefs dashboard number.<\/p>\n\n\n\n<p>Alternatively, if you use the \u201cShow history: Last 6 months\u201d option in the Ahrefs backlink report, this would include lost links and be a fairer comparison to Semrush\u2019s dashboard number.<\/p>\n\n\n\n<p>Here\u2019s an example of how to get more similar data:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Semrush Dashboard: 5.1K = Ahrefs (6-month date comparison): 5.6K<\/li>\n\n\n\n<li>Semrush All Links: 5.1K = Ahrefs (6-month date comparison): 5.6K<\/li>\n\n\n\n<li>Semrush Active Links: 2.9K = Ahrefs Dashboard: 3.5K = Ahrefs (no date comparison): 3.5K<\/li>\n<\/ul>\n\n\n\n<p><strong>What you should not compare is Semrush Dashboard and Ahrefs Dashboard numbers. <\/strong>The number in Semrush (5.1K) includes dead links. The number in Ahrefs (3.5K) doesn\u2019t; it\u2019s only live&nbsp;links!<\/p>\n\n\n\n<p>Note that the time periods may not be exactly the same as mentioned before because of the extra days in the Semrush data. You could look at what day their data stops and select that exact day in the Ahrefs data to get an even more accurate, but still not quite accurate comparison.<\/p>\n\n\n\n<p>I don\u2019t think the comparison works at all with larger domains because of an issue in Semrush. Here\u2019s what I saw for semrush.com:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Semrush Dashboard: 48.7M = Ahrefs (6 month date comparison): 24.7M<\/li>\n\n\n\n<li>Semrush All Links: 48.7M = Ahrefs (6 month date comparison): 24.7M<\/li>\n\n\n\n<li>Semrush Active Links: 1.8M = Ahrefs Dashboard: 15.9M = Ahrefs (no date comparison): 15.9M<\/li>\n<\/ul>\n\n\n\n<p>So that\u2019s 1.8M active links in Semrush vs 15.9M active in Ahrefs. But as I said, I don\u2019t think this is a fair comparison. Semrush seems to have an issue with larger sites. There is a warning in Semrush that says, \u201cDue to the size of the analyzed domain, only the most relevant links will be shown.\u201d It\u2019s possible they\u2019re not showing all the links, but this is suspicious because they will show the total for all links which is a larger number, and I can filter those in other&nbsp;ways.<\/p>\n\n\n\n<p>I can also sort normally by the oldest last seen date and see all the links, but when I do last seen + active, I see only 608K links. I can\u2019t get more than 50k rows in their system to investigate this further, but something is fishy&nbsp;here.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><a id=\"post-172884-_h4k6t51l6u2t\"><\/a>More link differences<\/h3>\n\n\n\n<p>The above comparison wouldn\u2019t be enough to make an accurate comparison. There are still a number of differences and problems that make any sort of comparison troublesome.<\/p>\n\n\n\n<p>This tweet is as relevant as the day I wrote&nbsp;it:<\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-rich is-provider-twitter wp-block-embed-twitter\">\n<div class=\"wp-block-embed__wrapper\">https:\/\/twitter.com\/patrickstox\/status\/1354501093204619265<\/div>\n<\/figure>\n\n\n\n<h4 class=\"wp-block-heading\"><a id=\"post-172884-_askax7u5wmd0\"><\/a>It\u2019s almost impossible to do a fair link comparison<\/h4>\n\n\n\n<p>Here\u2019s <a href=\"https:\/\/ahrefs.com\/blog\/how-ahrefs-counts-links\/\">how we count links<\/a>, but it\u2019s worth mentioning that each tool counts links in different ways.<\/p>\n\n\n\n<p>To recap some of the main points, here are some things we&nbsp;do:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>We store some links inserted with JavaScript, no one else does this. We render ~250M pages a&nbsp;day.<\/li>\n\n\n\n<li>We have a <a href=\"https:\/\/ahrefs.com\/blog\/canonicalization\/\">canonicalization system<\/a> in place that others may not, which means we shouldn\u2019t count as many duplicates as others do.<\/li>\n\n\n\n<li>Our crawler tries to be intelligent about what to prioritize for crawling to avoid spam and things like infinite crawl&nbsp;paths.<\/li>\n\n\n\n<li>We count one link per page, others may count multiple links per&nbsp;page.<\/li>\n<\/ul>\n\n\n\n<p>These differences make a fair link comparison nearly impossible to&nbsp;do.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><a id=\"post-172884-_zd7cdp6a8bng\"><\/a>How to see where the biggest link differences are<\/h4>\n\n\n\n<p>The easiest way to see the biggest discrepancies in link totals is to go to the Referring Domains reports in the tools and sort by the number of links. You can use the dropdowns to see what kinds of issues each index may have with overcounting some links. In many cases, you\u2019re likely to see millions of links from the same site for some of the reasons mentioned above.<\/p>\n\n\n\n<p>For example, when I looked in Semrush I found blogspot links that they claimed to have recently checked, but these are showing 404 when I visit them. Semrush still counts them for some reason. I saw this issue on multiple domains I checked. This is one of those&nbsp;pages:<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"1494\" height=\"1254\" class=\"wp-image-172885\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/semrush-counting-links-on-404-pages.jpg\" alt=\"Semrush counting links on 404 pages\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/semrush-counting-links-on-404-pages.jpg 1494w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/semrush-counting-links-on-404-pages-506x425.jpg 506w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/semrush-counting-links-on-404-pages-768x645.jpg 768w\" sizes=\"auto, (max-width: 1494px) 100vw, 1494px\"><\/figure>\n\n\n\n<h4 class=\"wp-block-heading\"><a id=\"post-172884-_dssjf022waam\"><\/a>Lots of links counted as live are actually dead<\/h4>\n\n\n\n<p>Seeing the dead link above counted in the total made me want to check how many dead links were in each index. I ran crawls on the list of the most recent live links in each tool to see how many were actually still&nbsp;live.<\/p>\n\n\n\n<p>For Semrush, 49.6% of the links they said were live were actually dead. Some churn is expected as the web changes, but half the links in 6 months indicates that a lot of these may be on the spammier part of the web that isn\u2019t as stable or they\u2019re not re-crawling the links often. For some context, the same number for Ahrefs came back as 17.2%&nbsp;dead.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><a id=\"post-172884-_a5nwcwfifw8s\"><\/a>It\u2019s going to get more complicated to compare these numbers<\/h4>\n\n\n\n<p>Ahrefs recently added a filter for \u201cBest links\u201d which you can configure to filter out noise. For instance, if you want to remove all blogspot.com blogs from the report, you can add a filter for&nbsp;it.<\/p>\n\n\n\n<figure class=\"wp-block-image is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"982\" height=\"1484\" class=\"wp-image-172886\" style=\"width: 608px; height: auto;\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/ahrefs-best-links-filter.png\" alt=\"Ahrefs' Best links filter\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/ahrefs-best-links-filter.png 982w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/ahrefs-best-links-filter-281x425.png 281w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/ahrefs-best-links-filter-768x1161.png 768w\" sizes=\"auto, (max-width: 982px) 100vw, 982px\"><\/figure>\n\n\n\n<p>This means you\u2019ll only see links you consider important in the reports. This can also be applied to the main dashboard numbers and charts now. If the filter is active, people will see different numbers depending on their settings.<\/p>\n\n\n\n<p>This also leads to another point about granularity of data. Ahrefs has 77 data points around each link. Semrush has 22. If you really need to slice and dice the link data, Ahrefs is going to let you do it in more&nbsp;ways.<\/p>\n\n\n\n<div class=\"post-nav-link clearfix\" id=\"section1\"><a class=\"subhead-anchor\" data-tip=\"tooltip__copielink\" rel=\"#section1\"><svg width=\"19\" height=\"19\" viewBox=\"0 0 14 14\" style><g fill=\"none\" fill-rule=\"evenodd\"><path d=\"M0 0h14v14H0z\" \/><path d=\"M7.45 9.887l-1.62 1.621c-.92.92-2.418.92-3.338 0a2.364 2.364 0 0 1 0-3.339l1.62-1.62-1.273-1.272-1.62 1.62a4.161 4.161 0 1 0 5.885 5.884l1.62-1.62L7.45 9.886zM5.527 5.135L7.17 3.492c.92-.92 2.418-.92 3.339 0 .92.92.92 2.418 0 3.339L8.866 8.473l1.272 1.273 1.644-1.643A4.161 4.161 0 1 0 5.897 2.22L4.254 3.863l1.272 1.272zm-.66 3.998a.749.749 0 0 1 0-1.06l2.208-2.206a.749.749 0 1 1 1.06 1.06L5.928 9.133a.75.75 0 0 1-1.061 0z\" style \/><\/g><\/svg><\/a><div class=\"link-text\" data-anchor=\"Can you compare RDs?\" data-section=\"can-you-compare-rds\">\n\n\n\n<h2 class=\"wp-block-heading\"><a id=\"post-172884-_as3yr6834x2i\"><\/a>Can you compare RDs?<\/h2>\n\n\n\n<\/div><\/div>\n\n\n\n<p>You would think this is straightforward, but it\u2019s&nbsp;not.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><a id=\"post-172884-_369r96lwbriq\"><\/a>Solving for all the issues is a lot of&nbsp;work<\/h3>\n\n\n\n<p>There are a lot of different things you\u2019d have to solve for&nbsp;here:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>The extra days in Semrush\u2019s data that you\u2019ll have to remove or add to the Ahrefs number.<\/li>\n\n\n\n<li>Remember that Semrush also includes dead RDs in their dashboard numbers. So you need to filter their RD report to just \u201cActive\u201d to get the live&nbsp;ones.<\/li>\n\n\n\n<li>Remember that half the links in the test of Semrush live data were actually dead, so I would suspect that a number of the RDs are actually lost as well. You could possibly look for domains with low link counts and just crawl the listed links from those to remove most of the dead&nbsp;ones.<\/li>\n\n\n\n<li>After all that, you\u2019re still going to need to strip the domains down to the root domain only to account for the differences in what each tool may be counting as a domain.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\"><a id=\"post-172884-_o54lmtm7p1ro\"><\/a>What is a domain?<\/h3>\n\n\n\n<p>Ahrefs currently shows 206.3M RDs in our database and Semrush shows 1.6B. Domains are being counted in extremely different ways between the&nbsp;tools.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"1600\" height=\"573\" class=\"wp-image-172887\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/ahrefs-has-340b-pages-and-206m-domains-in-the-inde.png\" alt=\"Ahrefs has 340B pages and 206M domains in the index\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/ahrefs-has-340b-pages-and-206m-domains-in-the-inde.png 1600w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/ahrefs-has-340b-pages-and-206m-domains-in-the-inde-680x244.png 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/ahrefs-has-340b-pages-and-206m-domains-in-the-inde-768x275.png 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/ahrefs-has-340b-pages-and-206m-domains-in-the-inde-1536x550.png 1536w\" sizes=\"auto, (max-width: 1600px) 100vw, 1600px\"><\/figure>\n\n\n\n<p>According to the major sources who look at these kinds of things, the number of domains on the internet seems to be between <a href=\"https:\/\/www.netcraft.com\/blog\/november-2023-web-server-survey\">269M<\/a>-<a href=\"https:\/\/dnib.com\/articles\/the-domain-name-industry-brief-q3-2023\">359M<\/a> and the number of websites between <a href=\"https:\/\/www.netcraft.com\/blog\/november-2023-web-server-survey\">1.1B<\/a>-<a href=\"https:\/\/www.internetlivestats.com\/total-number-of-websites\/\">1.5B<\/a>, with <a href=\"https:\/\/www.netcraft.com\/blog\/november-2023-web-server-survey\">191M<\/a>-<a href=\"https:\/\/www.internetlivestats.com\/total-number-of-websites\/\">200M<\/a> of them being active.<\/p>\n\n\n\n<p>Semrush\u2019s number of RDs is higher than the number of domains that&nbsp;exist.<\/p>\n\n\n\n<p>I believe Semrush may be confusing different terms. Their numbers match fairly closely with the number of websites on the internet, but that\u2019s not the same as the number of domains. Plus, many of those websites aren\u2019t even&nbsp;live.<\/p>\n<p>Another possibility is they\u2019re counting dead domains. If we do that, our comparable number is 1.86B to their 1.6B.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><a id=\"post-172884-_o2m4t1e12em1\"><\/a>It\u2019s going to get more complicated to compare these numbers<\/h4>\n\n\n\n<p>Part of our process is dropping spam domains, and we also treat some subdomains as different domains. We come up close to the numbers from other 3rd party studies for the number of active websites and domains, whereas Semrush seems to come in closer to the total number of websites (including inactive ones).<\/p>\n\n\n\n<p>We\u2019re going to simplify our methodology soon so that one domain is actually just one domain. This is going to make our RD numbers go down, but be more accurate to what people actually consider a domain. It\u2019s also going to make for an even bigger disparity in the numbers between the&nbsp;tools.<\/p>\n\n\n\n<div class=\"post-nav-link clearfix\" id=\"section1\"><a class=\"subhead-anchor\" data-tip=\"tooltip__copielink\" rel=\"#section1\"><svg width=\"19\" height=\"19\" viewBox=\"0 0 14 14\" style><g fill=\"none\" fill-rule=\"evenodd\"><path d=\"M0 0h14v14H0z\" \/><path d=\"M7.45 9.887l-1.62 1.621c-.92.92-2.418.92-3.338 0a2.364 2.364 0 0 1 0-3.339l1.62-1.62-1.273-1.272-1.62 1.62a4.161 4.161 0 1 0 5.885 5.884l1.62-1.62L7.45 9.886zM5.527 5.135L7.17 3.492c.92-.92 2.418-.92 3.339 0 .92.92.92 2.418 0 3.339L8.866 8.473l1.272 1.273 1.644-1.643A4.161 4.161 0 1 0 5.897 2.22L4.254 3.863l1.272 1.272zm-.66 3.998a.749.749 0 0 1 0-1.06l2.208-2.206a.749.749 0 1 1 1.06 1.06L5.928 9.133a.75.75 0 0 1-1.061 0z\" style \/><\/g><\/svg><\/a><div class=\"link-text\" data-anchor=\"Data freshness \/ Update speed\" data-section=\"data-freshness-update-speed\">\n\n\n\n<h2 class=\"wp-block-heading\"><a id=\"post-172884-_j99d9fiujjei\"><\/a>Data freshness \/ Update speed<\/h2>\n\n\n\n<\/div><\/div>\n\n\n\n<p>I ran some quality checks for both the first-seen and last-seen link data. On every site I checked, Ahrefs picked up more links first and on most Ahrefs updated the links more recently than Semrush. Don\u2019t just believe me, though; check for yourself.<\/p>\n\n\n\n<p>Comparing this is biased no matter how you look at it because our data is more granular and includes the hours and minutes instead of just the day. Leaving the hours and minutes creates a biased comparison, and so does removing it. You\u2019ll have to match the URLs and check which date is first or if there is a tie and then count the totals. There will be some different links in each dataset, so you\u2019ll need to do the lookups on each set of data for comparison.<\/p>\n\n\n\n<p>Semrush claims, \u201cWe update the backlinks data in the interface every 15 minutes.\u201d<\/p>\n\n\n\n<p>Ahrefs claims, \u201cThe world\u2019s largest index of live backlinks, updated with fresh data every 15\u201330 minutes.\u201d<\/p>\n\n\n\n<p>I pulled data at the same time from both tools to see when the latest links for some popular websites were found. Here\u2019s a summary table:<\/p>\n\n\n\n<table id=\"tablepress-299\" class=\"tablepress tablepress-id-299 tablepress-responsive tablepress-ahrefs-width-720px\">\n<thead>\n<tr class=\"row-1 odd\">\n\t<th class=\"column-1\">Domain<\/th><th class=\"column-2\">Ahrefs Latest<\/th><th class=\"column-3\">Semrush latest<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr class=\"row-2 even\">\n\t<td class=\"column-1\">semrush.com<\/td><td class=\"column-2\">3 minutes ago<\/td><td class=\"column-3\">7 days&nbsp;ago<\/td>\n<\/tr>\n<tr class=\"row-3 odd\">\n\t<td class=\"column-1\">ahrefs.com<\/td><td class=\"column-2\">2 minutes ago<\/td><td class=\"column-3\">5 days&nbsp;ago<\/td>\n<\/tr>\n<tr class=\"row-4 even\">\n\t<td class=\"column-1\">hubspot.com<\/td><td class=\"column-2\">0 minutes ago<\/td><td class=\"column-3\">9 days&nbsp;ago<\/td>\n<\/tr>\n<tr class=\"row-5 odd\">\n\t<td class=\"column-1\">foxnews.com<\/td><td class=\"column-2\">1 minute ago<\/td><td class=\"column-3\">12 days&nbsp;ago<\/td>\n<\/tr>\n<tr class=\"row-6 even\">\n\t<td class=\"column-1\">cnn.com<\/td><td class=\"column-2\">0 minutes ago<\/td><td class=\"column-3\">13 days&nbsp;ago<\/td>\n<\/tr>\n<tr class=\"row-7 odd\">\n\t<td class=\"column-1\">amazon.com<\/td><td class=\"column-2\">0 minutes ago<\/td><td class=\"column-3\">6 days&nbsp;ago<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n\n\n\n<p>That doesn\u2019t seem fresh at all. Their 15-minute update claim seems pretty dubious to me with so many websites not having updates for many&nbsp;days.<\/p>\n\n\n\n<p>In fairness, for some smaller sites it was more mixed on who showed fresher data. I think they may have some issues with the processing of larger sites.<br><br>One day after this post was published, Semrush is showing 7 links from 2 RDs and Ahrefs is showing 120 links from 19&nbsp;RDs.<\/p>\n\n\n\n<p>Don\u2019t just trust me, though; I encourage you to check some websites yourself. Go into the backlinks reports in both tools and sort by last seen. Be sure to share your results on social media.<\/p>\n\n\n\n<div class=\"recommendation\"><div class=\"recommendation-title\">Ahrefs now receives data from IndexNow<\/div><div class=\"recommendation-content\">\n\n\n\n<p>This will make our data even fresher. That\u2019s ~2.5B URLs \/ day in March 2024. The websites tell us about new pages, deleted pages, or any changes they make so that we can go crawl them and update the data. Read more <a href=\"https:\/\/ahrefs.com\/index-now\/\" data-ahr=\"https:\/\/ahrefs.com\/blog\/indexnow-yep-ahrefs\/\">here<\/a>.<\/p>\n\n\n\n<\/div><\/div>\n\n\n\n<div class=\"post-nav-link clearfix\" id=\"section1\"><a class=\"subhead-anchor\" data-tip=\"tooltip__copielink\" rel=\"#section1\"><svg width=\"19\" height=\"19\" viewBox=\"0 0 14 14\" style><g fill=\"none\" fill-rule=\"evenodd\"><path d=\"M0 0h14v14H0z\" \/><path d=\"M7.45 9.887l-1.62 1.621c-.92.92-2.418.92-3.338 0a2.364 2.364 0 0 1 0-3.339l1.62-1.62-1.273-1.272-1.62 1.62a4.161 4.161 0 1 0 5.885 5.884l1.62-1.62L7.45 9.886zM5.527 5.135L7.17 3.492c.92-.92 2.418-.92 3.339 0 .92.92.92 2.418 0 3.339L8.866 8.473l1.272 1.273 1.644-1.643A4.161 4.161 0 1 0 5.897 2.22L4.254 3.863l1.272 1.272zm-.66 3.998a.749.749 0 0 1 0-1.06l2.208-2.206a.749.749 0 1 1 1.06 1.06L5.928 9.133a.75.75 0 0 1-1.061 0z\" style \/><\/g><\/svg><\/a><div class=\"link-text\" data-anchor=\"Crawl speed\" data-section=\"crawl-speed\">\n\n\n\n<h2 class=\"wp-block-heading\"><a id=\"post-172884-_el9o38yaiktj\"><\/a>Crawl speed<\/h2>\n\n\n\n<\/div><\/div>\n\n\n\n<p>Ahrefs crawls 7B+ pages every day. Semrush claims they crawl 25B pages per day. This would be ~3.5x what Ahrefs crawls per day. The problem is that I can\u2019t find any evidence that they crawl that&nbsp;fast.<\/p>\n\n\n\n<p>We saw that around half the links that Semrush had marked as active were actually dead compared to about 17% in Ahrefs, which indicated to me that they may not re-crawl links as often. That and the freshness test both pointed to them crawling slower. I decided to look into&nbsp;it.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><a id=\"post-172884-_62l5olpej3hw\"><\/a>Logs of my&nbsp;sites<\/h4>\n\n\n\n<p>I checked the logs of some of my sites and sites I have access to, and I didn\u2019t see anything to support the claim that Semrush crawls faster. If you have access to logs of your own site, you should be able to check which bots are crawling the fastest.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><a id=\"post-172884-_1c9ctd8vbbbt\"><\/a>80,000 months of log&nbsp;data<\/h4>\n\n\n\n<p>I was curious and wanted to look at bigger samples. I used <a href=\"https:\/\/ahrefs.com\/\" data-ahr=\"https:\/\/ahrefs.com\/blog\/web-explorer\/\">Web Explorer<\/a> and a few different footprints (patterns) to find log file summaries produced by AWStats and Webalizer. These are often published on the&nbsp;web.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"1519\" height=\"1383\" class=\"wp-image-172888\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/web-explorer-search-i-used-to-find-log-files-on-th.png\" alt=\"Web Explorer search I used to find log files on the web\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/web-explorer-search-i-used-to-find-log-files-on-th.png 1519w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/web-explorer-search-i-used-to-find-log-files-on-th-467x425.png 467w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/web-explorer-search-i-used-to-find-log-files-on-th-768x699.png 768w\" sizes=\"auto, (max-width: 1519px) 100vw, 1519px\"><\/figure>\n\n\n\n<p>I scraped and parsed ~80,000 log file summaries that contained 1 month of data each and were generated in the last couple of years. This sample contained over 9k websites in&nbsp;total.<\/p>\n\n\n\n<p>I did not see evidence of Semrush crawling many times faster than Ahrefs for these sites, as they claim they do. The only bot that was crawling much faster than Ahrefsbot in this dataset was <a href=\"https:\/\/ahrefs.com\/blog\/googlebot\/\">Googlebot<\/a>. Even other search engines were behind our crawl&nbsp;rate.<\/p>\n\n\n\n<p>That\u2019s just data from a small-ish number of sites compared to the scale of the web. What about for a larger chunk of the&nbsp;web?<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><a id=\"post-172884-_757dqzsn2i28\"><\/a>Data from 20%+ of web traffic<\/h4>\n\n\n\n<p>At the time of writing, <a href=\"https:\/\/radar.cloudflare.com\/traffic\/verified-bots\">Cloudflare Radar<\/a> has Ahrefsbot as the #7 most active bot on the web and Semrushbot at #40.<\/p>\n\n\n\n<p>While this isn\u2019t a complete picture of the web, it\u2019s a fairly large chunk. In 2021, Cloudflare was said to manage <a href=\"https:\/\/twitter.com\/AxelrodG\/status\/1447938954758705155\">~20% of the web\u2019s traffic<\/a>, up from ~10% in 2018. It\u2019s likely much higher now with that kind of growth. I couldn\u2019t find the numbers from 2021, but in early 2022 they were handling 32 million HTTP requests \/ second on average and in early 2023 they had already grown to handling <a href=\"https:\/\/blog.cloudflare.com\/application-security-2023\/\">45 million HTTP requests \/ second on average<\/a>, over 40% more in one&nbsp;year!<\/p>\n\n\n\n<p>Additionally, <a href=\"https:\/\/kinsta.com\/cloudflare-market-share\/\">~80% of websites that use a CDN use Cloudflare<\/a>. They handle many of the larger sites on the web; BuiltWith shows that <a href=\"https:\/\/trends.builtwith.com\/cdn\/Cloudflare\">Cloudflare is used by ~32% of the Top 1M websites<\/a>. That\u2019s a significant sample size and likely the largest sample that exists.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><a id=\"post-172884-_qxe029zc1fyy\"><\/a>How much do SEO tools&nbsp;crawl?<\/h4>\n\n\n\n<p>Some of the SEO tools share the number of pages they crawl on their websites. The only one in the chart below that doesn\u2019t have a publicly published crawl rate is AhrefsSiteAudit bot, but I asked our team to pull the info for this. Let me put the rankings in perspective with actual and claimed crawl&nbsp;rates.<\/p>\n\n\n\n<table id=\"tablepress-300\" class=\"tablepress tablepress-id-300 tablepress-responsive tablepress-ahrefs-width-720px\">\n<thead>\n<tr class=\"row-1 odd\">\n\t<th class=\"column-1\">Ranking<\/th><th class=\"column-2\">Bot<\/th><th class=\"column-3\">Crawl Rate<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr class=\"row-2 even\">\n\t<td class=\"column-1\">7<\/td><td class=\"column-2\">Ahrefsbot<\/td><td class=\"column-3\">7B+ \/&nbsp;day<\/td>\n<\/tr>\n<tr class=\"row-3 odd\">\n\t<td class=\"column-1\">27<\/td><td class=\"column-2\">DataForSEO Bot<\/td><td class=\"column-3\">2B \/&nbsp;day<\/td>\n<\/tr>\n<tr class=\"row-4 even\">\n\t<td class=\"column-1\">29<\/td><td class=\"column-2\">AhrefsSiteAudit<\/td><td class=\"column-3\">600M - 700M \/&nbsp;day<\/td>\n<\/tr>\n<tr class=\"row-5 odd\">\n\t<td class=\"column-1\">35<\/td><td class=\"column-2\">Botify<\/td><td class=\"column-3\">143.3M \/&nbsp;day<\/td>\n<\/tr>\n<tr class=\"row-6 even\">\n\t<td class=\"column-1\">40<\/td><td class=\"column-2\">Semrushbot<\/td><td class=\"column-3\">25B \/ day* claimed<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n\n\n\n<p>The math isn\u2019t mathing. How can Semrush claim they\u2019re crawling multiple times as fast as these others, but their ranking is lower? Cloudflare doesn\u2019t cover the entire web, but it\u2019s a large chunk of the web and a more than representative sample size.<\/p>\n\n\n\n<p>When they originally made this 25B claim, I believe they were closer to 90th on Cloudflare Radar, near the bottom of the list at the time. Semrush hasn\u2019t updated this number since then, and I recall a period of time where they were in the 60s-70s on Cloudflare Radar as well. They do seem to be getting faster, but their claimed numbers still don\u2019t add&nbsp;up.<\/p>\n\n\n\n<p>I don\u2019t hear SEOs raving about Moz or Sistrix having the best link data, but they are 21st and 36th on the list respectively. Both are higher than Semrush.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><a id=\"post-172884-_71je63gu6ujx\"><\/a>Possible explanations of differences<\/h4>\n\n\n\n<p>Semrush may be conflating the term pages with links, which is actually mentioned in some of their documentation. I don\u2019t want to link to it, but you can find it with this quote: \u201cDaily, our bot crawls over 25 billion links\u201d. But links are not the same thing as pages and there can be hundreds of links on a single page.<\/p>\n\n\n\n<p>It\u2019s also possible they\u2019re crawling a portion of the web that\u2019s just more spammy and isn\u2019t reflected in the data from either of the sources I looked at. Some of the numbers indicate this may be the&nbsp;case.<\/p>\n\n\n\n<div class=\"post-nav-link clearfix\" id=\"section1\"><a class=\"subhead-anchor\" data-tip=\"tooltip__copielink\" rel=\"#section1\"><svg width=\"19\" height=\"19\" viewBox=\"0 0 14 14\" style><g fill=\"none\" fill-rule=\"evenodd\"><path d=\"M0 0h14v14H0z\" \/><path d=\"M7.45 9.887l-1.62 1.621c-.92.92-2.418.92-3.338 0a2.364 2.364 0 0 1 0-3.339l1.62-1.62-1.273-1.272-1.62 1.62a4.161 4.161 0 1 0 5.885 5.884l1.62-1.62L7.45 9.886zM5.527 5.135L7.17 3.492c.92-.92 2.418-.92 3.339 0 .92.92.92 2.418 0 3.339L8.866 8.473l1.272 1.273 1.644-1.643A4.161 4.161 0 1 0 5.897 2.22L4.254 3.863l1.272 1.272zm-.66 3.998a.749.749 0 0 1 0-1.06l2.208-2.206a.749.749 0 1 1 1.06 1.06L5.928 9.133a.75.75 0 0 1-1.061 0z\" style \/><\/g><\/svg><\/a><div class=\"link-text\" data-anchor=\"3rd party validation\" data-section=\"rd-party-validation\">\n\n\n\n<h2 class=\"wp-block-heading\"><a id=\"post-172884-_k73uh0l9ksrn\"><\/a>3rd party validation<\/h2>\n\n\n\n<\/div><\/div>\n\n\n\n<p>Y\u2019all shouldn\u2019t trust studies done by a specific vendor when it compares them to others, even this one. I try to be as fair as I can be and follow the data, but since I work at Ahrefs you can hardly consider me unbiased. Go look at the data yourselves and run your own&nbsp;tests.<\/p>\n\n\n\n<p>There are some folks in the SEO community who try to do these tests every once in a while. The last major <a href=\"https:\/\/www.searchlogistics.com\/learn\/reviews\/best-backlink-checker\/\">3rd party study<\/a> was run by <a href=\"https:\/\/twitter.com\/MattWoodwardUK\">Matthew Woodward<\/a>, who initially declared Semrush the winner, but the conclusion was changed and Ahrefs was ultimately declared to be the rightful winner. What happened?<\/p>\n\n\n\n<p>The methodology chosen for the study heavily favored Semrush and was <a href=\"http:\/\/www.thegooglecache.com\/white-hat-seo\/semrush-ip-link-data-bizarre-misleading\/\">investigated<\/a> by a friend of mine, Russ Jones, may he rest in peace. Here\u2019s what Russ had to say about&nbsp;it:<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p>While services like Majestic and Ahrefs likely store a single canonical IP address per domain, SEMRush seems to store per link, which accounts for why there would be more IPs that referring domains in some cases. <strong>I do not think SEMRush is intentionally inflating their numbers, I think they are storing the data in a different way than competitors which results in a number that is higher and potentially misleading, but not due to ill intent.<\/strong><\/p>\n<\/blockquote>\n\n\n\n<p>The response from Matthew indicated that Semrush might have misled him in their favor. Here\u2019s that comment:<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"1359\" height=\"709\" class=\"wp-image-172889\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/comment-from-matthew-woodward-in-response-to-semru.jpg\" alt=\"Comment from Matthew Woodward in response to Semrush about the test.\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/comment-from-matthew-woodward-in-response-to-semru.jpg 1359w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/comment-from-matthew-woodward-in-response-to-semru-680x355.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/comment-from-matthew-woodward-in-response-to-semru-768x401.jpg 768w\" sizes=\"auto, (max-width: 1359px) 100vw, 1359px\"><\/figure>\n\n\n\n<p>In the end, Ahrefs won.<\/p>\n\n\n\n<div class=\"post-nav-link clearfix\" id=\"section1\"><a class=\"subhead-anchor\" data-tip=\"tooltip__copielink\" rel=\"#section1\"><svg width=\"19\" height=\"19\" viewBox=\"0 0 14 14\" style><g fill=\"none\" fill-rule=\"evenodd\"><path d=\"M0 0h14v14H0z\" \/><path d=\"M7.45 9.887l-1.62 1.621c-.92.92-2.418.92-3.338 0a2.364 2.364 0 0 1 0-3.339l1.62-1.62-1.273-1.272-1.62 1.62a4.161 4.161 0 1 0 5.885 5.884l1.62-1.62L7.45 9.886zM5.527 5.135L7.17 3.492c.92-.92 2.418-.92 3.339 0 .92.92.92 2.418 0 3.339L8.866 8.473l1.272 1.273 1.644-1.643A4.161 4.161 0 1 0 5.897 2.22L4.254 3.863l1.272 1.272zm-.66 3.998a.749.749 0 0 1 0-1.06l2.208-2.206a.749.749 0 1 1 1.06 1.06L5.928 9.133a.75.75 0 0 1-1.061 0z\" style \/><\/g><\/svg><\/a><div class=\"link-text\" data-anchor=\"Hardware\" data-section=\"hardware\">\n\n\n\n<h2 class=\"wp-block-heading\"><a id=\"post-172884-_3al223e15y9w\"><\/a>Hardware<\/h2>\n\n\n\n<\/div><\/div>\n\n\n\n<p>Check our current stats on our <a href=\"https:\/\/ahrefs.com\/big-data\">big data page<\/a>.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"1600\" height=\"859\" class=\"wp-image-172890\" src=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/hardware-listed-on-the-ahrefs-big-data-page.jpg\" alt=\"Hardware listed on the Ahrefs big data page\" srcset=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/hardware-listed-on-the-ahrefs-big-data-page.jpg 1600w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/hardware-listed-on-the-ahrefs-big-data-page-680x365.jpg 680w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/hardware-listed-on-the-ahrefs-big-data-page-768x412.jpg 768w, https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/hardware-listed-on-the-ahrefs-big-data-page-1536x825.jpg 1536w\" sizes=\"auto, (max-width: 1600px) 100vw, 1600px\"><\/figure>\n\n\n\n<p>While Semrush doesn\u2019t provide current hardware stats, they did provide some in the past when they made changes to their link&nbsp;index.<\/p>\n\n\n\n<p>In June 2019, they made an announcement that claimed they had the biggest index. The test from Matthew Woodward that I talked about happened after this test, and as you saw, Ahrefs won&nbsp;that.<\/p>\n\n\n\n<p>In June 2021, they made another announcement about their link index that claimed they were the biggest, fastest, and&nbsp;best.<\/p>\n\n\n\n<p>These are some stats they released at the&nbsp;time:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>500 servers<\/li>\n\n\n\n<li>16,128 cpu&nbsp;cores<\/li>\n\n\n\n<li>245 TB of memory<\/li>\n\n\n\n<li>13.9 PB of storage<\/li>\n\n\n\n<li>25B+ pages \/&nbsp;day<\/li>\n\n\n\n<li>43.8T links<\/li>\n<\/ul>\n\n\n\n<p>The release said they increased storage, but their previous release said they had 4000 PBs of storage. They said the storage was 4x, so I guess the previous number was supposed to be 4000 TBs and not 4000 PBs, and they just got mixed up on the terminology.<\/p>\n\n\n\n<p>I checked our numbers at the time, and this is how we matched up:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>2400 servers (~5x greater)<\/li>\n\n\n\n<li>200,000 cpu cores (~12.5x greater)<\/li>\n\n\n\n<li>900 TB of memory (~4x greater)<\/li>\n\n\n\n<li>120 PB of storage (~9x greater)<\/li>\n\n\n\n<li>7B pages \/ day (~3.5x less???)<\/li>\n\n\n\n<li>2.8T live links (I\u2019m not sure the total size, but to this day it\u2019s not as big as the number they claimed)<\/li>\n<\/ul>\n\n\n\n<p>They were claiming more links and faster crawling with much less storage and hardware. Granted, we don\u2019t know the details of the hardware, but we don\u2019t run on dated&nbsp;tech.<\/p>\n\n\n\n<p>They claimed to store more links than we have even now and in less space than we add to our system each month. It really doesn\u2019t make&nbsp;sense.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><a id=\"post-172884-_ygrexh41ot4f\"><\/a>Final thoughts<\/h2>\n\n\n\n<p>Don\u2019t blindly trust the numbers on the dashboards or the general numbers because they may represent completely different things. While there\u2019s no perfect way to compare the data between different tools, you can run many of the checks I showed to try to compare similar things and clean up the data. If something looks off, ask the tool vendors for an explanation.<\/p>\n\n\n\n<p>If there ever comes a time when we stop winning on things like tech and crawl speed, go ahead and switch to another tool and stop paying us. But until that time, I\u2019d be highly skeptical of any claims by other&nbsp;tools.<\/p>\n\n\n\n<p>If you have questions, <a href=\"https:\/\/twitter.com\/patrickstox\">message me on X<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>For Ahrefs, we choose to store about 340B pages in our index as of December 2023. At a certain point, the quality of the web becomes bad. There are lots of spam and junk pages that just add noise to<span class=\"ellipsis\">\u2026<\/span><\/p>\n<div class=\"read-more\">Read more \u203a<\/div>\n<p><!-- end of .read-more --><\/p>\n","protected":false},"author":150,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"wp_typography_post_enhancements_disabled":false,"footnotes":""},"categories":[414],"tags":[462],"coauthors":[377],"class_list":["post-172884","post","type-post","status-publish","format-standard","hentry","category-data-studies","tag-blog","odd"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>You Can&#039;t Compare Backlink Counts in SEO Tools: Here&#039;s Why<\/title>\n<meta name=\"description\" content=\"Don\u2019t blindly trust the numbers on the dashboards or the general numbers because they may represent completely different things.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/ahrefs.com\/blog\/link-index-comparison\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"You Can&#039;t Compare Backlink Counts in SEO Tools: Here&#039;s Why\" \/>\n<meta property=\"og:description\" content=\"Stop blindly trusting numbers.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/ahrefs.com\/blog\/link-index-comparison\/\" \/>\n<meta property=\"og:site_name\" content=\"SEO Blog by Ahrefs\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/Ahrefs\/\" \/>\n<meta property=\"article:author\" content=\"patrickstox\" \/>\n<meta property=\"article:published_time\" content=\"2024-03-13T17:14:56+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-06-23T23:18:33+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/semrush-counting-links-on-404-pages.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1494\" \/>\n\t<meta property=\"og:image:height\" content=\"1254\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Patrick Stox\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@patrickstox\" \/>\n<meta name=\"twitter:site\" content=\"@ahrefs\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/link-index-comparison\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/link-index-comparison\\\/\"},\"author\":{\"name\":\"Patrick Stox\",\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/#\\\/schema\\\/person\\\/14bf754248f3c561786477e4e5fd2067\"},\"headline\":\"You Can\u2019t Compare Backlink Counts in SEO Tools: Here\u2019s Why\",\"datePublished\":\"2024-03-13T17:14:56+00:00\",\"dateModified\":\"2025-06-23T23:18:33+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/link-index-comparison\\\/\"},\"wordCount\":3765,\"publisher\":{\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/link-index-comparison\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/wp-content\\\/uploads\\\/2024\\\/03\\\/you-can8217t-compare-backlink-counts-in-by-patrick-stox-data-studies.jpg\",\"keywords\":[\"blog\"],\"articleSection\":[\"Data &amp; Studies\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/link-index-comparison\\\/\",\"url\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/link-index-comparison\\\/\",\"name\":\"You Can't Compare Backlink Counts in SEO Tools: Here's Why\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/link-index-comparison\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/link-index-comparison\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/wp-content\\\/uploads\\\/2024\\\/03\\\/semrush-counting-links-on-404-pages.jpg\",\"datePublished\":\"2024-03-13T17:14:56+00:00\",\"dateModified\":\"2025-06-23T23:18:33+00:00\",\"description\":\"Don\u2019t blindly trust the numbers on the dashboards or the general numbers because they may represent completely different things.\",\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/ahrefs.com\\\/blog\\\/link-index-comparison\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/link-index-comparison\\\/#primaryimage\",\"url\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/wp-content\\\/uploads\\\/2024\\\/03\\\/semrush-counting-links-on-404-pages.jpg\",\"contentUrl\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/wp-content\\\/uploads\\\/2024\\\/03\\\/semrush-counting-links-on-404-pages.jpg\",\"width\":1494,\"height\":1254,\"caption\":\"Semrush counting links on 404 pages\"},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/#website\",\"url\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/\",\"name\":\"SEO Blog by Ahrefs\",\"description\":\"Link Building Strategies &amp; SEO Tips\",\"publisher\":{\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/#organization\",\"name\":\"Ahrefs\",\"url\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/ahrefs-logo.png\",\"contentUrl\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/ahrefs-logo.png\",\"width\":2048,\"height\":768,\"caption\":\"Ahrefs\"},\"image\":{\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/Ahrefs\\\/\",\"https:\\\/\\\/x.com\\\/ahrefs\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/ahrefs\\\/\",\"https:\\\/\\\/www.youtube.com\\\/c\\\/ahrefscom\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/#\\\/schema\\\/person\\\/14bf754248f3c561786477e4e5fd2067\",\"name\":\"Patrick Stox\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/wp-content\\\/uploads\\\/2019\\\/11\\\/Screenshot-2019-11-06-at-00.57.29.pngbade1fd182f70b6825c334271c12533e\",\"url\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/wp-content\\\/uploads\\\/2019\\\/11\\\/Screenshot-2019-11-06-at-00.57.29.png\",\"contentUrl\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/wp-content\\\/uploads\\\/2019\\\/11\\\/Screenshot-2019-11-06-at-00.57.29.png\",\"caption\":\"Patrick Stox\"},\"description\":\"Patrick Stox is a Product Advisor, Technical SEO, &amp; Brand Ambassador at Ahrefs. He was the lead author for the SEO chapter of the 2021 Web Almanac and a reviewer for the 2022 SEO chapter. He also co-wrote the SEO Book For Beginners by Ahrefs and was the Technical Review Editor for The Art of SEO 4th Edition. He\u2019s an organizer for the Triangle SEO Meetup, the Tech SEO Connect conference, he runs a Technical SEO Slack group, and is a moderator for \\\/r\\\/TechSEO on Reddit.\",\"sameAs\":[\"https:\\\/\\\/patrickstox.com\\\/\",\"patrickstox\",\"https:\\\/\\\/x.com\\\/patrickstox\"],\"url\":\"https:\\\/\\\/ahrefs.com\\\/blog\\\/author\\\/patrick-stox\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"You Can't Compare Backlink Counts in SEO Tools: Here's Why","description":"Don\u2019t blindly trust the numbers on the dashboards or the general numbers because they may represent completely different things.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/ahrefs.com\/blog\/link-index-comparison\/","og_locale":"en_US","og_type":"article","og_title":"You Can't Compare Backlink Counts in SEO Tools: Here's Why","og_description":"Stop blindly trusting numbers.","og_url":"https:\/\/ahrefs.com\/blog\/link-index-comparison\/","og_site_name":"SEO Blog by Ahrefs","article_publisher":"https:\/\/www.facebook.com\/Ahrefs\/","article_author":"patrickstox","article_published_time":"2024-03-13T17:14:56+00:00","article_modified_time":"2025-06-23T23:18:33+00:00","og_image":[{"width":1494,"height":1254,"url":"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/semrush-counting-links-on-404-pages.jpg","type":"image\/jpeg"}],"author":"Patrick Stox","twitter_card":"summary_large_image","twitter_creator":"@patrickstox","twitter_site":"@ahrefs","schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/ahrefs.com\/blog\/link-index-comparison\/#article","isPartOf":{"@id":"https:\/\/ahrefs.com\/blog\/link-index-comparison\/"},"author":{"name":"Patrick Stox","@id":"https:\/\/ahrefs.com\/blog\/#\/schema\/person\/14bf754248f3c561786477e4e5fd2067"},"headline":"You Can\u2019t Compare Backlink Counts in SEO Tools: Here\u2019s Why","datePublished":"2024-03-13T17:14:56+00:00","dateModified":"2025-06-23T23:18:33+00:00","mainEntityOfPage":{"@id":"https:\/\/ahrefs.com\/blog\/link-index-comparison\/"},"wordCount":3765,"publisher":{"@id":"https:\/\/ahrefs.com\/blog\/#organization"},"image":{"@id":"https:\/\/ahrefs.com\/blog\/link-index-comparison\/#primaryimage"},"thumbnailUrl":"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/you-can8217t-compare-backlink-counts-in-by-patrick-stox-data-studies.jpg","keywords":["blog"],"articleSection":["Data &amp; Studies"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/ahrefs.com\/blog\/link-index-comparison\/","url":"https:\/\/ahrefs.com\/blog\/link-index-comparison\/","name":"You Can't Compare Backlink Counts in SEO Tools: Here's Why","isPartOf":{"@id":"https:\/\/ahrefs.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/ahrefs.com\/blog\/link-index-comparison\/#primaryimage"},"image":{"@id":"https:\/\/ahrefs.com\/blog\/link-index-comparison\/#primaryimage"},"thumbnailUrl":"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/semrush-counting-links-on-404-pages.jpg","datePublished":"2024-03-13T17:14:56+00:00","dateModified":"2025-06-23T23:18:33+00:00","description":"Don\u2019t blindly trust the numbers on the dashboards or the general numbers because they may represent completely different things.","inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/ahrefs.com\/blog\/link-index-comparison\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/ahrefs.com\/blog\/link-index-comparison\/#primaryimage","url":"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/semrush-counting-links-on-404-pages.jpg","contentUrl":"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2024\/03\/semrush-counting-links-on-404-pages.jpg","width":1494,"height":1254,"caption":"Semrush counting links on 404 pages"},{"@type":"WebSite","@id":"https:\/\/ahrefs.com\/blog\/#website","url":"https:\/\/ahrefs.com\/blog\/","name":"SEO Blog by Ahrefs","description":"Link Building Strategies &amp; SEO Tips","publisher":{"@id":"https:\/\/ahrefs.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/ahrefs.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/ahrefs.com\/blog\/#organization","name":"Ahrefs","url":"https:\/\/ahrefs.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/ahrefs.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/06\/ahrefs-logo.png","contentUrl":"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2023\/06\/ahrefs-logo.png","width":2048,"height":768,"caption":"Ahrefs"},"image":{"@id":"https:\/\/ahrefs.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/Ahrefs\/","https:\/\/x.com\/ahrefs","https:\/\/www.linkedin.com\/company\/ahrefs\/","https:\/\/www.youtube.com\/c\/ahrefscom"]},{"@type":"Person","@id":"https:\/\/ahrefs.com\/blog\/#\/schema\/person\/14bf754248f3c561786477e4e5fd2067","name":"Patrick Stox","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2019\/11\/Screenshot-2019-11-06-at-00.57.29.pngbade1fd182f70b6825c334271c12533e","url":"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2019\/11\/Screenshot-2019-11-06-at-00.57.29.png","contentUrl":"https:\/\/ahrefs.com\/blog\/wp-content\/uploads\/2019\/11\/Screenshot-2019-11-06-at-00.57.29.png","caption":"Patrick Stox"},"description":"Patrick Stox is a Product Advisor, Technical SEO, &amp; Brand Ambassador at Ahrefs. He was the lead author for the SEO chapter of the 2021 Web Almanac and a reviewer for the 2022 SEO chapter. He also co-wrote the SEO Book For Beginners by Ahrefs and was the Technical Review Editor for The Art of SEO 4th Edition. He\u2019s an organizer for the Triangle SEO Meetup, the Tech SEO Connect conference, he runs a Technical SEO Slack group, and is a moderator for \/r\/TechSEO on Reddit.","sameAs":["https:\/\/patrickstox.com\/","patrickstox","https:\/\/x.com\/patrickstox"],"url":"https:\/\/ahrefs.com\/blog\/author\/patrick-stox\/"}]}},"_links":{"self":[{"href":"https:\/\/ahrefs.com\/blog\/wp-json\/wp\/v2\/posts\/172884","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ahrefs.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ahrefs.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ahrefs.com\/blog\/wp-json\/wp\/v2\/users\/150"}],"replies":[{"embeddable":true,"href":"https:\/\/ahrefs.com\/blog\/wp-json\/wp\/v2\/comments?post=172884"}],"version-history":[{"count":0,"href":"https:\/\/ahrefs.com\/blog\/wp-json\/wp\/v2\/posts\/172884\/revisions"}],"wp:attachment":[{"href":"https:\/\/ahrefs.com\/blog\/wp-json\/wp\/v2\/media?parent=172884"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ahrefs.com\/blog\/wp-json\/wp\/v2\/categories?post=172884"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ahrefs.com\/blog\/wp-json\/wp\/v2\/tags?post=172884"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/ahrefs.com\/blog\/wp-json\/wp\/v2\/coauthors?post=172884"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}