How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (2023)

It can be discouraging to see your website’s decline in search rankings. When your pages are no longer crawled by Google, these lower rankings can contribute to fewer visitors and conversions.

The “Indexed, though blocked by robots.txt” error can signify a problem with search engine crawling on your site. When this happens, Google has indexed a page that it cannot crawl. Fortunately, you can edit your robots.txtfile to specify which pages should or should not be indexed.

In this post, we’ll explain the “Indexed, though blocked by robots.txt” error and how to test your website for this problem. Then, we’ll show you two different methods of fixing it. Let’s get started!

Check Out Our Video Guide to Fixing the Indexed Though Blocked by robots.txt Error

What Is the “Indexed, Though Blocked by robots.txt” Error?

As a website owner, Google Search Consolecan help you analyze your site’s performancein many vital areas. This tool can monitor page speed, security, and “crawlability” so that you can optimize your online presence:

How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (1)

For example, Search Console’s Index Coverage report can help you improve your site’s Search Engine Optimization (SEO). It will analyze how Google indexes your online content, returning information about common errors, such asan “Indexed, though blocked by robots.txt” warning:

How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (2)

To understand this error, let’s first discuss the robots.txtfile. Essentially, it informs search engine crawlers which of your website files should or should not be indexed. With a well-structured robots.txtfile, you can ensure that only important web pages are crawled.

If you’ve received an “Indexed, though blocked by robots.txt” warning, Google crawlers have found the page but notice that it is blocked in your robots.txtfile. When this happens, Google isn’t sure whether you want that page indexed.

As a result, this page can appear in search results, but it won’t display a description. It will also exclude images, videos, PDFs, and non-HTML files. Therefore, you’ll need to update your robots.txtfile if you want to display this information.

Potential Problems in Page Indexing

You can intentionally add directives to your robots.txtfile that block pages from crawlers. However, these directives may not completely remove the pages from Google. If an external website links to the page, it can cause an “Indexed, though blocked by robots.txt” error.

Google (and other search engines) need to index your pages before they can accurately rank them. To ensure that only relevant content appears in the search results, it’s crucial to understand how this process works.

(Video) How To Fix the Indexed Though Blocked by robots.txt Error

Although certain pages should be indexed, they may not be. This could be due to a few different reasons:

  • A directive in the robots.txtfile that prevents indexing
  • Broken linksor redirect chains
  • Canonical tags in the HTML header

On the other hand, some web pages should not be indexed. They may end up accidentally indexed because of these factors:

  • Incorrect noindex directives
  • External links from other sites
  • Old URLs in the Google index
  • No robots.txt file

If too many of your pages are indexed, your server may become overwhelmed by Google’s crawler. Plus, Google could waste time indexing irrelevant pages on your website. Therefore, you’ll need to create and edit your robots.txtfile correctly.

Finding the Source of the “Indexed, Though Blocked by robots.txt” Error

One effective way to identify problems in page indexing is to sign in to Google Search Console. After you verify site ownership, you’ll be able to access reports about your website’s performance.

In the Indexsection, click on the Valid with warningstab. This will pull up a list of your indexing errors, including any “Indexed, though blocked by robots.txt” warnings. If you don’t see any, your website likely isn’t experiencing this problem.

Alternatively, you can use Google’s robots.txt tester. With this tool, you can scan your robots.txtfile to look for syntax warnings and other errors:

How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (3)

At the bottom of the page, enter a specific URL to see if it’s blocked. You’ll need to choose a user-agent from the dropdown menu and select Test:

How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (4)

You can also navigate to domain.com/robots.txt. If you already have a robots.txt file, this will allow you to view it:

How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (5)

Next, look for disallow statements. Site administrators can add these statements to instruct search crawlers on how to access specific files or pages.

If the disallow statement blocks all search engines, it might look like this:

Disallow: /

It may also block a specific user-agent:

(Video) How to solve indexed though blocked by robots.txt

User-agent: *Disallow: /

With any of these tools, you’ll be able to identify any issues with your page indexing. Then, you’ll need to take action to update your robots.txtfile.

How to Fix the “Indexed, Though Blocked by robots.txt” Error

Now that you know more about therobots.txtfile and how it can prevent page indexing, it’s time to fix the “Indexed, though blocked by robots.txt” error. However, make sure to evaluate first whether the blocked page needs to be indexed before using these solutions.

Method 1: Edit robots.txt Directly

If you have a WordPress website, you’ll probably have a virtual robots.txtfile. You can visit it by searching for domain.com/robots.txtin a web browser (replacing domain.comwith your domain name). However, this virtual file won’t enable you to make edits.

To start editing robots.txt, you’ll need to create a file on your server. Firstly, choose a text editorand create a new file. Be sure to name it “robots.txt”:

How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (6)

Then, you’ll need to connect to an SFTP client.If you’re using a Kinsta hosting account, sign in to MyKinstaand navigate to Sites > Info:

How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (7)

You’ll find your username, password, host, and port number here. You can then download an SFTP client such as FileZilla. Enter your SFTP login credentials and click on Quickconnect:

How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (8)

Lastly, upload the robots.txtfile to your root directory (for WordPress sites, it should be called public_html). Then, you can open the file and make the necessary changes.

You can use allow and disallow statements to customize the indexing of your WordPress site. For example, you may want a certain file to be crawled without indexing the entire folder. In this case, you can add this code:

User-agent: *Disallow: /wp-admin/Allow: /wp-admin/admin-ajax.php

Remember to target the page causing the “Indexed, though blocked by robots.txt” error during this process. Depending on your goal, you can specify whether Google should or should not crawl the page.

When you’re finished, save your changes. Then, go back to Google Search Console to see if this method resolved the error.

Method 2: Use an SEO Plugin

If you have an SEO pluginactivated, you won’t have to create an entirely new robots.txt file. In many cases, the SEO tool will build one for you. Plus, it may also provide ways to edit the file without leaving the WordPress dashboard.

(Video) Fix Indexed, Though Blocked By Robots.txt Google Search Console

Yoast SEO

One of the most popular SEO plugins is Yoast SEO. It can provide a detailed on-page SEO analysis, along with additional tools to customize your search engine indexing.

To start editing your robots.txt file, go to Yoast SEO > Tools in your WordPress dashboard. From the list of built-in tools, select the File editor:

How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (9)

Yoast SEO will not automatically create a robots.txtfile. If you don’t already have one, click on Create robots.txt file:

How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (10)

This will open a text editor with the content of your new robots.txtfile. Similar to the first method, you can add allow statements to pages you want to be indexed. Alternatively, use disallow statements for URLs to avoid indexing:

How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (11)

After making your changes, save the file. Yoast SEO will alert you when you’ve updated the robots.txtfile.

Rank Math

Rank Mathis another freemium plugin that includes a robots.txteditor. After activating the tool on your WordPress site, go to Rank Math > General Settings > Edit robots.txt:

How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (12)

In the code editor, you’ll see some default rules, including your sitemap. To update its settings, you can paste or delete code as necessary.

During this editing process, there are a few rules you should follow:

  • Use one or more groups, with each group containing multiple rules.
  • Start each group with a user-agent and follow with specific directories or files.
  • Assume that any web page allows indexing unless it has a disallowrule.

Keep in mind that this method is only possible when you don’t already have a robots.txt file in your root directory. If you do, you’ll have to edit the robot.txtfile directly using an SFTP client. Alternatively, you can delete this pre-existing file and use the Rank Math editor instead.

Once you disallow a page in robots.txt, you should also add a noindex directive. It will keep the page privatefrom Google searches. To do this, navigate to Rank Math > Titles & Meta > Posts:

How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (13)

Scroll down to Post Robots Metaand enable it. Then, selectNo Index:

(Video) (100% fix) Indexed, Though Blocked by Robots.txt Problem Fix in Blogger | 2020 | Search Console

How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (14)

Finally, save your changes. In Google Search Console, find the “Indexed, though blocked by robots.txt” warning and click onValidate Fix. This will enable Google to recrawl the given URLs and resolve the error.

Squirrly SEO

Using the Squirrly SEOplugin, you can similarly edit robots.txt. To get started, click on Squirrly SEO > SEO Configuration. This will open the Tweaks and Sitemapsettings:

How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (15)

On the left-hand side, select the Robots Filetab. Then, you’ll see a robots.txtfile editor that looks similar to other SEO plugins:

How To Fix the Indexed Though Blocked by robots.txt Error (2 Methods) (16)

Using the text editor, you can add allow or disallow statements to customize the robots.txtfile. Continue to add as many rules as you need. When you’re happy with how this file looks, select Save Settings.

Additionally, you can add noindex rules to certain post types. To do this, you’ll simply need to disable the Let Google Index Itsetting in the Automationtab. By default, SEO Squirrly will leave this enabled.

Summary

Typically, Google will find your web pages and index them in its search results. However, a poorly configured robots.txtfile can confuse search engines about whether to ignore this page during crawling. In this case, you’ll need to clarify crawling instructions to continue maximizing SEO on your website.

You can edit robots.txt directly with an SFTP client such as FileZilla. Alternatively, many SEO plugins, including Yoast, Rank Math, and Squirrly SEO, include robots.txteditors within their interfaces. Using any of these tools, you’ll be able to add allow and disallow statements to help search engines index your content correctly.

To help your website rise to the top of search results, we recommend choosing an SEO-optimized web host. At Kinsta, our managed WordPress hostingplans include SEO tools such as uptime monitoring, SSL certificates, and redirect management. Check out our plans today!

Get all your applications, databases and WordPress sites online and under one roof. Our feature-packed, high-performance cloud platform includes:

  • Easy setup and management in the MyKinsta dashboard
  • 24/7 expert support
  • The best Google Cloud Platform hardware and network, powered by Kubernetes for maximum scalability
  • An enterprise-level Cloudflare integration for speed and security
  • Global audience reach with up to 35 data centers and 275 PoPs worldwide

Get started with a free trial of our Application Hosting or Database Hosting. Explore our plans or talk to sales to find your best fit.

(Video) blocked by robots.txt blogger | indexed though blocked by robots.txt |#blogger @YouTubeViewers

Videos

1. Google Search Console | Indexed, though blocked by robots.txt
(More Conversions)
2. Ways to Fix Indexed, though blocked by robots.txt error in Google Search Console
(Saket Gupta (Google Product Expert))
3. Fix Blocked by Robots.txt Error Google Search Console | Blocked by Robots.txt Error 2022
(Techniques Hub)
4. how to fix indexed though blocked by robots.txt problem fix in blogger
(HINDI TECHNIQUES)
5. Modify Your Robots.txt On Shopify
(Astral Web Inc.)
6. Indexed though blocked by robots.txt blogger 2023 error fix in blogger | enable Robots.txt file Fix
(Technical KhanSab)
Top Articles
Latest Posts
Article information

Author: Virgilio Hermann JD

Last Updated: 04/26/2023

Views: 6394

Rating: 4 / 5 (41 voted)

Reviews: 88% of readers found this page helpful

Author information

Name: Virgilio Hermann JD

Birthday: 1997-12-21

Address: 6946 Schoen Cove, Sipesshire, MO 55944

Phone: +3763365785260

Job: Accounting Engineer

Hobby: Web surfing, Rafting, Dowsing, Stand-up comedy, Ghost hunting, Swimming, Amateur radio

Introduction: My name is Virgilio Hermann JD, I am a fine, gifted, beautiful, encouraging, kind, talented, zealous person who loves writing and wants to share my knowledge and understanding with you.