The Google Indexing Coverage Report: Get your Web Pages into Google’s Index
After a search engine spider discovers a web page, it crawls and renders the web page’s content and, if allowed, adds the page to Google’s index. Google has the largest index of any search engine, estimated at 30 to 50 billion web pages, and that indexing power has been a key to the search engine’s success over the past two decades.
But indexing the internet is complex. Web pages are constantly being updated, changed, moved, or removed. Google wants to keep its index up-to-date, so it regularly recrawls the pages in its index to decide whether to keep them, remove them, or, when the content has changed, surface them for different sets of keywords.
As a result, understanding how Google’s indexing process works, particularly for our individual websites, is an important part of SEO. Google cannot rank your web pages if they are not indexed, so understanding which of your pages are indexed, and why or why not, is important to make sure the most valuable, high-quality, and high-converting pages on your website have the potential to show up in search engine results.
So how do you know whether or not Google has indexed the pages of your website? Enter the Google Indexing Coverage Report in your Google Search Console account.
Taking the time to check the Google indexing status for your site provides you with a comprehensive overview of how Google indexes your website’s pages. This article will outline how to access and understand your Google Indexing Coverage Report, list common indexing issues, and offer detailed suggestions for how to resolve them.
What is the Google Index Coverage Report?
The Google Index Coverage report is a summary of which pages on your website have or have not been indexed and why or why not.
It highlights pages that have been indexed successfully, pages with Google indexing issues, pages that Google has excluded, and pages that have warnings.
The report also includes important information such as the number of indexed pages, crawl issues, and sitemap status. By regularly monitoring the Index Coverage Report, website owners can quickly detect and resolve indexing issues that negatively impact their website’s visibility.
What Should I Use the Google Index Coverage Report For?
Here are a few key ways you can leverage the information you find when you check the indexing status for your site:
Identify indexing issues
When your website has indexing problems, it can hinder crawlers from properly scanning your pages. This can lead to your pages not appearing in search engine results pages (SERPs), thus limiting your website’s visibility.
The Google Index Coverage report offers explanations for why your web pages aren’t being indexed. The report groups pages by indexing issue, with a column listing the total number of pages on your website affected by each issue.
Uncover crawling patterns
As the owner of a website, understanding how Googlebot crawls and interacts with your website is essential for ensuring your site is being crawled efficiently. Google sets a limited crawl budget for each website, and if its crawlers encounter difficulties due to a poor or convoluted site structure, you are wasting that budget and delaying how long it takes to get your important pages indexed.
Evaluate page indexing status
This report will help you determine any potential issues and prioritize your optimization efforts. By reviewing the indexing status of each page, you can understand why some pages may not be appearing in the results. The reasons could range from domain name issues and technical hitches to issues related to content and backlinks.
The report classifies indexing status into four categories:
- Valid: Successful and eligible for search results
- Error: Critical issues that need attention
- Excluded: Pages intentionally excluded or blocked by robots.txt
- Valid with warnings: Pages indexed but with minor issues that may affect their visibility or performance
Monitor changes over time
The Google Indexing Coverage Report allows you to see how many of your website’s pages are scanned, and the reasons why specific pages might be returning errors. When you check Google indexing status, you can track improvements in your website and detect emerging Google indexing issues, such as crawl errors or duplicate content.
For example, if you notice a sudden drop in the number of indexed pages, it could indicate that there’s an issue with your website that needs to be addressed.
Validate fixes
The Google Index Coverage report also allows you to go through a validation process after you resolve any indexing issues.
After you resolve an indexing issue, click the “Validate Fix” button and Google will recheck the affected pages to confirm whether the issue has been resolved.
6 Common Google Indexing Issues and How to Resolve Them
Several common indexing issues can occur, leading to lower search rankings, decreased website traffic, and ultimately, loss of revenue. Fortunately, they aren’t impossible to resolve.
1. Crawl errors
Crawl errors can be a headache for any online business owner or digital marketer. They occur when Googlebot, Google’s web crawler, has difficulty accessing your website’s pages.
This can happen for numerous reasons, including:
- Server errors
- Excessive redirect chains
- Slow-loading pages
When a crawler encounters these issues, it may not be able to access all of your website’s content, leading to lower rankings and less organic traffic. Remember, Google is not going to wait around forever to crawl and render your content, so make sure your website is high-performing and loads fast for both users and Googlebot.
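If you want a quick, do-it-yourself spot check before digging into the report, the sketch below (not an official Google tool, and using hypothetical URLs you would swap for your own pages) uses Python’s requests library to flag server errors and long redirect chains:

```python
# A minimal sketch: spot-check a few URLs for server errors and long
# redirect chains. The URL list is hypothetical; use pages from your site.
import requests

urls_to_check = [
    "https://www.example.com/",
    "https://www.example.com/old-page",
]

for url in urls_to_check:
    try:
        response = requests.get(url, allow_redirects=True, timeout=10)
    except requests.RequestException as exc:
        print(f"{url}: request failed ({exc})")
        continue

    redirect_hops = len(response.history)  # each hop is one 3xx response
    status = response.status_code

    if status >= 500:
        print(f"{url}: server error ({status})")
    elif redirect_hops > 2:
        print(f"{url}: redirect chain of {redirect_hops} hops ending at {response.url}")
    else:
        print(f"{url}: OK ({status}, {redirect_hops} redirect(s))")
```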
2. Soft 404 errors
These errors occur when a page that should return a “404 Not Found” HTTP status code, indicating that the requested page does not exist, is incorrectly treated as a valid page.
This can happen if your website returns a standard 200 status code instead, which tells crawlers the page does exist. The result is confusing for search engines, which may index an empty or placeholder page, and for users, who expect a clear “404 Not Found” response.
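The fix is to make sure missing pages actually return a 404 status code, not just a “not found” message delivered with a 200 response. Here is a minimal sketch assuming a Flask application and a hypothetical slug lookup; your own stack and routing will differ:

```python
# A minimal sketch, assuming a Flask app (hypothetical routes and slugs).
from flask import Flask

app = Flask(__name__)

# Hypothetical set of published article slugs.
PUBLISHED_SLUGS = {"index-coverage-report", "technical-seo-basics"}

@app.route("/blog/<slug>")
def blog_post(slug):
    if slug not in PUBLISHED_SLUGS:
        # Serve the "not found" message WITH a 404 status code so crawlers
        # don't treat the missing URL as a valid page (a soft 404).
        return "<h1>Sorry, that article doesn't exist.</h1>", 404
    return f"<h1>{slug.replace('-', ' ').title()}</h1>"
```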
3. Duplicate content
Duplicate content is another common source of errors you might find in the Google Index Coverage report when you check the Google indexing status for your site.
When multiple pages on your website have similar or identical content and do not have proper canonical tags, it can confuse search engines and dilute the visibility of each page. Search engines aim to provide the best user experience, and showing multiple results with the same content would be confusing and frustrating for the user.
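One way to consolidate duplicates is to point the alternate versions at a single preferred URL, either with a link rel="canonical" tag in the page head or with an HTTP “Link” header. The sketch below, assuming a Flask app and hypothetical URLs, shows the header approach for a printer-friendly duplicate:

```python
# A minimal sketch, assuming a Flask app (hypothetical URLs): send an HTTP
# "Link" header pointing a duplicate URL to its canonical version.
from flask import Flask, make_response

app = Flask(__name__)

@app.route("/products/blue-widget/print")
def printable_product_page():
    # Hypothetical printer-friendly duplicate of the main product page.
    response = make_response("<h1>Blue Widget (printer-friendly)</h1>")
    # Point search engines at the preferred (canonical) version of this content.
    response.headers["Link"] = '<https://www.example.com/products/blue-widget>; rel="canonical"'
    return response
```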
If you’re a Search Atlas user, duplicate content and improper use of canonical tags will be flagged in your site audit report.
With detailed how-to-fix guides, you can resolve this issue quickly and make sure it does not prevent your content from showing up in search results.
4. Blocked resources
Blocked resources are files on your website that crawlers like Googlebot are prevented from accessing. These may include JavaScript and CSS files, which are essential for rendering a web page accurately. If web crawlers cannot access these files, they may struggle to interpret your site’s elements, leading to incomplete rendering and Google indexing issues for your site.
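You can spot-check whether specific assets are blocked for Googlebot using Python’s standard-library robots.txt parser; the domain and file paths below are hypothetical placeholders:

```python
# A minimal sketch using Python's standard library: check whether specific
# CSS/JS resources are blocked for Googlebot by robots.txt.
from urllib import robotparser

parser = robotparser.RobotFileParser()
parser.set_url("https://www.example.com/robots.txt")  # hypothetical domain
parser.read()

resources = [
    "https://www.example.com/assets/main.css",
    "https://www.example.com/assets/app.js",
]

for resource in resources:
    allowed = parser.can_fetch("Googlebot", resource)
    print(f"{resource}: {'allowed' if allowed else 'BLOCKED for Googlebot'}")
```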
5. Robots.txt and invalid directives
Not all of our web pages need to be in Google’s index, particularly content like “Thank you” or confirmation pages that are shown to users after a purchase or a submission form. Webmasters use robots.txt files and robot directives like “noindex” to tell Google which pages they should not include in their index.
However, issues often arise in how robots.txt files and page-level robot tags are implemented. For example, if a page carries a “noindex” tag but robots.txt blocks Google from crawling that page, Google never sees the tag, so the URL can still end up in the index even though you intended to exclude it.
These issues are also going to be identified in your site audit report if they are present on your website.
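To illustrate the page-level side, here is a minimal sketch, assuming a Flask app and a hypothetical “thank you” route, that keeps a page out of the index with an X-Robots-Tag response header (a meta robots tag in the HTML head works the same way):

```python
# A minimal sketch, assuming a Flask app (hypothetical route).
from flask import Flask, make_response

app = Flask(__name__)

@app.route("/thank-you")
def thank_you():
    # Hypothetical confirmation page that should stay out of Google's index.
    response = make_response("<h1>Thanks for your purchase!</h1>")
    # Apply noindex via a response header; the page must remain crawlable
    # (not disallowed in robots.txt) or Google never sees this directive.
    response.headers["X-Robots-Tag"] = "noindex"
    return response
```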
6. Sitemap errors
Issues with XML sitemaps can also lead to indexing problems. The sitemap acts as a roadmap for search bots, pointing them toward all the essential pages on your website. However, if your sitemap contains errors or is outdated, it can mislead search engines, leading to reduced visibility and lower search rankings.
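A simple sanity check is to fetch your sitemap, read the URLs it lists, and flag any that no longer return a 200 status. The sketch below uses a hypothetical sitemap URL and Python’s standard XML parser:

```python
# A minimal sketch (hypothetical sitemap URL): fetch an XML sitemap, parse
# its <loc> entries, and flag listed URLs that no longer return 200.
import requests
import xml.etree.ElementTree as ET

SITEMAP_URL = "https://www.example.com/sitemap.xml"
NAMESPACE = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

sitemap_xml = requests.get(SITEMAP_URL, timeout=10).text
root = ET.fromstring(sitemap_xml)

for loc in root.findall(".//sm:loc", NAMESPACE):
    url = loc.text.strip()
    status = requests.head(url, allow_redirects=True, timeout=10).status_code
    if status != 200:
        print(f"{url}: returned {status} -- update or remove it from the sitemap")
```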
Follow Our Tips for Better Indexing
Identifying these common issues within the Google Indexing Coverage Report and taking steps to resolve them is key to making sure the pages you want indexed actually get indexed, and quickly.
Remember, just because a web page is added to Google’s index doesn’t mean it is guaranteed to rank. Getting Google to index your web pages is just the first step in SEO, and it will take comprehensive SEO work to reach your target audience effectively.
If you need assistance with technical SEO issues like the above or with improving the quality of your content and backlinks, LinkGraph is here to help! Reach out and book a free strategy session with one of our SEO consultants.