Index coverage refers to the set of pages from a website that Google has successfully crawled, processed, and added to its search index, as reported in the Google Search Console Index Coverage (now Page Indexing) report.
Quick Answer
Index coverage refers to the set of pages from a website that Google has successfully crawled, processed, and added to its search index, as reported in the Google Search Console Index Coverage (now Page Indexing) report.
Sites with 10K+ pages routinely lose 20-40% of commercial pages to undetected indexing issues.
"Crawled but not currently indexed" is often a thin-content signal — improve or consolidate the page.
Not all non-indexed URLs are problems — correctly excluded pages (thank-you, account, params) should remain unindexed.
Key Takeaways
Sites with 10K+ pages routinely lose 20-40% of commercial pages to undetected indexing issues.
"Crawled but not currently indexed" is often a thin-content signal — improve or consolidate the page.
Not all non-indexed URLs are problems — correctly excluded pages (thank-you, account, params) should remain unindexed.
How Index Coverage Works
Google's Page Indexing report in Search Console categorizes all discovered URLs into four states: Indexed (in Google's index and eligible to rank), Not Indexed (discovered but excluded), Crawled but not currently indexed (seen but not indexed — typically low-value signal), and Discovered but not yet crawled (in the queue). The report shows specific reasons for non-indexing including "Duplicate without user-selected canonical," "Blocked by robots.txt," "noindex tag," "Soft 404," "Redirect error," and "Crawl anomaly." Each reason requires a distinct remediation approach.
Why Index Coverage Matters for B2B Marketing
For B2B sites, index coverage problems directly limit organic revenue potential — a page that isn't indexed cannot rank or generate leads. Common culprits include: faceted navigation creating thousands of near-duplicate filter URLs, staging environments accidentally crawled via leaked links, misconfigured canonical tags pointing all variations to a non-existent URL, and CMS-generated pagination pages with thin content triggering crawl budget exhaustion on large enterprise sites. Sites with 10,000+ pages routinely lose 20-40% of commercial pages to indexing issues that go undetected for months.
Index Coverage: Best Practices & Strategic Application
Best practices: monitor the Page Indexing report weekly for status changes, especially after site updates or migrations. Create a URL-level spreadsheet tracking index status of your top 500 commercial pages monthly. For "Crawled but not currently indexed" URLs, assess whether the content is genuinely unique and valuable — this status often indicates thin content that needs to be improved or consolidated. Use the URL Inspection tool to check individual page status, fetch rendering, and understand Google's crawl date.
Agency Perspective: Index Coverage in Practice
A critical agency mistake is treating all "Not Indexed" URLs as problems requiring fixes. Many non-indexed URLs are correctly excluded — thank-you pages, account pages, search result pages with parameters, and duplicate product filter pages should not be indexed. The priority is ensuring all high-value commercial and informational pages are indexed correctly. Build an indexing health dashboard in Looker Studio pulling from Search Console API to track indexed vs. total URLs over time for early anomaly detection.
Frequently Asked Questions: Index Coverage
Index coverage refers to the set of pages from a website that Google has successfully crawled, processed, and added to its search index, as reported in the Google Search Console Index Coverage (now Page Indexing) report.
For established sites with good crawl health, new pages are typically indexed within 1-7 days. New sites or pages with few internal links may take 2-6 weeks. Submitting URLs via the URL Inspection tool or XML sitemap submission in Search Console can accelerate indexing for important pages.
"Crawled but not currently indexed" indicates Google found the page but decided it doesn't provide enough unique value to add to the index. Common causes include thin content, near-duplicate content, or content that closely resembles what already exists on the site or across the web. Improving content depth and uniqueness is the primary fix.
Not if those pages are correctly excluded. However, if Google is not indexing pages you want ranked, it directly costs you organic traffic. Large volumes of low-quality non-indexed pages can also consume crawl budget that should be allocated to valuable pages — use robots.txt or noindex tags to actively exclude low-value pages.
MV3 Marketing helps B2B companies apply these strategies to drive measurable pipeline growth. Our team executes technical seo audit for technology, SaaS, and professional services companies.
ID used to identify users for 24 hours after last activity
24 hours
_gat
Used to monitor number of Google Analytics server requests when using Google Tag Manager
1 minute
_gac_
Contains information related to marketing campaigns of the user. These are shared with Google AdWords / Google Ads when the Google Ads and Google Analytics accounts are linked together.
90 days
__utma
ID used to identify users and sessions
2 years after last activity
__utmt
Used to monitor number of Google Analytics server requests
10 minutes
__utmb
Used to distinguish new sessions and visits. This cookie is set when the GA.js javascript library is loaded and there is no existing __utmb cookie. The cookie is updated every time data is sent to the Google Analytics server.
30 minutes after last activity
__utmc
Used only with old Urchin versions of Google Analytics and not with GA.js. Was used to distinguish between new sessions and visits at the end of a session.
End of session (browser)
__utmz
Contains information about the traffic source or campaign that directed user to the website. The cookie is set when the GA.js javascript is loaded and updated when data is sent to the Google Anaytics server
6 months after last activity
__utmv
Contains custom information set by the web developer via the _setCustomVar method in Google Analytics. This cookie is updated every time new data is sent to the Google Analytics server.
2 years after last activity
__utmx
Used to determine whether a user is included in an A / B or Multivariate test.
18 months
_ga
ID used to identify users
2 years
_gali
Used by Google Analytics to determine which links on a page are being clicked