Menu
πŸ€–
On-Page SEO Β· Indexation Control

The Noindex Tag
Nobody Remembers Setting

Someone ticks "noindex" on staging, ships it to production, and moves on. No error fires. No alert goes off. The page just quietly drops out of Google over the next few days. TechySEO checks both the HTML meta robots tag and the X-Robots-Tag HTTP header on every page, since either one can carry a directive the other doesn't show.

The Scariest SEO Bugs Are the Ones That Don't Throw Errors

A robots directive is one checkbox, one line in a config file, one CDN edge rule. It's also frighteningly easy to set by accident, especially the classic case: noindex gets ticked on a staging environment, the deploy goes to production with it still checked, and nobody notices because nothing actually breaks. The page just stops showing up in search over the following days, with no error log entry and no obvious signal pointing back to the cause.

Other directives fail more quietly still. Page-level nofollow blocks every outgoing link from passing equity, which wastes part of your internal link structure without anyone noticing the page is doing it. Noarchive hides the cached version. Nosnippet strips the description out of the search result entirely, so the page still ranks but the listing looks bare next to competitors, and click-through rate suffers for a reason that's invisible unless you go looking for it.

Checking only the HTML tag misses half the picture. A directive set through an X-Robots-Tag HTTP header, often by a CDN rule or server config nobody on the content team even knows exists, won't show up by viewing page source. Both sources need checking together.

🚫
A Staging Setting That Followed the Deploy
One leftover noindex tag and an important page disappears from Google within days, with nothing in any log to flag it.
πŸ”’
Page-Wide Nofollow
Every outgoing link on the page stops passing equity, silently, with no visible symptom on the page itself.
πŸ“¦
Noarchive and Nosnippet
The page still ranks, just with a worse-looking listing and a lower click-through rate than it should have.
βš™οΈ
A Directive Set in a CDN Rule, Not the CMS
An X-Robots-Tag header from edge config won't show up in page source, so a content team checking the HTML alone will see nothing wrong.

Both Places a Directive Can Hide

The HTML tag everyone checks, and the HTTP header almost nobody remembers to.

βœ…
Confirms Key Pages Are Actually Indexable
Homepage, product pages, landing pages, checked to confirm none of them are quietly excluded from search.
🚫
Every Noindex Tag, Listed
Filterable by URL pattern, so spotting the accidental ones among the intentional ones doesn't mean scrolling through the whole list by hand.
πŸ”’
Page-Level Nofollow
Flags pages where every outgoing link is blocked from passing equity, a setting that has zero visible symptoms on the page itself.
πŸ“¦
Noarchive and Nosnippet
Catches the directives that don't deindex a page, just make its search listing worse than it needs to be.
βš™οΈ
Stacked Directive Combinations
"noindex, noarchive, nosnippet" all on one page gets called out as a set, since combinations can interact in ways a single directive review would miss.
🌐
X-Robots-Tag HTTP Header
Reads directives set at the server or CDN level, which is invisible to anyone just viewing page source and is exactly where edge-config robots rules tend to live.

How the Robots Directive Audit Runs

1
Both Sources Get Read on Every Page
Meta robots from the HTML and X-Robots-Tag from the HTTP response header, including pages where the tag only exists after JavaScript renders.
2
Every Directive Gets Parsed and Labeled
Noindex, nofollow, noarchive, nosnippet, max-snippet, max-image-preview, and whatever combination of them shows up together.
3
Noindexed Pages and Real Conflicts Get Flagged
Any page set to noindex stands out immediately. So does a page where, say, the HTML says one thing and the header says another.
4
Review the Exact Tag, Then Verify the Fix
Each flagged page shows precisely which tag, from which source, is causing the problem. Remove it, recheck, and confirm Google can actually index the page again.

When Meta Robots Auditing Is Most Critical

Post-Migration
The First Check After Any Migration
Run this before anything else after a launch. It's the fastest way to catch a staging noindex tag that survived the move to production, while there's still time to fix it before Google reprocesses the pages and acts on it.
CMS Platforms
When an Editor Clicks the Wrong Checkbox
WordPress and similar platforms put a noindex toggle right in the editor UI, available to people who write content, not necessarily people thinking about indexation. Continuous monitoring catches the moment that checkbox gets ticked in production, instead of weeks later when traffic has already dropped.
eCommerce
Making Sure the Indexation Strategy Actually Got Implemented
Pagination, filter combinations, and internal search results all need a deliberate index or noindex call. Having the actual directive on every one of those pages in front of you is the only way to confirm the strategy you decided on is the one that's actually live.

Meta Robots Analysis β€” FAQs

My HTML meta tag says index, but the X-Robots-Tag header says noindex. Which one actually wins?
Google applies the most restrictive directive it finds across all sources, so noindex wins regardless of which source it came from. This exact situation happens more than you'd expect: a CMS sets the HTML tag correctly, while a CDN edge rule or a leftover staging config still sends a noindex header, and the page stays out of the index even though the content team did everything right on their end.
What's the difference between meta robots noindex and a robots.txt disallow?
A disallow in robots.txt stops Googlebot from visiting the page at all, but if another site links to it, Google can still index the bare URL with no content behind it, which looks strange in search results. Noindex lets the page get crawled but tells Google to leave it out of the index entirely. For content that genuinely needs to stay out of search, both together is the strongest setup: disallow to stop the crawl, noindex as a backstop if it somehow gets crawled anyway.
If a page has good inbound links, does noindexing it waste that equity?
Yes. A noindexed page doesn't pass its accumulated link equity onward through its own outgoing links, so any authority pointing at it effectively stops there. That's worth weighing before noindexing a page that's already picked up some real inbound links.
What does nosnippet actually do, and is there a softer version?
It strips the description text out of the search listing entirely, even though the page still ranks and appears. If the goal is limiting how much gets shown rather than eliminating it, max-snippet:N caps the snippet at N characters instead of removing it, which is usually the better option.
Does this catch robots tags that JavaScript sets after the page loads?
Yes, through headless browser rendering. Frameworks like Next.js often set robots tags dynamically on the client side, and a crawler that only reads the initial server response would miss those entirely.

Find the Noindex Tag Before It Finds Your Traffic

Check both the HTML and the HTTP header on every page, and get alerted the moment either one turns up a directive that shouldn't be there.

No credit card required Β· Free 7-day trial Β· Cancel anytime