🤖

On-Page SEO · Indexation Control

The Noindex Tag
Nobody Remembers Setting

Q: My HTML meta tag says index, but the X-Robots-Tag header says noindex. Which one actually wins?

Google applies the most restrictive directive across all sources, so noindex wins regardless of whether it came from the HTML or the HTTP header.

Q: What's the difference between meta robots noindex and a robots.txt disallow?

Disallow stops Googlebot from crawling the page, but it can still be indexed as a bare URL if linked elsewhere. Noindex allows crawling but excludes the page from search results. Using both gives the strongest protection.

Q: What does nosnippet actually do, and is there a softer version?

Nosnippet removes the description text from the search listing entirely. max-snippet:N caps the snippet length instead of eliminating it, which is usually the better option.

Q: Does this catch robots tags that JavaScript sets after the page loads?

Yes, through headless browser rendering, which catches robots tags set dynamically by frameworks like Next.js that a static crawler would miss.

Someone ticks "noindex" on staging, ships it to production, and moves on. No error fires. No alert goes off. The page just quietly drops out of Google over the next few days. TechySEO checks both the HTML meta robots tag and the X-Robots-Tag HTTP header on every page, since either one can carry a directive the other doesn't show.

Audit Robots Directives ← All Features

Why It Matters

The Scariest SEO Bugs Are the Ones That Don't Throw Errors

A robots directive is one checkbox, one line in a config file, one CDN edge rule. It's also frighteningly easy to set by accident, especially the classic case: noindex gets ticked on a staging environment, the deploy goes to production with it still checked, and nobody notices because nothing actually breaks. The page just stops showing up in search over the following days, with no error log entry and no obvious signal pointing back to the cause.

Other directives fail more quietly still. Page-level nofollow blocks every outgoing link from passing equity, which wastes part of your internal link structure without anyone noticing the page is doing it. Noarchive hides the cached version. Nosnippet strips the description out of the search result entirely, so the page still ranks but the listing looks bare next to competitors, and click-through rate suffers for a reason that's invisible unless you go looking for it.

Checking only the HTML tag misses half the picture. A directive set through an X-Robots-Tag HTTP header, often by a CDN rule or server config nobody on the content team even knows exists, won't show up by viewing page source. Both sources need checking together.

🚫

A Staging Setting That Followed the Deploy

One leftover noindex tag and an important page disappears from Google within days, with nothing in any log to flag it.

🔒

Page-Wide Nofollow

Every outgoing link on the page stops passing equity, silently, with no visible symptom on the page itself.

📦

Noarchive and Nosnippet

The page still ranks, just with a worse-looking listing and a lower click-through rate than it should have.

⚙️

A Directive Set in a CDN Rule, Not the CMS

An X-Robots-Tag header from edge config won't show up in page source, so a content team checking the HTML alone will see nothing wrong.

What It Checks

Both Places a Directive Can Hide

The HTML tag everyone checks, and the HTTP header almost nobody remembers to.

✅

Confirms Key Pages Are Actually Indexable

Homepage, product pages, landing pages, checked to confirm none of them are quietly excluded from search.

🚫

Every Noindex Tag, Listed

Filterable by URL pattern, so spotting the accidental ones among the intentional ones doesn't mean scrolling through the whole list by hand.

🔒

Page-Level Nofollow

Flags pages where every outgoing link is blocked from passing equity, a setting that has zero visible symptoms on the page itself.

📦

Noarchive and Nosnippet

Catches the directives that don't deindex a page, just make its search listing worse than it needs to be.

⚙️

Stacked Directive Combinations

"noindex, noarchive, nosnippet" all on one page gets called out as a set, since combinations can interact in ways a single directive review would miss.

🌐

X-Robots-Tag HTTP Header

Reads directives set at the server or CDN level, which is invisible to anyone just viewing page source and is exactly where edge-config robots rules tend to live.

How It Works

How the Robots Directive Audit Runs

Both Sources Get Read on Every Page

Meta robots from the HTML and X-Robots-Tag from the HTTP response header, including pages where the tag only exists after JavaScript renders.

Every Directive Gets Parsed and Labeled

Noindex, nofollow, noarchive, nosnippet, max-snippet, max-image-preview, and whatever combination of them shows up together.

Noindexed Pages and Real Conflicts Get Flagged

Any page set to noindex stands out immediately. So does a page where, say, the HTML says one thing and the header says another.

Review the Exact Tag, Then Verify the Fix

Each flagged page shows precisely which tag, from which source, is causing the problem. Remove it, recheck, and confirm Google can actually index the page again.

Use Cases

When Meta Robots Auditing Is Most Critical

Post-Migration

The First Check After Any Migration

Run this before anything else after a launch. It's the fastest way to catch a staging noindex tag that survived the move to production, while there's still time to fix it before Google reprocesses the pages and acts on it.

CMS Platforms

When an Editor Clicks the Wrong Checkbox

WordPress and similar platforms put a noindex toggle right in the editor UI, available to people who write content, not necessarily people thinking about indexation. Continuous monitoring catches the moment that checkbox gets ticked in production, instead of weeks later when traffic has already dropped.

eCommerce

Making Sure the Indexation Strategy Actually Got Implemented

Pagination, filter combinations, and internal search results all need a deliberate index or noindex call. Having the actual directive on every one of those pages in front of you is the only way to confirm the strategy you decided on is the one that's actually live.

FAQ

Meta Robots Analysis — FAQs

My HTML meta tag says index, but the X-Robots-Tag header says noindex. Which one actually wins?

Google applies the most restrictive directive it finds across all sources, so noindex wins regardless of which source it came from. This exact situation happens more than you'd expect: a CMS sets the HTML tag correctly, while a CDN edge rule or a leftover staging config still sends a noindex header, and the page stays out of the index even though the content team did everything right on their end.

What's the difference between meta robots noindex and a robots.txt disallow?

A disallow in robots.txt stops Googlebot from visiting the page at all, but if another site links to it, Google can still index the bare URL with no content behind it, which looks strange in search results. Noindex lets the page get crawled but tells Google to leave it out of the index entirely. For content that genuinely needs to stay out of search, both together is the strongest setup: disallow to stop the crawl, noindex as a backstop if it somehow gets crawled anyway.

If a page has good inbound links, does noindexing it waste that equity?

Yes. A noindexed page doesn't pass its accumulated link equity onward through its own outgoing links, so any authority pointing at it effectively stops there. That's worth weighing before noindexing a page that's already picked up some real inbound links.

What does nosnippet actually do, and is there a softer version?

It strips the description text out of the search listing entirely, even though the page still ranks and appears. If the goal is limiting how much gets shown rather than eliminating it, max-snippet:N caps the snippet at N characters instead of removing it, which is usually the better option.

Does this catch robots tags that JavaScript sets after the page loads?

Yes, through headless browser rendering. Frameworks like Next.js often set robots tags dynamically on the client side, and a crawler that only reads the initial server response would miss those entirely.

Menu

The Noindex Tag
Nobody Remembers Setting

The Scariest SEO Bugs Are the Ones That Don't Throw Errors

Both Places a Directive Can Hide

How the Robots Directive Audit Runs

When Meta Robots Auditing Is Most Critical

Meta Robots Analysis — FAQs

Find the Noindex Tag Before It Finds Your Traffic

Menu

The Noindex TagNobody Remembers Setting

The Scariest SEO Bugs Are the Ones That Don't Throw Errors

Both Places a Directive Can Hide

How the Robots Directive Audit Runs

When Meta Robots Auditing Is Most Critical

Meta Robots Analysis — FAQs

Find the Noindex Tag Before It Finds Your Traffic

More Indexation Control Features

The Noindex Tag
Nobody Remembers Setting