colgate.es
robots.txt

Robots Exclusion Standard data for colgate.es

Resource Scan

Scan Details

Site Domain colgate.es
Base Domain colgate.es
Scan Status Ok
Last Scan 2024-11-05T22:42:36+00:00
Next Scan 2024-12-05T22:42:36+00:00

Last Scan

Scanned 2024-11-05T22:42:36+00:00
URL https://www.colgate.es/robots.txt
Domain IPs 2600:1413:b000:1b::17d7:717, 2600:1413:b000:1b::17d7:71e, 96.17.180.45, 96.17.180.50
Response IP 96.17.180.171
Found Yes
Hash 5696bf450c3a45d8dcd44dec5e9e8e75ab126abe9bcbb7e08bc9262295db6c14
SimHash 2520aa51e780

Groups

*

Rule Path
Disallow */search
Disallow */cp-sites/
Allow */cp-sites/oral-care*
Allow */cp-sites/personal-care*
Allow */cp-sites/home-care*
Disallow */smartlabel
Disallow */smiles/
Allow */smiles/special-offers
Disallow */content/
Disallow *?bvstate=*
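
The rules above mix `Disallow` and `Allow` entries with `*` wildcards, so whether a given path is crawlable depends on which matching rule takes precedence. The following is a minimal sketch of one common interpretation (the longest matching pattern wins, and `Allow` wins a tie, as described in RFC 9309). It is not the scanner's or any particular crawler's implementation. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and the sample paths in the usage note are invented for illustration.

```python
import re

# Rules copied from the "*" group above, in the order listed.
RULES = [
    ("disallow", "*/search"),
    ("disallow", "*/cp-sites/"),
    ("allow", "*/cp-sites/oral-care*"),
    ("allow", "*/cp-sites/personal-care*"),
    ("allow", "*/cp-sites/home-care*"),
    ("disallow", "*/smartlabel"),
    ("disallow", "*/smiles/"),
    ("allow", "*/smiles/special-offers"),
    ("disallow", "*/content/"),
    ("disallow", "*?bvstate=*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, starting at the path's first character.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: the longest matching pattern decides, and Allow beats
    Disallow when two matching patterns have the same length.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed
```

Under these assumptions, a hypothetical path such as `/es-es/cp-sites/oral-care/page` is allowed because the longer `Allow */cp-sites/oral-care*` pattern overrides `Disallow */cp-sites/`, while `/es-es/cp-sites/other` stays blocked. Note that Python's standard `urllib.robotparser` does not interpret `*` wildcards in paths, which is why the sketch does its own matching.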

swiftbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://www.colgate.es/sitemap.xml

Comments

  • internal search
  • Images
  • Disallow Directives
  • sitemaps

Warnings

  • `noindex` is not a known field.