guidesmanuals.com
robots.txt

Robots Exclusion Standard data for guidesmanuals.com

Resource Scan

Scan Details

Site Domain guidesmanuals.com
Base Domain guidesmanuals.com
Scan Status Ok
Last Scan 2025-11-17T23:00:12+00:00
Next Scan 2025-11-24T23:00:12+00:00

Last Scan

Scanned 2025-11-17T23:00:12+00:00
URL https://guidesmanuals.com/robots.txt
Domain IPs 104.21.89.129, 172.67.189.71, 2606:4700:3031::ac43:bd47, 2606:4700:3037::6815:5981
Response IP 104.21.89.129
Found Yes
Hash e1f60280a6c9f96a315d2ca1b37efa626e3bf6545f8aa686080b287196e68530
SimHash 2b0504c0e021

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

*

Rule Path
Disallow /*blackhole
Disallow /?blackhole

*

Rule Path
Disallow /cdn-cgi/

ia_archiver

Rule Path
Disallow /
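The groups above can be checked against sample paths with a short sketch. It assumes a crawler that follows RFC 9309: groups sharing the same user agent are merged, `*` in a path matches any run of characters, a trailing `$` anchors the end, the longest matching pattern wins, and Allow wins a tie. The names `STAR_RULES`, `IA_ARCHIVER_RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and not part of the scan data. Real crawlers may differ in details such as percent-encoding normalization.

```python
import re

# Rules transcribed from the three "*" groups above, merged into one list
# (assumption: the crawler merges same-agent groups, as RFC 9309 describes).
STAR_RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/*blackhole"),
    ("disallow", "/?blackhole"),
    ("disallow", "/cdn-cgi/"),
]

# Rules from the ia_archiver group.
IA_ARCHIVER_RULES = [("disallow", "/")]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is matched literally, starting at the path's first character.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path):
    """Return True if `path` may be crawled under `rules`.

    The longest matching pattern wins; on equal length, Allow beats Disallow.
    A path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    for sample in ["/wp-admin/options.php", "/wp-admin/admin-ajax.php",
                   "/foo/blackhole", "/cdn-cgi/trace", "/manual.pdf"]:
        print(sample, is_allowed(STAR_RULES, sample))
```

Under these assumptions, `/wp-admin/admin-ajax.php` stays crawlable because the Allow pattern is longer than `Disallow /wp-admin/`, while a crawler identifying as ia_archiver would use only its own group and be blocked from every path.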

Other Records

Field Value
sitemap https://www.guidesmanuals.com/sitemap_index.xml
sitemap https://www.bedienungsanleitungpdf.com/sitemap_index.xml
sitemap https://www.manuelgratuit.com/sitemap_index.xml
sitemap https://www.descargarmanual.com/sitemap_index.xml
sitemap https://www.manualguia.com/sitemap_index.xml
sitemap https://www.manualeguida.com/sitemap_index.xml

Comments

  • Custom
  • Prevent crawl errors (CloudFlare)
  • Sitemaps