desmaint.com
robots.txt

Robots Exclusion Standard data for desmaint.com

Resource Scan

Scan Details

Site Domain desmaint.com
Base Domain desmaint.com
Scan Status Ok
Last Scan2026-01-28T07:25:37+00:00
Next Scan 2026-02-27T07:25:37+00:00

Last Scan

Scanned2026-01-28T07:25:37+00:00
URL https://desmaint.com/robots.txt
Domain IPs 192.95.20.61
Response IP 192.95.20.61
Found Yes
Hash 958559134a0292f93ce090ef46a7989906898663ef7b52f39abab07ea4684387
SimHash 0b0e9840f282

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /*/feed/

adsbot-google

Rule Path
Disallow /wp-admin/

nutch

Rule Path
Disallow /

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

pinterest

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.desmaint.com/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • Google adsbot ignores robots.txt unless specifically named!
  • ---------------------------
  • END YOAST BLOCK