gcces.ir
robots.txt

Robots Exclusion Standard data for gcces.ir

Resource Scan

Scan Details

Site Domain gcces.ir
Base Domain gcces.ir
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-11-26T05:48:34+00:00
Next Scan 2026-01-25T05:48:34+00:00

Last Successful Scan

Scanned2025-09-27T18:28:11+00:00
URL https://gcces.ir/robots.txt
Domain IPs 104.21.84.89, 172.67.190.147, 2606:4700:3030::ac43:be93, 2606:4700:3032::6815:5459
Response IP 172.67.190.147
Found Yes
Hash c43d15186266b3f95a112d6b1e898daf8fa2017361e663c7fc34ba73e1b168a7
SimHash d26eed4a6e1f

Groups

*

Rule Path
Disallow /*?s=*
Disallow /*?attachment_id=*
Disallow /*?author=*
Disallow /*?cat=*
Disallow /*?tag=*
Disallow /*?page_id=*
Disallow /*?paged=*
Disallow /*?archive=*
Disallow /*?replytocom=*
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /*/page/
Disallow /comments/
Disallow /trackback/
Disallow /feed/
Disallow /search/
Disallow /404/
Disallow /wp-content/cache/
Disallow /wp-content/themes/

Comments

  • Generated for domain: gcces.ir
  • Prevent bots from accessing common WordPress entry points.
  • Disallowing includes directory for security.
  • Restricting plugin access.
  • Blocking pagination for bots.
  • Blocking unnecessary query strings.
  • Caching is disabled.
  • Themes directory is off-limits.