cricexec.com
robots.txt

Robots Exclusion Standard data for cricexec.com

Resource Scan

Scan Details

Site Domain cricexec.com
Base Domain cricexec.com
Scan Status Ok
Last Scan2026-02-06T02:29:42+00:00
Next Scan 2026-03-08T02:29:42+00:00

Last Scan

Scanned2026-02-06T02:29:42+00:00
URL https://cricexec.com/robots.txt
Domain IPs 104.26.0.113, 104.26.1.113, 172.67.68.138, 2606:4700:20::681a:171, 2606:4700:20::681a:71, 2606:4700:20::ac43:448a
Response IP 104.26.1.113
Found Yes
Hash 63a131c415245b774b4d9e3c734a097b77275112266e0ed9cbbaeb955956407e
SimHash f9195841c549

Groups

*

Rule Path
Disallow /feed/
Disallow */feed/
Disallow /comments/feed/
Disallow /tag/
Disallow /*?c=
Disallow /wp-admin/
Disallow /*?filter_by=

gptbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.cricexec.com/sitemap_index.xml

Comments

  • Block AI and lesser-used crawlers
  • Optional: If you decide to allow SEO tools in future
  • Just remove or comment out their blocks
  • Sitemap (recommended even if added in GSC)